70 resultados para coding sequence
em Biblioteca Digital da Produção Intelectual da Universidade de São Paulo (BDPI/USP)
Resumo:
Despite the wide distribution of transposable elements (TEs) in mammalian genomes, part of their evolutionary significance remains to be discovered. Today there is a substantial amount of evidence showing that TEs are involved in the generation of new exons in different species. In the present study, we searched 22,805 genes and reported the occurrence of TE-cassettes in coding sequences of 542 cow genes using the RepeatMasker program. Despite the significant number (542) of genes with TE insertions in exons only 14 (2.6%) of them were translated into protein, which we characterized as chimeric genes. From these chimeric genes, only the FAST kinase domains 3 (FASTKD3) gene, present on chromosome BTA 20, is a functional gene and showed evidence of the exaptation event. The genome sequence analysis showed that the last exon coding sequence of bovine FASTKD3 is similar to 85% similar to the ART2A retrotransposon sequence. In addition, comparison among FASTKD3 proteins shows that the last exon is very divergent from those of Homo sapiens, Pan troglodytes and Canis familiares. We suggest that the gene structure of bovine FASTKD3 gene could have originated by several ectopic recombinations between TE copies. Additionally, the absence of TE sequences in all other species analyzed suggests that the TE insertion is clade-specific, mainly in the ruminant lineage.
Resumo:
The secreted phospholipases A(2) (sPLA(2)s) are water-soluble enzymes that bind to the surface of both artificial and biological lipid bilayers and hydrolyze the membrane phospholipids. The tissue expression pattern of the human group IID secretory phospholipase A(2) (hsPLA(2)-IID) suggests that the enzyme is involved in the regulation of the immune and inflammatory responses. With an aim to establish an expression system for the hsPLA(2)-IID in Escherichia coli, the DNA-coding sequence for hsPLA(2)-IID was subcloned into the vector pET3a, and expressed as inclusion bodies in E. coli (BL21). A protocol has been developed to refold the recombinant protein in the presence of guanidinium hydrochloride, using a size-exclusion chromatography matrix followed by dilution and dialysis to remove the excess denaturant. After purification by cation-exchange chromatography, far ultraviolet circular dichroism spectra of the recombinant hsPLA(2)-IID indicated protein secondary structure content similar to the homologous human group IIA secretory phospholipase A(2). The refolded recombinant hsPLA(2)-IID demonstrated Ca(2+)-dependent hydrolytic activity, as measuring the release free fatty acid from phospholipid liposomes. This protein expression and purification system may be useful for site-directed mutagenesis experiments of the hsPLA(2)-IID which will advance our understanding of the structure-function relationship and biological effects of the protein. (C) 2009 Elsevier Inc. All rights reserved.
Resumo:
Context: Although numerous reports of mutations in GH1 and GHRHR (GHRH receptor) causing isolated GH deficiency (IGHD) have been published, mutations in GHRH itself have not been hitherto reported but are obvious candidates for GH deficiency. Objective: The aim of this study was to identify mutations in GHRH in a large cohort of patients with IGHD. Patients and Methods: DNA was isolated from 151 patients diagnosed with IGHD at national and international centers. Seventy-two patients fulfilled all the following criteria: severe short stature (height SD score <= -2.5), low peakGHafter stimulation (peak <= 5 ng/ml), eutopic posterior pituitary lobe, and absence of mutations in GH1 and GHRHR and therefore were strong candidates for GHRH mutations. The coding sequence and splice sites of GHRH were amplified by PCR with intronic primers and sequenced. Results: In five of 151 patients (four of 42 from Brazil), the GHRH c. 223 C>T, p. L75F change was identified in heterozygosity. This variant has been previously reported as a polymorphism and is more frequent in African than European and Asian populations. Six allelic variants (five novel) that do not predict change of amino acids or splice sites were identified in five patients: c. 147 C>T, p.S49S, IVS1 -70 G>A, IVS1 -74 T>C, IVS3 -47 del1, and IVS3 +7 G>A/IVS3 + 41 G>A. No functional mutations were found in this cohort. Conclusions: GHRH mutations were not identified in a selected cohort of patients with IGHD, suggesting that, if they exist, they may be an extremely rare cause of IGHD. Other, as-yet-unidentified genetic factors may be implicated in the genetic etiology of IGHD in our cohort. (J Clin Endocrinol Metab 96: E1457-E1460, 2011)
Resumo:
The pathogenic mechanisms of Leptospira interrogans, the causal agent of leptospirosis, remain largely unknown. This is mainly due to the lack of tools for genetically manipulating pathogenic Leptospira species. Thus, homologous recombination between introduced DNA and the corresponding chromosomal locus has never been demonstrated for this pathogen. Leptospiral immunoglobulin-like repeat (Lig) proteins were previously identified as putative Leptospira virulence factors. In this study, a ligB mutant was constructed by allelic exchange in L. interrogans; in this mutant a spectinomycin resistance (Spc(r)) gene replaced a portion of the ligB coding sequence. Gene disruption was confirmed by PCR, immunoblot analysis, and immunofluorescence studies. The ligB mutant did not show decrease virulence compared to the wild-type strain in the hamster model of leptospirosis. In addition, inoculation of rats with the ligB mutant induced persistent colonization of the kidneys. Finally, LigB was not required to mediate bacterial adherence to cultured cells. Taken together, our data provide the first evidence of site-directed homologous recombination in pathogenic Leptospira species. Furthermore, our data suggest that LigB does not play a major role in dissemination of the pathogen in the host and in the development of acute disease manifestations or persistent renal colonization.
Resumo:
Transposon elements are important tools for gene function analysis, for example they can be used to easily create genome-wide collections of insertion mutants. Transposons may also carry sequences coding for an epitope or fluorescent marker useful for protein expression and localization analysis. We have developed three new Tn5-based transposons that incorporate a GFP (green fluorescent protein) coding sequence to generate fusion proteins in the important fungal pathogen Candida albicans. Each transposon also contains the URA3 and Kan(R) genes for yeast and bacterial selection, respectively. After in vitro transposition, the insertional allele is transferred to the chromosomal locus by homologous recombination. Transposons Tn5-CaGFP and Tn5-CaGFP-URA3:FLIP can generate C-terminal truncated GFP fusions. A URA3 flipper recycling cassette was incorporated into the transposon Th5-CaGFP-UFRA3:FLIP. After the induction of Flip recombinase to excise the marker, the heterozygous strain is transformed again in order to obtain a GFP-tagged homozygous strains. In the Tn5-CaGFP-FL transposon the markers are flanked by a rare-cutting enzyme. After in vitro transposition into a plasmid-borne target gene, the markers are eliminated by restriction digestion and religation, resulting in a construct coding for full-length GFP-fusion proteins. This transposon can generate plasmid libraries of GFP insertions in proteins where N- or C-terminal tagging may alter localization. We tested our transposon system by mutagenizing the essential septin CDC3 gene. The results indicate that the Cdc3 C-terminal extension is important for correct septin filament assembly. The transposons described here provide a new system to obtain global gene expression and protein localization data in C. albicans. (c) 2008 Elsevier B.V. All rights reserved.
Resumo:
Context: Mutations in TAC3 and TACR3 (encoding neurokinin B and its receptor) have been identified in Turkish patients with idiopathic hypogonadotropic hypogonadism (IHH), but broader populations have not yet been tested and genotype-phenotype correlations have not been established. Objective: A broad cohort of normosmic IHH probands was screened for mutations in TAC3/TACR3 to evaluate the prevalence of such mutations and define the genotype/phenotype relationships. Design and Setting: The study consisted of sequencing of TAC3/TACR3, in vitro functional assays, and neuroendocrine phenotyping conducted in tertiary care centers worldwide. Patients or Other Participants: 345 probands, 18 family members, and 292 controls were studied. Intervention: Reproductive phenotypes throughout reproductive life and before and after therapy were examined. Main Outcome Measure: Rare sequence variants in TAC3/TACR3 were detected. Results: In TACR3, 19 probands harbored 13 distinct coding sequence rare nucleotide variants [three nonsense mutations, six nonsynonymous, four synonymous (one predicted to affect splicing)]. In TAC3, one homozygous single base pair deletion was identified, resulting in complete loss of the neurokinin B decapeptide. Phenotypic information was available on 16 males and seven females with coding sequence variants in TACR3/TAC3. Of the 16 males, 15 had microphallus; none of the females had spontaneous thelarche. Seven of the 16 males and five of the seven females were assessed after discontinuation of therapy; six of the seven males and four of the five females demonstrated evidence for reversibility of their hypogonadotropism. Conclusions: Mutations in the neurokinin B pathway are relatively common as causes of hypogonadism. Although the neurokinin B pathway appears essential during early sexual development, its importance in sustaining the integrity of the hypothalamic-pituitary-gonadal axis appears attenuated over time. (J Clin Endocrinol Metab 95: 2857-2867, 2010)
Resumo:
Type XVIII collagen is a component of basement membranes, and expressed prominently in the eye, blood vessels, liver, and the central nervous system. Homozygous mutations in COL18A1 lead to Knobloch Syndrome, characterized by ocular defects and occipital encephalocele. However, relatively little has been described on the role of type XVIII collagen in development, and nothing is known about the regulation of its tissue-specific expression pattern. We have used zebrafish transgenesis to identify and characterize cis-regulatory sequences controlling expression of the human gene. Candidate enhancers were selected from non-coding sequence associated with COL18A1 based on sequence conservation among mammals. Although these displayed no overt conservation with orthologous zebrafish sequences, four regions nonetheless acted as tissue-specific transcriptional enhancers in the zebrafish embryo, and together recapitulated the major aspects of col18a1 expression. Additional post-hoc computational analysis on positive enhancer sequences revealed alignments between mammalian and teleost sequences, which we hypothesize predict the corresponding zebrafish enhancers; for one of these, we demonstrate functional overlap with the orthologous human enhancer sequence. Our results provide important insight into the biological function and regulation of COL18A1, and point to additional sequences that may contribute to complex diseases involving COL18A1. More generally, we show that combining functional data with targeted analyses for phylogenetic conservation can reveal conserved cis-regulatory elements in the large number of cases where computational alignment alone falls short. (C) 2009 Elsevier Inc. All rights reserved.
Resumo:
DNA puffs are genomic regions of polytene chromosomes that undergo developmentally controlled DNA amplification and transcription in salivary glands of sciarid flies. Here, we tested the hypothesis that DNA puff genes code for salivary proteins in Trichosia pubescens. To do that, we generated antibodies against saliva and immunoscreened a cDNA library made from salivary glands. We isolated clones corresponding to DNA puff regions, including clone D-50 that contained the entire coding sequence of the previously isolated C4B1 gene from puff 4C. Indeed, we showed that puff 4C is a DNA puff region detecting its local transcription and its extra rounds of DNA incorporation compared to neighboring regions. We further confirmed D-50 clone identity in Western blots reacted with the anti-saliva anitiserum. We detected a recombinant protein expressed by this clone that had the expected size for a full-length product of the gene. We end with a discussion of the relationship between DNA puff genes and their products.
Resumo:
The cold shock response in bacteria involves the expression of low-molecular weight cold shock proteins (CSPs) containing a nucleic acid-binding cold shock domain (CSD), which are known to destabilize secondary structures on mRNAs, facilitating translation at low temperatures. Caulobacter crescentus cspA and cspB are induced upon cold shock, while cspC and cspD are induced during stationary phase. In this work, we determined a new coding sequence for the cspC gene, revealing that it encodes a protein containing two CSDs. The phenotypes of C. crescentus csp mutants were analyzed, and we found that cspC is important for cells to maintain viability during extended periods in stationary phase. Also, cspC and cspCD strains presented altered morphology, with frequent non-viable filamentous cells, and cspCD also showed a pronounced cell death at late stationary phase. In contrast, the cspAB mutant presented increased viability in this phase, which is accompanied by an altered expression of both cspC and cspD, but the triple cspABD mutant loses this characteristic. Taken together, our results suggest that there is a hierarchy of importance among the csp genes regarding stationary phase viability, which is probably achieved by a fine tune balance of the levels of these proteins.
Resumo:
At present a complete mtDNA sequence has been reported for only two hymenopterans, the Old World honey bee, Apis mellifera and the sawfly Perga condei. Among the bee group, the tribe Meliponini (stingless bees) has some distinction due to its Pantropical distribution, great number of species and large importance as main pollinators in several ecosystems, including the Brazilian rain forest. However few molecular studies have been conducted on this group of bees and few sequence data from mitochondrial genomes have been described. In this project, we PCR amplified and sequenced 78% of the mitochondrial genome of the stingless bee Melipona bicolor (Apidae, Meliponini). The sequenced region contains all of the 13 mitochondrial protein-coding genes, 18 of 22 tRNA genes, and both rRNA genes (one of them was partially sequenced). We also report the genome organization (gene content and order), gene translation, genetic code, and other molecular features, such as base frequencies, codon usage, gene initiation and termination. We compare these characteristics of M. bicolor to those of the mitochondrial genome of A. mellifera and other insects. A highly biased A+T content is a typical characteristic of the A. mellifera mitochondrial genome and it was even more extreme in that of M. bicolor. Length and compositional differences between M. bicolor and A. mellifera genes were detected and the gene order was compared. Eleven tRNA gene translocations were observed between these two species. This latter finding was surprising, considering the taxonomic proximity of these two bee tribes. The tRNA Lys gene translocation was investigated within Meliponini and showed high conservation across the Pantropical range of the tribe.
Resumo:
Background: Ticks secrete a cement cone composed of many salivary proteins, some of which are rich in the amino acid glycine in order to attach to their hosts' skin. Glycine-rich proteins (GRPs) are a large family of heterogeneous proteins that have different functions and features; noteworthy are their adhesive and tensile characteristics. These properties may be essential for successful attachment of the metastriate ticks to the host and the prolonged feeding necessary for engorgement. In this work, we analyzed Expressed Sequence Tags (ESTs) similar to GRPs from cDNA libraries constructed from salivary glands of adult female ticks representing three hard, metastriate species in order to verify if their expression correlated with biological differences such as the numbers of hosts ticks feed on during their parasitic life cycle, whether one (monoxenous parasite) or two or more (heteroxenous parasite), and the anatomy of their mouthparts, whether short (Brevirostrata) or long (Longirostrata). These ticks were the monoxenous Brevirostrata tick, Rhipicephalus (Boophilus) microplus, a heteroxenous Brevirostrata tick, Rhipicephalus sanguineus, and a heteroxenous Longirostrata tick, Amblyomma cajennense. To further investigate this relationship, we conducted phylogenetic analyses using sequences of GRPs from these ticks as well as from other species of Brevirostrata and Longirostrata ticks. Results: cDNA libraries from salivary glands of the monoxenous tick, R. microplus, contained more contigs of glycine-rich proteins than the two representatives of heteroxenous ticks, R. sanguineus and A. cajennense (33 versus, respectively, 16 and 11). Transcripts of ESTs encoding GRPs were significantly more numerous in the salivary glands of the two Brevirostrata species when compared to the number of transcripts in the Longirostrata tick. The salivary gland libraries from Brevirostrata ticks contained numerous contigs significantly similar to silks of true spiders (17 and 8 in, respectively, R. microplus and R. sanguineus), whereas the Longirostrata tick contained only 4 contigs. The phylogenetic analyses of GRPs from various species of ticks showed that distinct clades encoding proteins with different biochemical properties are represented among species according to their biology. Conclusions: We found that different species of ticks rely on different types and amounts of GRPs in order to attach and feed on their hosts. Metastriate ticks with short mouthparts express more transcripts of GRPs than a tick with long mouthparts and the tick that feeds on a single host during its life cycle contain a greater variety of these proteins than ticks that feed on several hosts.
Resumo:
Intergenic spacers of chloroplast DNA (cpDNA) are very useful in phylogenetic and population genetic studies of plant species, to study their potential integration in phylogenetic analysis. The non-coding trnE-trnT intergenic spacer of cpDNA was analyzed to assess the nucleotide sequence polymorphism of 16 Solanaceae species and to estimate its ability to contribute to the resolution of phylogenetic studies of this group. Multiple alignments of DNA sequences of trnE-trnT intergenic spacer made the identification of nucleotide variability in this region possible and the phylogeny was estimated by maximum parsimony and rooted with Convolvulaceae Ipomoea batalas, the most closely related family. Besides, this intergenic spacer was tested for the phylogenetic ability to differentiate taxonomic levels. For this purpose, species from four other families were analyzed and compared with Solanaceae species. Results confirmed polymorphism in the trnE-trnT region at different taxonomic levels.
Resumo:
Mycoplasma suis, the causative agent of porcine infectious anemia, has never been cultured in vitro and mechanisms by which it causes disease are poorly understood. Thus, the objective herein was to use whole genome sequencing and analysis of M. suis to define pathogenicity mechanisms and biochemical pathways. M. suis was harvested from the blood of an experimentally infected pig. Following DNA extraction and construction of a paired end library, whole-genome sequencing was performed using GS-FLX (454) and Titanium chemistry. Reads on paired-end constructs were assembled using GS De Novo Assembler and gaps closed by primer walking; assembly was validated by PFGE. Glimmer and Manatee Annotation Engine were used to predict and annotate protein-coding sequences (CDS). The M. suis genome consists of a single, 742,431 bp chromosome with low G+C content of 31.1%. A total of 844 CDS, 3 single copies, unlinked rRNA genes and 32 tRNAs were identified. Gene homologies and GC skew graph show that M. suis has a typical Mollicutes oriC. The predicted metabolic pathway is concise, showing evidence of adaptation to blood environment. M. suis is a glycolytic species, obtaining energy through sugars fermentation and ATP-synthase. The pentose-phosphate pathway, metabolism of cofactors and vitamins, pyruvate dehydrogenase and NAD(+) kinase are missing. Thus, ribose, NADH, NADPH and coenzyme A are possibly essential for its growth. M. suis can generate purines from hypoxanthine, which is secreted by RBCs, and cytidine nucleotides from uracil. Toxins orthologs were not identified. We suggest that M. suis may cause disease by scavenging and competing for host nutrients, leading to decreased life-span of RBCs. In summary, genome analysis shows that M. suis is dependent on host cell metabolism and this characteristic is likely to be linked to its pathogenicity. The prediction of essential nutrients will aid the development of in vitro cultivation systems.
Resumo:
An important topic in genomic sequence analysis is the identification of protein coding regions. In this context, several coding DNA model-independent methods based on the occurrence of specific patterns of nucleotides at coding regions have been proposed. Nonetheless, these methods have not been completely suitable due to their dependence on an empirically predefined window length required for a local analysis of a DNA region. We introduce a method based on a modified Gabor-wavelet transform (MGWT) for the identification of protein coding regions. This novel transform is tuned to analyze periodic signal components and presents the advantage of being independent of the window length. We compared the performance of the MGWT with other methods by using eukaryote data sets. The results show that MGWT outperforms all assessed model-independent methods with respect to identification accuracy. These results indicate that the source of at least part of the identification errors produced by the previous methods is the fixed working scale. The new method not only avoids this source of errors but also makes a tool available for detailed exploration of the nucleotide occurrence.
Resumo:
Phylogenetic analyses of representative species from the five genera of Winteraceae (Drimys, Pseudowintera, Takhtajania, Tasmannia, and Zygogynum s.l.) were performed using ITS nuclear sequences and a combined data-set of ITS + psbA-trnH + rpS16 sequences (sampling of 30 and 15 species, respectively). Indel informativity using simple gap coding or gaps as a fifth character was examined in both data-sets. Parsimony and Bayesian analyses support the monophyly of Drimys, Tasmannia, and Zygogynum s.l., but do not support the monophyly of Belliolum, Zygogynum s.s., and Bubbia. Within Drimys, the combined data-set recovers two subclades. Divergence time estimates suggest that the splitting between Drimys and its sister clade (Pseudowintera + Zygogynum s.l.) occurred around the end of the Cretaceous; in contrast, the divergence between the two subclades within Drimys is more recent (15.5-18.5 MY) and coincides in time with the Andean uplift. Estimates suggest that the earliest divergences within Winteraceae could have predated the first events of Gondwana fragmentation. (C) 2009 Elsevier Inc. All rights reserved.