993 results for conserved noncoding sequence
Abstract:
The complete amino acid sequence of myotoxin II (godMT-II), a myotoxic phospholipase A(2) (PLA(2)) homologue from the venom of the Central American crotaline snake Cerrophidion (Bothrops) godmani, was determined by direct protein sequencing methods. GodMT-II is a class II PLA(2), showing a Lys instead of Asp at position 49. An additional substitution in the calcium binding loop region (Asn instead of Tyr at position 28) suggests that the lack of enzymatic activity observed in this toxin is due to loss of its ability to bind the co-factor Ca2+, since the residues involved in forming the catalytic network of PLA(2)s (His-48, Tyr-52 and Asp-99) are conserved in godMT-II. This myotoxin shows highest sequence homology with other Lys-49 PLA(2)s from Bothrops, Agkistrodon and Trimeresurus species, suggesting that they constitute a conserved family of proteins, yet in contrast presents lower homology with Bothrops asper myotoxin III, a catalytically active PLA(2). The C-terminal region of godMT-II, which is rich in cationic and hydrophobic residues, shares high sequence homology with the corresponding region in myotoxin II from B. asper, which has been proposed to play an important role in the Ca2+-independent membrane-damaging activity. (C) 1998 Elsevier B.V. All rights reserved.
Abstract:
BaP1 is a 22.7-kD P-I-type zinc-dependent metalloproteinase isolated from the venom of the snake Bothrops asper, a medically relevant species in Central America. This enzyme exerts multiple tissue-damaging activities, including hemorrhage, myonecrosis, dermonecrosis, blistering, and edema. BaP1 is a single chain of 202 amino acids that shows highest sequence identity with metalloproteinases isolated from the venoms of snakes of the subfamily Crotalinae. It has six Cys residues involved in three disulfide bridges (Cys 117-Cys 197, Cys 159-Cys 181, Cys 157-Cys 164). It has the consensus sequence H(142)E(143)XXH(146)XXGXXH(152), as well as the sequence C164I165M166, which characterize the metzincin superfamily of metalloproteinases. The active-site cleft separates a major subdomain (residues 1-152), comprising four α-helices and a five-stranded β-sheet, from the minor subdomain, which is formed by a single α-helix and several loops. The catalytic zinc ion is coordinated by the N-epsilon2 nitrogen atoms of His 142, His 146, and His 152, in addition to a solvent water molecule, which in turn is bound to Glu 143. Several conserved residues contribute to the formation of the hydrophobic pocket, and Met 166 serves as a hydrophobic base for the active-site groups. Sequence and structural comparisons of hemorrhagic and nonhemorrhagic P-I metalloproteinases from snake venoms revealed differences in several regions. In particular, the loop comprising residues 153 to 176 has marked structural differences between metalloproteinases with very different hemorrhagic activities. Because this region lies in close proximity to the active-site microenvironment, it may influence the interaction of these enzymes with physiologically relevant substrates in the extracellular matrix.
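The zinc-binding consensus quoted in this abstract, H-E-X-X-H-X-X-G-X-X-H with X standing for any residue, can be read as a simple pattern over a protein sequence. The following is only a minimal sketch of how such a motif might be located with a regular expression; the function name and the toy sequence are hypothetical illustrations, and the toy sequence is not the BaP1 sequence.

```python
import re

# Consensus from the abstract: H E X X H X X G X X H (X = any residue).
# A lookahead is used so that overlapping occurrences would also be reported.
ZINC_MOTIF = re.compile(r"(?=HE..H..G..H)")

def find_zinc_motifs(seq):
    """Return the 1-based start position of every motif occurrence in seq."""
    return [m.start() + 1 for m in ZINC_MOTIF.finditer(seq.upper())]

# Hypothetical toy sequence for illustration only (NOT the BaP1 sequence).
toy = "MKT" + "HEMGHNLGMEH" + "DGKCIM"
positions = find_zinc_motifs(toy)
print(positions)
```

In a real protein the reported start position would correspond to the first histidine of the motif (His 142 in BaP1, according to the abstract), with the remaining two histidines four and ten residues downstream.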
Abstract:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Abstract:
With the aim of further understanding the structure/function relationships in the membrane-damaging activity of the Lys(49) phospholipase A(2) (Lys(49)-PLA(2)) sub-family, we used PCR (polymerase chain reaction) on total venom gland cDNAs from Bothrops jararacussu with degenerate oligodeoxyribonucleotides encoding the N- and C-termini of myotoxin II, a Lys(49)-PLA(2) from Bothrops asper. A 350-bp cDNA coding for bothropstoxin I (BtxtxI) was amplified. Sequencing of the amplified fragment shows that BtxtxI has a Lys(49), and comparison with the known structure of myotoxin II showed that the amino acids involved in the formation of a novel dimeric structure in this protein were also conserved.
Sequence, evolution and ligand binding properties of mammalian Duffy antigen/receptor for chemokines
Abstract:
The Duffy antigen/receptor for chemokines, DARC, acts as a widely expressed promiscuous chemokine receptor and as the erythrocyte receptor for Plasmodium vivax. To gain insight into the evolution and structure/function relations of DARC, we analyzed the binding of anti-human Fy monoclonal antibodies (mAbs) and human chemokines to red blood cells (RBCs) from 11 nonhuman primates and two nonprimate mammals, and we elucidated the structures of the DARC genes from gorilla, gibbon, baboon, marmoset, tamarin, night monkey and cattle. CXCL-8 and CCL-5 chemokine binding analysis indicated that the promiscuous binding profile characteristic of DARC is conserved across species. Among three mAbs that detected the Fy6 epitope by flow cytometric analysis of human and chimpanzee RBCs, only one reacted with night monkey and squirrel monkey. Only chimpanzee RBCs bound a significant amount of the anti-Fy3 mAb. Fy3 was also poorly detected on RBCs from gorilla, baboon and rhesus monkey, but not from New World monkeys. Alignment of DARC homologous sequences allowed us to construct a phylogenetic tree in which all branchings were in accordance with current knowledge of primate phylogeny. Although DARC was expected to be under strong internal and external selection pressure, in order to maintain chemokine binding and avoid Plasmodium vivax binding, respectively, our present study did not provide arguments in favor of a selection pressure on the extracellular domains involved in ligand specificity. The amino acid variability of DARC-like polypeptides was found to be well correlated with the hydrophilicity indexes, with the highest divergence on the amino-terminal extracellular domain. Analysis of the deduced amino acid sequences highlighted the conservation of some amino acid residues, which should prove to be critical for the structural and functional properties of DARC.
Abstract:
The complete nucleotide sequence of the genomic RNA 1 (8745 nt) and RNA 2 (4986 nt) of Citrus leprosis virus cytoplasmic type (CiLV-C) was determined using cloned cDNA. RNA 1 contains two open reading frames (ORFs), which correspond to 286 and 29 kDa proteins. The 286 kDa protein is a polyprotein putatively involved in virus replication, which contains four conserved domains: methyltransferase, protease, helicase and polymerase. RNA 2 contains four ORFs corresponding to 15, 61, 32 and 24 kDa proteins, respectively. The 32 kDa protein is apparently involved in cell-to-cell movement of the virus, but none of the other putative proteins exhibits any conserved domain. The 5' regions of the two genomic RNAs contain a 'cap' structure, and poly(A) tails were identified at the 3' termini. Sequence analyses and searches for structural and non-structural protein similarities revealed conserved domains with members of the genera Furovirus, Bromovirus, Tobravirus and Tobamovirus, although phylogenetic analyses strongly suggest that CiLV-C is a member of a distinct, novel virus genus and family, and definitely demonstrate that it does not belong to the family Rhabdoviridae, as previously proposed. Based on these results it was proposed that Citrus leprosis virus be considered as the type member of a new genus of viruses, Cilevirus.
Abstract:
Sixty-five accessions of the species-rich freshwater red algal order Batrachospermales were characterized through DNA sequencing of two regions: the mitochondrial cox1 gene (664 bp), which is proposed as the DNA barcode for red algae, and the UPA (universal plastid amplicon) marker (370 bp), which has been recently identified as a universally amplifying region of the plastid genome. UPGMA phenograms of both markers were consistent in their species-level relationships, although levels of sequence divergence were very different. Intraspecific variation of morphologically identified accessions for the cox1 gene ranged from 0 to 67 bp (divergences were highest for the two taxa with the greatest number of accessions, Batrachospermum helminthosum and Batrachospermum macrosporum), while in contrast the more conserved universal plastid amplicon exhibited much lower intraspecific variation (generally 0-3 bp). Comparisons to previously published mitochondrial cox2-3 spacer sequences for B. helminthosum indicated that the cox1 gene and cox2-3 spacer were characterized by similar levels of sequence divergence, and phylogeographic patterns based on these two markers were consistent. The two taxa represented by the largest numbers of specimens (B. helminthosum and B. macrosporum) have cox1 intraspecific divergence values that are substantially higher than previously reported, but no morphological differences can be discerned at this time among the intraspecific groups revealed in the analyses. DNA barcode data, which are based on a short fragment of an organellar genome, need to be interpreted in conjunction with other taxonomic characters, and additional batrachospermalean taxa need to be analyzed in detail to be able to draw generalities regarding intraspecific variation in this order. Nevertheless, these analyses reveal a number of batrachospermalean taxa worthy of more detailed DNA barcode study, and it is predicted that such research will have a substantial effect on the taxonomy of species within the Batrachospermales in the future.
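The intraspecific variation figures in this abstract (for example 0 to 67 bp over the 664-bp cox1 region) are counts of differing positions between aligned sequences. As a rough illustration of that kind of count, and not the authors' actual pipeline, the sketch below tallies mismatches between two pre-aligned sequences and converts them to an uncorrected proportion of differing sites. The function names, the gap-skipping convention and the toy sequences are assumptions made for this example.

```python
def pairwise_differences(a, b):
    """Count mismatched positions between two aligned sequences.

    Assumption: positions where either sequence has a gap ('-') are skipped.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(1 for x, y in zip(a, b) if x != "-" and y != "-" and x != y)

def p_distance(a, b):
    """Uncorrected proportion of differing sites among compared positions."""
    compared = sum(1 for x, y in zip(a, b) if x != "-" and y != "-")
    return pairwise_differences(a, b) / compared if compared else 0.0

# Hypothetical toy alignment, not data from the study.
seq1 = "ACGTACGT"
seq2 = "ACGTTCGA"
print(pairwise_differences(seq1, seq2), p_distance(seq1, seq2))
```

On this reading, 67 differing positions over 664 bp would correspond to roughly 10% uncorrected divergence, assuming all 664 positions were compared.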
Abstract:
The eukaryotic translation initiation factor 2 (eIF2) binds the methionyl-initiator tRNA in a GTP-dependent manner. This complex associates with the 40 S ribosomal particle, which then, with the aid of other factors, binds to the 5' end of the mRNA and migrates to the first AUG codon, where eIF5 promotes GTP hydrolysis, followed by the formation of the 80 S ribosome. Here we provide a comparative sequence analysis of the β subunit of eIF2 and its archaeal counterpart (aIF2β). aIF2β differs from eIF2β in not possessing an N-terminal extension implicated in binding RNA, eIF5 and eIF2B. The remaining sequences are highly conserved, and are shared with eIF5. Previously isolated mutations in the yeast eIF2β, which allow initiation of translation at UUG codons due to the uncovering of an intrinsic GTPase activity in eIF2, involve residues that are conserved in aIF2β, but not in eIF5. We show that the sequence of eIF2β homologous to aIF2β is sufficient for binding eIF2γ, the only subunit with which it interacts, and comprises, at the most, 78 residues. eIF5 does not interact with eIF2γ, despite its similarity with eIF2β, probably because of a gap in homology in this region. These observations have implications for the evolution of the mechanism of translation initiation.
Abstract:
Xylella fastidiosa is a fastidious, xylem-limited bacterium that causes a range of economically important plant diseases. Here we report the complete genome sequence of X. fastidiosa clone 9a5c, which causes citrus variegated chlorosis - a serious disease of orange trees. The genome comprises a 52.7% GC-rich 2,679,305-base-pair (bp) circular chromosome and two plasmids of 51,158 bp and 1,285 bp. We can assign putative functions to 47% of the 2,904 predicted coding regions. Efficient metabolic functions are predicted, with sugars as the principal energy and carbon source, supporting existence in the nutrient-poor xylem sap. The mechanisms associated with pathogenicity and virulence involve toxins, antibiotics and ion sequestration systems, as well as bacterium-bacterium and bacterium-host interactions mediated by a range of proteins. Orthologues of some of these proteins have only been identified in animal and human pathogens; their presence in X. fastidiosa indicates that the molecular basis for bacterial pathogenicity is both conserved and independent of host. At least 83 genes are bacteriophage-derived and include virulence-associated genes from other bacteria, providing direct evidence of phage-mediated horizontal gene transfer.
Abstract:
T-cell based vaccine approaches have emerged to counteract HIV-1/AIDS. Broad, polyfunctional and cytotoxic CD4(+) T-cell responses have been associated with control of HIV-1 replication, which supports the inclusion of CD4(+) T-cell epitopes in vaccines. A successful HIV-1 vaccine should also be designed to overcome viral genetic diversity and be able to confer immunity in a high proportion of immunized individuals from a diverse HLA-bearing population. In this study, we rationally designed a multiepitopic DNA vaccine in order to elicit broad and cross-clade CD4(+) T-cell responses against highly conserved and promiscuous peptides from the HIV-1 M-group consensus sequence. We identified 27 conserved, multiple HLA-DR-binding peptides in the HIV-1 M-group consensus sequences of Gag, Pol, Nef, Vif, Vpr, Rev and Vpu using the TEPITOPE algorithm. The peptides bound in vitro to an average of 12 out of the 17 tested HLA-DR molecules and also to several molecules such as HLA-DP, -DQ and murine IA(b) and IA(d). Sixteen out of the 27 peptides were recognized by PBMC from patients infected with different HIV-1 variants and 72% of such patients recognized at least 1 peptide. Immunization with a DNA vaccine (HIVBr27) encoding the identified peptides elicited IFN-gamma secretion against 11 out of the 27 peptides in BALB/c mice; CD4(+) and CD8(+) T-cell proliferation was observed against 8 and 6 peptides, respectively. HIVBr27 immunization elicited cross-clade T-cell responses against several HIV-1 peptide variants. Polyfunctional CD4(+) and CD8(+) T cells, able to simultaneously proliferate and produce IFN-gamma and TNF-alpha, were also observed. This vaccine concept may cope with HIV-1 genetic diversity as well as provide increased population coverage, which are desirable features for an efficacious strategy against HIV-1/AIDS.
Abstract:
Coccidiosis of the domestic fowl is a worldwide disease caused by seven species of protozoan parasites of the genus Eimeria. The genome of the model species, Eimeria tenella, presents a complexity of 55-60 MB distributed in 14 chromosomes. Relatively few studies have been undertaken to unravel the complexity of the transcriptome of Eimeria parasites. We report here the generation of more than 45,000 open reading frame expressed sequence tag (ORESTES) cDNA reads of E. tenella, Eimeria maxima and Eimeria acervulina, covering several developmental stages: unsporulated oocysts, sporoblastic oocysts, sporulated oocysts, sporozoites and second generation merozoites. All reads were assembled to constitute gene indices and submitted to a comprehensive functional annotation pipeline. In the case of E. tenella, we also incorporated publicly available ESTs to generate an integrated body of information. Orthology analyses have identified genes conserved across different apicomplexan parasites, as well as genes restricted to the genus Eimeria. Digital expression profiles obtained from ORESTES/EST counts, submitted to clustering analyses, revealed a high conservation pattern across the three Eimeria spp. Distance trees showed that unsporulated and sporoblastic oocysts constitute a distinct clade in all species, with sporulated oocysts forming a more external branch. This latter stage also shows a close relationship with sporozoites, whereas first and second generation merozoites are more closely related to each other than to sporozoites. The profiles were unambiguously associated with the distinct developmental stages and strongly correlated with the order of the stages in the parasite life cycle. Finally, we present The Eimeria Transcript Database (http://www.coccidia.icb.usp.br/eimeriatdb), a website that provides open access to all sequencing data, annotation and comparative analysis. We expect this repository to represent a useful resource to the Eimeria scientific community, helping to define potential candidates for the development of new strategies to control coccidiosis of the domestic fowl. (C) 2011 Australian Society for Parasitology Inc. Published by Elsevier Ltd. All rights reserved.
Abstract:
Background: The mitochondrial DNA of kinetoplastid flagellates is distinctive in the eukaryotic world due to its massive size, complex form and large sequence content. Comprised of catenated maxicircles that contain rRNA and protein-coding genes and thousands of heterogeneous minicircles encoding small guide RNAs, the kinetoplast network has evolved along with an extreme form of mRNA processing in the form of uridine insertion and deletion RNA editing. Many maxicircle-encoded mRNAs cannot be translated without this post-transcriptional sequence modification. Results: We present the complete sequence and annotation of the Trypanosoma cruzi maxicircles for the CL Brener and Esmeraldo strains. Gene order is syntenic with Trypanosoma brucei and Leishmania tarentolae maxicircles. The non-coding components have strain-specific repetitive regions and a variable region that is unique for each strain with the exception of a conserved sequence element that may serve as an origin of replication, but shows no sequence identity with L. tarentolae or T. brucei. Alternative assemblies of the variable region demonstrate intra-strain heterogeneity of the maxicircle population. The extent of mRNA editing required for particular genes approximates that seen in T. brucei. Extensively edited genes were more divergent among the genera than non-edited and rRNA genes. Esmeraldo contains a unique 236-bp deletion that removes the 5'-ends of ND4 and CR4 and the intergenic region. Esmeraldo shows additional insertions and deletions outside of areas edited in other species in ND5, MURF1, and MURF2, while CL Brener has a distinct insertion in MURF2. Conclusion: The CL Brener and Esmeraldo maxicircles represent two of three previously defined maxicircle clades and promise utility as taxonomic markers. Restoration of the disrupted reading frames might be accomplished by strain-specific RNA editing. Elements in the non-coding region may be important for replication, transcription, and anchoring of the maxicircle within the kinetoplast network.
Abstract:
Background: One of the least common types of alternative splicing is the complete retention of an intron in a mature transcript. Intron retention (IR) is believed to be the result of intron, rather than exon, definition associated with failure of the recognition of weak splice sites flanking short introns. Although studies on individual retained introns have been published, few systematic surveys of large amounts of data have been conducted on the mechanisms that lead to IR. Results: To understand how sequence features are associated with or control IR, and to produce a generalized model that could reveal previously unknown signals that regulate this type of alternative splicing, we partitioned intron retention events observed in human cDNAs into two groups based on the relative abundance of both isoforms and compared relevant features. We found that a higher frequency of IR in human is associated with individual introns that have weaker splice sites, genes with shorter intron lengths, higher expression levels and lower density of both a set of exon splicing silencers (ESSs) and the intronic splicing enhancer GGG. Both groups of retained introns presented events conserved in mouse, in which the retained introns were also short and presented weaker splice sites. Conclusion: Although our results confirmed that weaker splice sites are associated with IR, they showed that this feature alone cannot explain a non-negligible fraction of events. Our analysis suggests that cis-regulatory elements are likely to play a crucial role in regulating IR and also reveals previously unknown features that seem to influence its occurrence. These results highlight the importance of considering the interplay among these features in the regulation of the relative frequency of IR.
Abstract:
Background: Pancreatic ductal adenocarcinoma (PDAC) is known for its aggressiveness and lack of effective therapeutic options. Thus, improvement in current knowledge of molecular changes associated with pancreatic cancer is urgently needed to explore novel avenues of diagnostics and treatment of this dismal disease. While there is mounting evidence that long noncoding RNAs (lncRNAs) transcribed from intronic and intergenic regions of the human genome may play different roles in the regulation of gene expression in normal and cancer cells, their expression pattern and biological relevance in pancreatic cancer are currently unknown. In the present work we investigated the relative abundance of a collection of lncRNAs in patients' pancreatic tissue samples, aiming at identifying gene expression profiles correlated to pancreatic cancer and metastasis. Methods: Custom 3,355-element spotted cDNA microarrays interrogating protein-coding genes and putative lncRNAs were used to obtain expression profiles from 38 clinical samples of tumor and non-tumor pancreatic tissues. Bioinformatics analyses were performed to characterize structure and conservation of lncRNAs expressed in pancreatic tissues, as well as to identify expression signatures correlated to tissue histology. Strand-specific reverse transcription followed by PCR and qRT-PCR were employed to determine strandedness of lncRNAs and to validate microarray results, respectively. Results: We show that subsets of intronic/intergenic lncRNAs are expressed across tumor and non-tumor pancreatic tissue samples. Enrichment of promoter-associated chromatin marks and over-representation of conserved DNA elements and stable secondary structure predictions suggest that these transcripts are generated from independent transcriptional units and that at least a fraction is under evolutionary selection, and thus potentially functional. Statistically significant expression signatures comprising protein-coding mRNAs and lncRNAs that correlate to PDAC or to pancreatic cancer metastasis were identified. Interestingly, loci harboring intronic lncRNAs differentially expressed in PDAC metastases were enriched in genes associated with the MAPK pathway. Orientation-specific RT-PCR documented that intronic transcripts are expressed in sense, antisense or both orientations relative to protein-coding mRNAs. Differential expression of a subset of intronic lncRNAs (PPP3CB, MAP3K14 and DAPK1 loci) in metastatic samples was confirmed by Real-Time PCR. Conclusion: Our findings reveal sets of intronic lncRNAs expressed in pancreatic tissues whose abundance is correlated to PDAC or metastasis, thus pointing to the potential relevance of this class of transcripts in biological processes related to malignant transformation and metastasis in pancreatic cancer.
Abstract:
Background: Xanthomonads are plant-associated bacteria responsible for diseases on economically important crops. Xanthomonas fuscans subsp. fuscans (Xff) is one of the causal agents of common bacterial blight of bean. In this study, the complete genome sequence of strain Xff 4834-R was determined and compared to other Xanthomonas genome sequences. Results: Comparative genomics analyses revealed core characteristics shared between Xff 4834-R and other xanthomonads including chemotaxis elements, two-component systems, TonB-dependent transporters, secretion systems (from T1SS to T6SS) and multiple effectors. For instance, a repertoire of 29 Type 3 Effectors (T3Es) with two Transcription Activator-Like Effectors was predicted. Mobile elements were associated with major modifications in the genome structure and gene content in comparison to other Xanthomonas genomes. Notably, a deletion of 33 kbp affects flagellum biosynthesis in Xff 4834-R. The presence of a complete flagellar cluster was assessed in a collection of more than 300 strains representing different species and pathovars of Xanthomonas. Five percent of the tested strains presented a deletion in the flagellar cluster and were non-motile. Moreover, half of the Xff strains isolated from the same epidemic as 4834-R were non-motile, and this ratio was conserved in the strains colonizing the next bean seed generations. Conclusions: This work describes the first genome of a Xanthomonas strain pathogenic on bean and reports the existence of non-motile xanthomonads belonging to different species and pathovars. Isolation of such Xff variants from a natural epidemic may suggest that flagellar motility is not a key function for in planta fitness.