984 resultados para Molecular Sequence Annotation


Relevância:

40.00% 40.00%

Publicador:

Resumo:

Manual curation has long been held to be the gold standard for functional annotation of DNA sequence. Our experience with the annotation of more than 20,000 full-length cDNA sequences revealed problems with this approach, including inaccurate and inconsistent assignment of gene names, as well as many good assignments that were difficult to reproduce using only computational methods. For the FANTOM2 annotation of more than 60,000 cDNA clones, we developed a number of methods and tools to circumvent some of these problems, including an automated annotation pipeline that provides high-quality preliminary annotation for each sequence by introducing an uninformative filter that eliminates uninformative annotations, controlled vocabularies to accurately reflect both the functional assignments and the evidence supporting them, and a highly refined, Web-based manual annotation tool that allows users to view a wide array of sequence analyses and to assign gene names and putative functions using a consistent nomenclature. The ultimate utility of our approach is reflected in the low rate of reassignment of automated assignments by manual curation. Based on these results, we propose a new standard for large-scale annotation, in which the initial automated annotations are manually investigated and then computational methods are iteratively modified and improved based on the results of manual curation.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Background: WGS is increasingly used as a first-line diagnostic test for patients with rare genetic diseases such as neurodevelopmental disorders (NDD). Clinical applications require a robust infrastructure to support processing, storage and analysis of WGS data. The identification and interpretation of SVs from WGS data also needs to be improved. Finally, there is a need for a prioritization system that enables downstream clinical analysis and facilitates data interpretation. Here, we present the results of a clinical application of WGS in a cohort of patients with NDD. Methods: We developed highly portable workflows for processing WGS data, including alignment, quality control, and variant calling of SNVs and SVs. A benchmark analysis of state-of-the-art SV detection tools was performed to select the most accurate combination for SV calling. A gene-based prioritization system was also implemented to support variant interpretation. Results: Using a benchmark analysis, we selected the most accurate combination of tools to improve SV detection from WGS data and build a dedicated pipeline. Our workflows were used to process WGS data from 77 NDD patient-parent families. The prioritization system supported downstream analysis and enabled molecular diagnosis in 32% of patients, 25% of which were SVs and suggested a potential diagnosis in 20% of patients, requiring further investigation to achieve diagnostic certainty. Conclusion: Our data suggest that the integration of SNVs and SVs is a main factor that increases diagnostic yield by WGS and show that the adoption of a dedicated pipeline improves the process of variant detection and interpretation.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Guarana seeds have the highest caffeine concentration among plants accumulating purine alkaloids, but in contrast with coffee and tea, practically nothing is known about caffeine metabolism in this Amazonian plant. In this study, the levels of purine alkaloids in tissues of five guarana cultivars were determined. Theobromine was the main alkaloid that accumulated in leaves, stems, inflorescences and pericarps of fruit, while caffeine accumulated in the seeds and reached levels from 3.3% to 5.8%. In all tissues analysed, the alkaloid concentration, whether theobromine or caffeine, was higher in young/immature tissues, then decreasing with plant development/maturation. Caffeine synthase activity was highest in seeds of immature fruit. A nucleotide sequence (PcCS) was assembled with sequences retrieved from the EST database REALGENE using sequences of caffeine synthase from coffee and tea, whose expression was also highest in seeds from immature fruit. The PcCS has 1083bp and the protein sequence has greater similarity and identity with the caffeine synthase from cocoa (BTS1) and tea (TCS1). A recombinant PcCS allowed functional characterization of the enzyme as a bifunctional CS, able to catalyse the methylation of 7-methylxanthine to theobromine (3,7-dimethylxanthine), and theobromine to caffeine (1,3,7-trimethylxanthine), respectively. Among several substrates tested, PcCS showed higher affinity for theobromine, differing from all other caffeine synthases described so far, which have higher affinity for paraxanthine. When compared to previous knowledge on the protein structure of coffee caffeine synthase, the unique substrate affinity of PcCS is probably explained by the amino acid residues found in the active site of the predicted protein.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The epididymis has an important role in the maturation of sperm for fertilization, but little is known about the epididymal molecules involved in sperm modifications during this process. We have previously described the expression pattern for an antigen in epididymal epithelial cells that reacts with the monoclonal antibody (mAb) TRA 54. Immunohistochemical and immunoblotting analyses suggest that the epitope of the epididymal antigen probably involves a sugar moiety that is released into the epididymal lumen in an androgen-dependent manner and subsequently binds to luminal sperm. Using column chromatography, SDS-PAGE with in situ digestion and mass spectrometry, we have identified the protein recognized by mAb TRA 54 in mouse epididymal epithelial cells. The ∼65 kDa protein is part of a high molecular mass complex (∼260 kDa) that is also present in the sperm acrosomal vesicle and is completely released after the acrosomal reaction. The amino acid sequence of the protein corresponded to that of albumin. Immunoprecipitates with anti-albumin antibody contained the antigen recognized by mAb TRA 54, indicating that the epididymal molecule recognized by mAb TRA 54 is albumin. RT-PCR detected albumin mRNA in the epididymis and fertilization assays in vitro showed that the glycoprotein complex containing albumin was involved in the ability of sperm to recognize and penetrate the egg zona pellucida. Together, these results indicate that epididymal-derived albumin participates in the formation of a high molecular mass glycoprotein complex that has an important role in egg fertilization.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

A monomeric basic PLA2 (PhTX-II) of 14149.08 Da molecular weight was purified to homogeneity from Porthidium hyoprora venom. Amino acid sequence by in tandem mass spectrometry revealed that PhTX-II belongs to Asp49 PLA2 enzyme class and displays conserved domains as the catalytic network, Ca2+-binding loop and the hydrophobic channel of access to the catalytic site, reflected in the high catalytic activity displayed by the enzyme. Moreover, PhTX-II PLA2 showed an allosteric behavior and its enzymatic activity was dependent on Ca2+. Examination of PhTX-II PLA2 by CD spectroscopy indicated a high content of alpha-helical structures, similar to the known structure of secreted phospholipase IIA group suggesting a similar folding. PhTX-II PLA2 causes neuromuscular blockade in avian neuromuscular preparations with a significant direct action on skeletal muscle function, as well as, induced local edema and myotoxicity, in mice. The treatment of PhTX-II by BPB resulted in complete loss of their catalytic activity that was accompanied by loss of their edematogenic effect. On the other hand, enzymatic activity of PhTX-II contributes to this neuromuscular blockade and local myotoxicity is dependent not only on enzymatic activity. These results show that PhTX-II is a myotoxic Asp49 PLA2 that contributes with toxic actions caused by P. hyoprora venom.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Telomerase RNAs (TERs) are highly divergent between species, varying in size and sequence composition. Here, we identify a candidate for the telomerase RNA component of Leishmania genus, which includes species that cause leishmaniasis, a neglected tropical disease. Merging a thorough computational screening combined with RNA-seq evidence, we mapped a non-coding RNA gene localized in a syntenic locus on chromosome 25 of five Leishmania species that shares partial synteny with both Trypanosoma brucei TER locus and a putative TER candidate-containing locus of Crithidia fasciculata. Using target-driven molecular biology approaches, we detected a ∼2,100 nt transcript (LeishTER) that contains a 5' spliced leader (SL) cap, a putative 3' polyA tail and a predicted C/D box snoRNA domain. LeishTER is expressed at similar levels in the logarithmic and stationary growth phases of promastigote forms. A 5'SL capped LeishTER co-immunoprecipitated and co-localized with the telomerase protein component (TERT) in a cell cycle-dependent manner. Prediction of its secondary structure strongly suggests the existence of a bona fide single-stranded template sequence and a conserved C[U/C]GUCA motif-containing helix II, representing the template boundary element. This study paves the way for further investigations on the biogenesis of parasite TERT ribonucleoproteins (RNPs) and its role in parasite telomere biology.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The phytopathogenic fungus Moniliophthora perniciosa (Stahel) Aime & Philips-Mora, causal agent of witches' broom disease of cocoa, causes countless damage to cocoa production in Brazil. Molecular studies have attempted to identify genes that play important roles in fungal survival and virulence. In this study, sequences deposited in the M. perniciosa Genome Sequencing Project database were analyzed to identify potential biological targets. For the first time, the ergosterol biosynthetic pathway in M. perniciosa was studied and the lanosterol 14α-demethylase gene (ERG11) that encodes the main enzyme of this pathway and is a target for fungicides was cloned, characterized molecularly and its phylogeny analyzed. ERG11 genomic DNA and cDNA were characterized and sequence analysis of the ERG11 protein identified highly conserved domains typical of this enzyme, such as SRS1, SRS4, EXXR and the heme-binding region (HBR). Comparison of the protein sequences and phylogenetic analysis revealed that the M. perniciosa enzyme was most closely related to that of Coprinopsis cinerea.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This study aimed to evaluate species level taxonomy and phylogenetic relationship among Thorea species in Brazil and other regions of the world using two molecular markers - RUBISCO large subunit plastid gene (rbcL) and nuclear small-subunit ribosomal DNA (SSU rDNA). Three samples of Thorea from Brazil (states of Mato Grosso do Sul and São Paulo) and one sample from Dominican Republic (DR) were sequenced. Analyses based on partial sequences of rbcL (1,282 bp) and complete sequences of SSU (1,752 bp) were essentially congruent and revealed that Thoreales formed a distinct monophyletic clade, which had two major branches with high support, representing the genera Thorea and Nemalionopsis. Thorea clade had four main branches with high support for all analyses, each one representing the species: 1) T. gaudichaudii C. Agardh from Asia (Japan and Philippines) - this clade occurred only in the rbcL analyses; 2) T. violacea Bory from Asia (Japan) and North America (U.S.A. and DR); 3) T. hispida (Thore) Desvaux from Europe (England) and Asia (Japan); 4) a distinct group with the three Brazilian samples (sequence identity: rbcL 97.2%, 1,246 bp; SSU 96.0-98.1%, 1,699-1,720 bp). The Brazilian samples clearly formed a monophyletic clade based on both molecular markers and was interpreted as a separate species, for which we resurrected the name T. bachmannii Pujals. Morphological and molecular evidences indicate that the Thoreales is well-resolved at ordinal and generic levels. In contrast, Thorea species recognized by molecular data require additional characters (e.g. reproductive and chromosome numbers) to allow consistent and reliable taxonomic circumscription aiming at a world revision based on molecular and morphological evidences.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

A novel karyotype with 2n = 50, FN = 48, was described for specimens of Thaptomys collected at Una, State of Bahia, Brazil, which are morphologically indistinguishable from Thaptomys nigrita, 2n = 52, FN = 52, found in other localities. It was hence proposed that the 2n = 50 karyotype could belong to a distinct species, cryptic of Thaptomys nigrita, once chromosomal rearrangements observed, along with the geographic distance, might represent a reproductive barrier between both forms. Phylogenetic analyses using maximum parsimony and maximum likelihood based on partial cytochrome b sequences with 1077 bp were performed, attempting to establish the relationships among the individuals with distinct karyotypes along the geographic distribution of the genus; the sample comprised 18 karyotyped specimens of Thaptomys, encompassing 15 haplotypes, from eight different localities of the Atlantic Rainforest. The intra-generic relationships corroborated the distinct diploid numbers, once both phylogenetic reconstructions recovered two monophyletic lineages, a northeastern clade grouping the 2n = 50 and a southeastern clade with three subclades, grouping the 2n = 52 karyotype. The sequence divergence observed between their individuals ranged from 1.9% to 3.5%.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

At present a complete mtDNA sequence has been reported for only two hymenopterans, the Old World honey bee, Apis mellifera and the sawfly Perga condei. Among the bee group, the tribe Meliponini (stingless bees) has some distinction due to its Pantropical distribution, great number of species and large importance as main pollinators in several ecosystems, including the Brazilian rain forest. However few molecular studies have been conducted on this group of bees and few sequence data from mitochondrial genomes have been described. In this project, we PCR amplified and sequenced 78% of the mitochondrial genome of the stingless bee Melipona bicolor (Apidae, Meliponini). The sequenced region contains all of the 13 mitochondrial protein-coding genes, 18 of 22 tRNA genes, and both rRNA genes (one of them was partially sequenced). We also report the genome organization (gene content and order), gene translation, genetic code, and other molecular features, such as base frequencies, codon usage, gene initiation and termination. We compare these characteristics of M. bicolor to those of the mitochondrial genome of A. mellifera and other insects. A highly biased A+T content is a typical characteristic of the A. mellifera mitochondrial genome and it was even more extreme in that of M. bicolor. Length and compositional differences between M. bicolor and A. mellifera genes were detected and the gene order was compared. Eleven tRNA gene translocations were observed between these two species. This latter finding was surprising, considering the taxonomic proximity of these two bee tribes. The tRNA Lys gene translocation was investigated within Meliponini and showed high conservation across the Pantropical range of the tribe.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Cryptosporidium spp. are important cause of enteric disease in humans, but may also infect animals. This study describes the relative frequency of several Cryptosporidium species found in human specimens from HIV infected patients in the São Paulo municipality obtained from January to July 2007. Sequence analysis of the products of nested-PCR based on small subunit rRNA and Cryptosporidium oocyst wall protein coding genes revealed 17 (63.0%) isolates of C. hominis, four (14.8%) C. parvum, five (18.5%) C. felis and one (3.7%) C. canis. These findings suggest that, in urban environments of Brazil, the cat adapted C. felis may play a potential role in the zoonotic transmission of cryptosporidiosis whereas the anthroponotic transmission of cryptosporidiosis caused by C. hominis seems to predominate.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The availaibilty of chloroplast genome (cpDNA) sequences of Atropa belladonna, Nicotiana sylvestris, N tabacum, N tomentosiformis, Solanum bulbocastanum, S lycopersicum and S tuberosum, which are Solanaceae species, allowed us to analyze the organization of cpSSRs in their genic and intergenic regions In general, the number of cpSSRs in cpDNA ranged from 161 in S tuberosum to 226 in N tabacum, and the number of intergenic cpSSRs was higher than genic cpSSRs The mononucleotide repeats were the most frequent in studied species, but we also identified di-, tri-, tetra-, penta- and hexanucleotide repeats Multiple alignments of all cpSSRs sequence from Solanaceae species made the identification of nucleotide variability possible and the phylogeny was estimated by maximum parsimony Our study showed that the plastome database can be exploited for phylogenetic analyses and biotechnological approaches

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Intergenic spacers of chloroplast DNA (cpDNA) are very useful in phylogenetic and population genetic studies of plant species, to study their potential integration in phylogenetic analysis. The non-coding trnE-trnT intergenic spacer of cpDNA was analyzed to assess the nucleotide sequence polymorphism of 16 Solanaceae species and to estimate its ability to contribute to the resolution of phylogenetic studies of this group. Multiple alignments of DNA sequences of trnE-trnT intergenic spacer made the identification of nucleotide variability in this region possible and the phylogeny was estimated by maximum parsimony and rooted with Convolvulaceae Ipomoea batalas, the most closely related family. Besides, this intergenic spacer was tested for the phylogenetic ability to differentiate taxonomic levels. For this purpose, species from four other families were analyzed and compared with Solanaceae species. Results confirmed polymorphism in the trnE-trnT region at different taxonomic levels.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Background: Cutaneous mycoses are common human infections among healthy and immunocompromised hosts, and the anthropophilic fungus Trichophyton rubrum is the most prevalent microorganism isolated from such clinical cases worldwide. The aim of this study was to determine the transcriptional profile of T. rubrum exposed to various stimuli in order to obtain insights into the responses of this pathogen to different environmental challenges. Therefore, we generated an expressed sequence tag (EST) collection by constructing one cDNA library and nine suppression subtractive hybridization libraries. Results: The 1388 unigenes identified in this study were functionally classified based on the Munich Information Center for Protein Sequences (MIPS) categories. The identified proteins were involved in transcriptional regulation, cellular defense and stress, protein degradation, signaling, transport, and secretion, among other functions. Analysis of these unigenes revealed 575 T. rubrum sequences that had not been previously deposited in public databases. Conclusion: In this study, we identified novel T. rubrum genes that will be useful for ORF prediction in genome sequencing and facilitating functional genome analysis. Annotation of these expressed genes revealed metabolic adaptations of T. rubrum to carbon sources, ambient pH shifts, and various antifungal drugs used in medical practice. Furthermore, challenging T. rubrum with cytotoxic drugs and ambient pH shifts extended our understanding of the molecular events possibly involved in the infectious process and resistance to antifungal drugs.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Stingless bees exhibit extraordinary variation in nest architecture within and among species. To test for phylogenetic association of behavioral traits for species of the Neotropical stingless bee genus Trigona s.s., a phylogenetic hypothesis was generated by combining sequence data of 24 taxa from one mitochondrial (16S rRNA) and four nuclear gene fragments (long-wavelength rhodopsin copy 1 (opsin), elongation factor-1 alpha copy F2, arginine kinase, and 28S rRNA). Fifteen characteristics of the nest architecture were coded and tested for phylogenetic association. Several characters have significant phylogenetic signal, including type of nesting substrate, nest construction material, and hemipterophily, the tending of hemipteroid insects in exchange for sugar excretions. Phylogenetic independent habits encountered in Trigona s.s. include coprophily and necrophagy.