916 results for Genome-specific Sequence
Abstract:
Whole-genome sequencing (WGS) could potentially provide a single platform for extracting all the information required to predict an organism’s phenotype. However, its ability to provide accurate predictions has not yet been demonstrated in large independent studies of specific organisms. In this study, we aimed to develop a genotypic prediction method for antimicrobial susceptibilities. The whole genomes of 501 unrelated Staphylococcus aureus isolates were sequenced, and the assembled genomes were interrogated using BLASTn for a panel of known resistance determinants (chromosomal mutations and genes carried on plasmids). Results were compared with phenotypic susceptibility testing for 12 commonly used antimicrobial agents (penicillin, methicillin, erythromycin, clindamycin, tetracycline, ciprofloxacin, vancomycin, trimethoprim, gentamicin, fusidic acid, rifampin, and mupirocin) performed by the routine clinical laboratory. We investigated discrepancies by repeat susceptibility testing and manual inspection of the sequences and used this information to optimize the resistance determinant panel and BLASTn algorithm. We then tested performance of the optimized tool in an independent validation set of 491 unrelated isolates, with phenotypic results obtained in duplicate by automated broth dilution (BD Phoenix) and disc diffusion. In the validation set, the overall sensitivity and specificity of the genomic prediction method were 0.97 (95% confidence interval [95% CI], 0.95 to 0.98) and 0.99 (95% CI, 0.99 to 1), respectively, compared to standard susceptibility testing methods. The very major error rate was 0.5%, and the major error rate was 0.7%. WGS was as sensitive and specific as routine antimicrobial susceptibility testing methods. WGS is a promising alternative to culture methods for resistance prediction in S. aureus and ultimately other major bacterial pathogens.
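The accuracy figures in this abstract can be derived from a 2×2 comparison of genotypic prediction against phenotypic testing. The following is a minimal sketch with hypothetical counts, since the abstract does not report the underlying table. It assumes the usual convention that a very major error is a resistant isolate predicted susceptible and a major error is a susceptible isolate predicted resistant. It also assumes that both error rates are expressed over all comparisons, which is what a 0.5% very major error rate alongside a sensitivity of 0.97 suggests.

```python
from dataclasses import dataclass


@dataclass
class Confusion:
    """Counts of genotype-vs-phenotype comparisons (all values hypothetical)."""
    tp: int  # phenotypically resistant, resistance predicted
    fn: int  # phenotypically resistant, susceptibility predicted (very major error)
    tn: int  # phenotypically susceptible, susceptibility predicted
    fp: int  # phenotypically susceptible, resistance predicted (major error)


def summarize(c: Confusion) -> dict:
    """Return sensitivity, specificity and the two error rates."""
    total = c.tp + c.fn + c.tn + c.fp
    return {
        "sensitivity": c.tp / (c.tp + c.fn),
        "specificity": c.tn / (c.tn + c.fp),
        # Assumption: error rates use all comparisons as the denominator.
        "very_major_error_rate": c.fn / total,
        "major_error_rate": c.fp / total,
    }


# Illustrative counts only; they are not taken from the study.
example = Confusion(tp=970, fn=30, tn=4950, fp=50)
metrics = summarize(example)
```

Because resistance is much rarer than susceptibility for most of the agents tested, a small very major error rate over all comparisons can coexist with a noticeably lower sensitivity, so it helps to report both.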
Abstract:
Mathematical ability is heritable, but few studies have directly investigated its molecular genetic basis. Here we aimed to identify specific genetic contributions to variation in mathematical ability. We carried out a genome-wide association scan using pooled DNA in two groups of U.K. samples, defined by achievement in national academic exams at the end of secondary/high school: high (n = 419) versus low (n = 183) mathematical ability, while controlling for verbal ability. We searched for significant differences in allele frequencies between these groups across 906,600 SNPs using the Affymetrix GeneChip Human Mapping version 6.0 array. After meeting a threshold of p < 1.5 × 10⁻⁵, 12 SNPs from the pooled association analysis (lowest p-value 1.14 × 10⁻⁶) were individually genotyped in 542 of the participants and analyzed to validate the initial associations. In this analysis, one of the SNPs (rs789859) showed significant association after Bonferroni correction, and four (rs10873824, rs4144887, rs12130910, rs2809115) were nominally significant (lowest p-value 3.278 × 10⁻⁴). Three of the SNPs of interest are located within, or near to, known genes (FAM43A, SFT2D1, C14orf64). The SNP that showed the strongest association, rs789859, is located in a region on chromosome 3q29 that has previously been linked to learning difficulties and autism. rs789859 lies 1.3 kbp downstream of LSG1 and 700 bp upstream of FAM43A, mapping within the potential promoter/regulatory region of the latter. To our knowledge, this is only the second study to investigate the association of genetic variants with mathematical ability, and it highlights a number of interesting markers for future study.
Abstract:
Salmonella enterica is a zoonotic pathogen of clinical and veterinary significance, with over 2500 serovars. In previous work we compared two serovars displaying host associations inferred from isolation statistics. Here, to validate genome sequence data and to expand on the role of environmental metabolite constitution in host range determination, we use a phenotypic microarray approach to assess the ability of these serovars to metabolise ~500 substrates at 25°C with oxygen (aerobic conditions), to represent the ex vivo environment, and at 37°C with and without oxygen (aerobic/anaerobic conditions), to represent the in vivo environment. A total of 26 substrates elicited a significant difference in the rate of metabolism, of which only one, D-galactonic acid-γ-lactone, could be explained by the presence (S. Mbandaka) or absence (S. Derby) of metabolic genes. We find that S. Mbandaka respires more efficiently at ambient temperatures and under aerobic conditions on 18 substrates, including glucosaminic acid, saccharic acid, trehalose, fumaric acid, maltotriose, N-acetyl-D-glucosamine, N-acetyl-beta-D-mannosamine, fucose, L-serine and dihydroxyacetone, whereas S. Derby is more metabolically competent anaerobically at 37°C on the dipeptides glutamine-glutamine, alanine-lysine and asparagine-glutamine and on the nitrogen sources glycine and nitrite. We conclude that the specific phenotype cannot be reliably predicted from the presence of metabolic genes directly relating to the metabolic pathways under study.
Abstract:
The genome of the soil-dwelling heterotrophic N2-fixing Gram-negative bacterium Azotobacter chroococcum NCIMB 8003 (ATCC 4412) (Ac-8003) has been determined. It consists of 7 circular replicons totalling 5,192,291 bp comprising a circular chromosome of 4,591,803 bp and six plasmids pAcX50a, b, c, d, e, f of 10,435 bp, 13,852, 62,783, 69,713, 132,724, and 311,724 bp respectively. The chromosome has a G+C content of 66.27% and the six plasmids have G+C contents of 58.1, 55.3, 56.7, 59.2, 61.9, and 62.6% respectively. The methylome has also been determined and 5 methylation motifs have been identified. The genome also contains a very high number of transposase/inactivated transposase genes from at least 12 of the 17 recognised insertion sequence families. The Ac-8003 genome has been compared with that of Azotobacter vinelandii ATCC BAA-1303 (Av-DJ), a derivative of strain O, the only other member of the Azotobacteraceae determined so far which has a single chromosome of 5,365,318 bp and no plasmids. The chromosomes show significant stretches of synteny throughout but also reveal a history of many deletion/insertion events. The Ac-8003 genome encodes 4628 predicted protein-encoding genes of which 568 (12.2%) are plasmid borne. 3048 (65%) of these show > 85% identity to the 5050 protein-encoding genes identified in Av-DJ, and of these 99 are plasmid-borne. The core biosynthetic and metabolic pathways and macromolecular architectures and machineries of these organisms appear largely conserved including genes for CO-dehydrogenase, formate dehydrogenase and a soluble NiFe-hydrogenase. The genetic bases for many of the detailed phenotypic differences reported for these organisms have also been identified. Also many other potential phenotypic differences have been uncovered. Properties endowed by the plasmids are described including the presence of an entire aerobic corrin synthesis pathway in pAcX50f and the presence of genes for retro-conjugation in pAcX50c. 
All these findings are related to the potentially different environmental niches from which these organisms were isolated and to emerging theories about how microbes contribute to their communities.
Abstract:
Background: Concerted evolution is normally used to describe parallel changes at different sites in a genome, but it is also observed in languages where a specific phoneme changes to the same other phoneme in many words in the lexicon—a phenomenon known as regular sound change. We develop a general statistical model that can detect concerted changes in aligned sequence data and apply it to study regular sound changes in the Turkic language family. Results: Linguistic evolution, unlike the genetic substitutional process, is dominated by events of concerted evolutionary change. Our model identified more than 70 historical events of regular sound change that occurred throughout the evolution of the Turkic language family, while simultaneously inferring a dated phylogenetic tree. Including regular sound changes yielded an approximately 4-fold improvement in the characterization of linguistic change over a simpler model of sporadic change, improved phylogenetic inference, and returned more reliable and plausible dates for events on the phylogenies. The historical timings of the concerted changes closely follow a Poisson process model, and the sound transition networks derived from our model mirror linguistic expectations. Conclusions: We demonstrate that a model with no prior knowledge of complex concerted or regular changes can nevertheless infer the historical timings and genealogical placements of events of concerted change from the signals left in contemporary data. Our model can be applied wherever discrete elements—such as genes, words, cultural trends, technologies, or morphological traits—can change in parallel within an organism or other evolving group.
Abstract:
Coconut (Cocos nucifera L.) is a major plantation crop that provides income for millions of people in the tropics. Detailed molecular studies of zygotic embryo development would provide valuable clues for identifying molecular markers to improve somatic embryogenesis. Since there is no ongoing genome project for this species, analysis of coconut expressed sequence tags (ESTs) is a useful approach for identifying important embryo-specific genes as well as other functional genes in different biochemical pathways. The goal of this study was to analyse ESTs by examining transcriptome data from different embryo tissue types together with one somatic tissue. Four cDNA libraries were constructed, from immature embryo, mature embryo, microspore-derived embryo and mature leaves. The cDNA was sequenced on the Roche-454 GS-FLX system and assembled into 32621 putative unigenes and 155017 singletons. Of these unigenes, 18651 had significant sequence similarity to entries in the non-redundant protein database, of which 16153 were assigned to one or more gene ontology categories. Homologues of genes involved in embryo development, such as chitinase, beta-1,3-glucanase, ATP synthase CF0 subunit, thaumatin-like protein and metallothionein-like protein, were identified in the embryo EST collection. Of the unigenes, 6694 were mapped to 139 KEGG pathways, including carbohydrate metabolism, energy metabolism, lipid metabolism, amino acid metabolism and nucleotide metabolism. This collection of 454-derived EST data from different tissue types provides a significant resource for genome-wide studies and gene discovery in coconut, a non-model species.
Abstract:
Eukaryotic genome expansion/retraction caused by LTR-retrotransposon activity depends on the expression of full-length copies to trigger efficient transposition and recombination-driven events. The Tnt1 family of retrotransposons has served as a model to evaluate the diversity among closely related elements within Solanaceae species, and members of the family have been found to vary mainly in the U3 region of their long terminal repeats (LTRs). A full-length genomic copy of Retrosol was recovered from the wild potato Solanum oplocense using a PCR-based approach. Further characterization of both LTR sequences of the amplified copy gave an estimated insertion time of approximately 2 million years ago, supporting the occurrence of transposition cycles after genus divergence. The copy number of Tnt1-like elements in Solanum species was determined by genomic quantitative PCR; the results indicate that Retrosol is a low copy number retrotransposon in Solanum species (1-4 copies), while Retrolyc1 has an intermediate copy number (38 copies) in S. peruvianum. Comparative analysis of retrotransposon content revealed no correlation between genome size or ploidy level and Retrosol copy number. The tetraploid cultivated potato, with a cellular genome size of 1,715 Mbp, harbours a copy number per monoploid genome similar to that of diploid Solanum species (613-884 Mbp). Conversely, the S. peruvianum genome (1,125 Mbp) has a higher copy number. These results point towards a lineage-specific dynamic flux in the history of amplification/activity of Tnt1-like elements in the genomes of Solanum species.
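Insertion-time estimates of this kind are commonly obtained from the divergence between the two LTRs of a single element, which are identical at the moment of insertion and then accumulate substitutions independently. The abstract does not state which method or substitution rate the authors used, so the following is only a sketch of the usual calculation, T = K / (2r), with a Jukes-Cantor correction. The function names, the toy sequences and the rate of 1.3e-8 substitutions per site per year are all illustrative assumptions.

```python
import math


def p_distance(ltr5: str, ltr3: str) -> float:
    """Proportion of differing sites between two aligned LTRs, skipping gaps."""
    if len(ltr5) != len(ltr3):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    mismatches = 0
    for a, b in zip(ltr5.upper(), ltr3.upper()):
        if a == "-" or b == "-":
            continue  # ignore alignment gaps
        compared += 1
        if a != b:
            mismatches += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return mismatches / compared


def jukes_cantor(p: float) -> float:
    """Jukes-Cantor corrected distance K for an observed proportion p."""
    if p >= 0.75:
        raise ValueError("p-distance too large for Jukes-Cantor correction")
    return -0.75 * math.log(1.0 - 4.0 * p / 3.0)


def insertion_time_years(ltr5: str, ltr3: str, rate: float) -> float:
    """Estimate insertion time as K / (2 * rate); rate is per site per year."""
    return jukes_cantor(p_distance(ltr5, ltr3)) / (2.0 * rate)


# Toy data: a 100-nt "LTR" and a copy carrying five substitutions.
toy_5prime = "ACGT" * 25
toy_3prime = list(toy_5prime)
for pos in (0, 20, 40, 60, 80):
    toy_3prime[pos] = "G"  # these positions hold "A" in the original
toy_3prime = "".join(toy_3prime)

ASSUMED_RATE = 1.3e-8  # hypothetical substitution rate, not from the abstract
toy_age = insertion_time_years(toy_5prime, toy_3prime, ASSUMED_RATE)
```

The factor of 2 reflects that both LTRs diverge from the same ancestral sequence. The estimate scales inversely with the assumed rate, so any such date is only as reliable as the rate chosen for the lineage.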
Abstract:
Hepatitis C virus (HCV) infection frequently persists despite substantial virus-specific immune responses and combination therapy with pegylated interferon (IFN)-alpha and ribavirin. Major histocompatibility complex class I restricted CD8+ T cells are responsible for the control of viraemia in HCV infection, and several studies suggest that specific HLA alleles are associated with protection against viral infection. The reason for the low rates of sustained viral response (SVR) in HCV patients remains unknown. Escape mutations in response to cytotoxic T lymphocytes are widely described; however, their influence on treatment outcome is poorly understood. Here, we investigate differences in the frequencies of CD8 epitopes from the Los Alamos database between groups of patients who responded differently to pegylated IFN-alpha with ribavirin therapy, and we test for evidence of natural selection on the virus in those who failed treatment, using five maximum likelihood evolutionary models from the PAML package. The sustained virological responder group showed three epitopes at higher frequencies than the non-responder group, all with statistical support, and we observed evidence of selection pressure in the latter group. No escape mutation was observed. Interestingly, the epitope VLSDFKTWL was 100% conserved in the SVR group. These results suggest that the response to treatment can be explained by the increase in immune pressure induced by interferon therapy, and that the presence of those epitopes may be an important factor in determining the outcome of therapy.
Abstract:
ZNF630 is a member of the primate-specific Xp11 zinc finger gene cluster that consists of six closely related genes, of which ZNF41, ZNF81, and ZNF674 have been shown to be involved in mental retardation. This suggests that mutations of ZNF630 might influence cognitive function. Here, we detected 12 ZNF630 deletions in a total of 1,562 male patients with mental retardation from Brazil, USA, Australia, and Europe. The breakpoints were analyzed in 10 families, and in all cases they were located within two segmental duplications that share more than 99% sequence identity, indicating that the deletions resulted from non-allelic homologous recombination. In 2,121 healthy male controls, 10 ZNF630 deletions were identified. In total, there was a 1.6-fold higher frequency of this deletion in males with mental retardation as compared to controls, but this increase was not statistically significant (P-value = 0.174). Conversely, a 1.9-fold lower frequency of ZNF630 duplications was observed in patients, which was not significant either (P-value = 0.163). These data do not show that ZNF630 deletions or duplications are associated with mental retardation. © 2010 Wiley-Liss, Inc.
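The 1.6-fold figure for deletions follows directly from the counts given in the abstract (12 of 1,562 patients, 10 of 2,121 controls). A short worked check is sketched below. The helper name is illustrative, and the statistical test behind the reported P-value is not specified in the abstract, so it is not reproduced here.

```python
def carrier_frequency(carriers: int, total: int) -> float:
    """Fraction of individuals carrying the variant."""
    if total <= 0:
        raise ValueError("total must be positive")
    return carriers / total


# Counts as stated in the abstract.
patient_freq = carrier_frequency(12, 1562)   # about 0.77% of patients
control_freq = carrier_frequency(10, 2121)   # about 0.47% of controls

# Ratio of the two frequencies: about 1.6.
fold_change = patient_freq / control_freq
```

With so few carriers in each group, a ratio of this size is compatible with chance, which matches the non-significant result the authors report.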
Abstract:
The genus Eigenmannia (Teleostei: Gymnotiformes), a widely distributed fish genus from the Neotropical region, presents very complex morphological patterns and many taxonomic problems. It has been suggested that this genus harbors a species complex that is hard to differentiate using only morphological characteristics. As a result, many species of Eigenmannia may currently be gathered under a common name. To provide new tools for species characterization in this group, we analyzed the polymorphism of DNA inter-simple sequence repeats (ISSR), obtained by single primer amplification reaction (SPAR), combined with karyotype identification, in specimens sampled from populations of the Upper Paraná, São Francisco and Amazon river basins (Brazil). Specific ISSR patterns generated by the primers (AAGC)4 and (GGAC)4 were found to characterize the ten cytotypes analyzed, even though the cytotypes 2n = 38 and 2n = 38 XX:XY, from the Upper Paraná basin, share some ISSR amplification patterns. The geographical distribution of all Eigenmannia specimens sampled was inferred, showing the cytotype 2n = 31/2n = 32 to be the most frequent and most widely distributed in the Upper Paraná basin. The cytotype 2n = 34 is reported for the first time in the genus Eigenmannia and is restricted to the São Francisco basin. Polymorphic ISSR patterns were also detected for each cytotype. Considering our results and data previously reported in the literature, we suggest that many of the forms of Eigenmannia analyzed here might be regarded as different species. This work reinforces the importance of employing diverse approaches, such as molecular and cytogenetic characterization, to address taxonomic and evolutionary issues.
Abstract:
Traditionally, comparative cytogenetic studies have been based mainly on banding patterns. However, for species with highly rearranged genomes, as in Akodon, or for other highly divergent species, comparisons of banding patterns prove inadequate. Comparative chromosome painting has therefore become the method of choice for genome comparisons at the cytogenetic level, since it allows whole-chromosome probes from one species to be hybridized in situ onto the chromosomes of another, detecting homologous genomic regions between them. In the present study, we explored the highly rearranged complements of Akodon species using reciprocal chromosome painting with species-specific chromosome probes obtained by chromosome sorting. The results revealed complete homology among the complements of Akodon sp. n. (ASP), 2n = 10; Akodon cursor (ACU), 2n = 15; Akodon montensis (AMO), 2n = 24; and Akodon paranaensis (APA), 2n = 44, and extensive chromosome rearrangements were detected among the species with high precision. We observed Robertsonian and tandem rearrangements, pericentric inversions and/or centromere repositioning, paracentric inversions, translocations and insertions, as well as breakpoints at which chromosomal rearrangements appear to be favored. Chromosome painting using the APA set of 21 autosomes plus X and Y revealed eight syntenic segments shared with A. montensis, A. cursor, and ASP, and one syntenic segment shared by A. montensis and A. cursor, plus five exclusive chromosome associations for A. cursor and six for ASP. Chromosome X, except for the heterochromatic region of ASP X, and even chromosome Y showed complete homology among the species. These data indicate that all of these closely related species have experienced a recent, extensive process of autosomal rearrangement in which, except in ASP, sex chromosome homology is still completely conserved.
Abstract:
In several organisms, ribosomal RNA genes are encoded by large clustered units (18S, 5.8S, and 28S) in the nucleolar organizer region. Sometimes additional insertions are present in the coding region of the 28S rDNA. These insertions are specific non-long terminal repeat retrotransposons that have very restricted integration targets within the genome. The retrotransposon present in the genome of Rhynchosciara americana, RaR2, was isolated by screening a genomic library. Sequence analysis showed the presence of conserved regions, such as a reverse transcriptase domain and a zinc finger motif in the amino-terminal region. The insertion site was highly conserved in R. americana, and a phylogenetic analysis showed that this element belongs to the R2 clade. Chromosomal localization confirmed that the RaR2 mobile element was inserted into a specific site in the rDNA gene. The expression level of RaR2 in salivary glands during larval development was determined by quantitative RT-PCR, and the increase in relative expression at the 3P stage of the fourth larval instar could be related to the intense gene activity characteristic of this stage. 5′-truncated elements were identified in different DNA samples. Additionally, in three other Rhynchosciara species, the R2 element was present as a full-length element.
Abstract:
Non-LTR retrotransposons, also known as long interspersed nuclear elements (LINEs), are transposable elements that encode a reverse transcriptase and insert into genomic locations via RNA intermediates. Sequence analysis of a cDNA library constructed from mRNA of the salivary glands of R. americana showed the presence of putative class I elements. A cDNA clone with homology to a reverse transcriptase was the starting point for the present study. A genomic phage clone was isolated and sequenced, and the molecular structure of the element was characterized as that of a non-LTR retrotransposable element. Southern blot analysis indicated that this transposable element is represented by repeat sequences in the genome of R. americana. Chromosome tips were consistently positive when this element was used as a probe for in situ hybridization. Real-time RT-PCR showed that this retrotransposon is transcribed at different periods of larval development. Most interestingly, silencing of this retrotransposon in R. americana by RNA interference resulted in reduced transcript levels and in accelerated larval development.
Abstract:
Two mariner-like elements, Ramar1 and Ramar2, are described in the genome of Rhynchosciara americana. Their nucleotide consensus sequences were derived from multiple defective copies containing deletions, frame shifts and stop codons. Ramar1 contains several conserved amino acid blocks, including a specific D,D(34)D signature motif. Ramar2 is a defective mariner-like element with a deletion spanning most of the internal region of the transposase ORF, while its extremities remain intact. Phylogenetic analysis of the predicted transposase sequences showed that Ramar1 and Ramar2 share high identity with mariner-like elements of the mauritiana subfamily. Southern blot analysis indicated that Ramar1 is widely represented in the genome of Rhynchosciara americana. In situ hybridization showed Ramar1 localized in several chromosome regions, mainly in pericentromeric heterochromatin and its boundaries, while Ramar2 appeared as a single band on chromosome A.
Abstract:
The genome of the most virulent of 22 Brazilian geographical isolates of Spodoptera frugiperda nucleopolyhedrovirus, isolate 19 (SfMNPV-19), was completely sequenced and shown to comprise 132,565 bp and 141 open reading frames (ORFs). A total of 11 ORFs with no homology to genes in the GenBank database were found. Of those, four had typical baculovirus promoter motifs and polyadenylation sites. Computer-simulated restriction enzyme cleavage patterns of SfMNPV-19 were compared with published physical maps of other SfMNPV isolates. Differences were observed in the restriction profiles and in genome size. Comparison of SfMNPV-19 with the sequence of the SfMNPV isolate 3AP2 indicated that they differ by a 1427 bp deletion, as well as by a series of smaller deletions and point mutations. The majority of SfMNPV-19 genes were conserved in the closely related Spodoptera exigua NPV (SeMNPV) and Agrotis segetum NPV (AgseMNPV-A), but a few regions have undergone major changes and rearrangements. Syntenic maps for the genomes of group II NPVs revealed that gene collinearity was observed only within certain clusters. Analysis of the dynamics of gene gain and loss along the phylogenetic tree of the NPVs showed that group II had only five defining genes and supported the hypothesis that these viruses form ten highly divergent ancient lineages. Crucially, more than 60% of the gene gain events followed a power-law relation to genetic distance among baculoviruses, indicative of temporal organization in the gene accretion process.