928 resultados para complete genome


Relevância:

30.00% 30.00%

Publicador:

Resumo:

Background: Genome wide association studies (GWAS) are becoming the approach of choice to identify genetic determinants of complex phenotypes and common diseases. The astonishing amount of generated data and the use of distinct genotyping platforms with variable genomic coverage are still analytical challenges. Imputation algorithms combine directly genotyped markers information with haplotypic structure for the population of interest for the inference of a badly genotyped or missing marker and are considered a near zero cost approach to allow the comparison and combination of data generated in different studies. Several reports stated that imputed markers have an overall acceptable accuracy but no published report has performed a pair wise comparison of imputed and empiric association statistics of a complete set of GWAS markers. Results: In this report we identified a total of 73 imputed markers that yielded a nominally statistically significant association at P < 10(-5) for type 2 Diabetes Mellitus and compared them with results obtained based on empirical allelic frequencies. Interestingly, despite their overall high correlation, association statistics based on imputed frequencies were discordant in 35 of the 73 (47%) associated markers, considerably inflating the type I error rate of imputed markers. We comprehensively tested several quality thresholds, the haplotypic structure underlying imputed markers and the use of flanking markers as predictors of inaccurate association statistics derived from imputed markers. Conclusions: Our results suggest that association statistics from imputed markers showing specific MAF (Minor Allele Frequencies) range, located in weak linkage disequilibrium blocks or strongly deviating from local patterns of association are prone to have inflated false positive association signals. The present study highlights the potential of imputation procedures and proposes simple procedures for selecting the best imputed markers for follow-up genotyping studies.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Background: The Trypanosoma cruzi genome was sequenced from a hybrid strain (CL Brener). However, high allelic variation and the repetitive nature of the genome have prevented the complete linear sequence of chromosomes being determined. Determining the full complement of chromosomes and establishing syntenic groups will be important in defining the structure of T. cruzi chromosomes. A large amount of information is now available for T. cruzi and Trypanosoma brucei, providing the opportunity to compare and describe the overall patterns of chromosomal evolution in these parasites. Methodology/Principal Findings: The genome sizes, repetitive DNA contents, and the numbers and sizes of chromosomes of nine strains of T. cruzi from four lineages (TcI, TcII, TcV and TcVI) were determined. The genome of the TcI group was statistically smaller than other lineages, with the exception of the TcI isolate Tc1161 (Jose-IMT). Satellite DNA content was correlated with genome size for all isolates, but this was not accompanied by simultaneous amplification of retrotransposons. Regardless of chromosomal polymorphism, large syntenic groups are conserved among T. cruzi lineages. Duplicated chromosome-sized regions were identified and could be retained as paralogous loci, increasing the dosage of several genes. By comparing T. cruzi and T. brucei chromosomes, homologous chromosomal regions in T. brucei were identified. Chromosomes Tb9 and Tb11 of T. brucei share regions of syntenic homology with three and six T. cruzi chromosomal bands, respectively. Conclusions: Despite genome size variation and karyotype polymorphism, T. cruzi lineages exhibit conservation of chromosome structure. Several syntenic groups are conserved among all isolates analyzed in this study. The syntenic regions are larger than expected if rearrangements occur randomly, suggesting that they are conserved owing to positive selection. Mapping of the syntenic regions on T. cruzi chromosomal bands provides evidence for the occurrence of fusion and split events involving T. brucei and T. cruzi chromosomes.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The complete nucleotide sequence of the genomic RNA from the insect picorna-like virus Drosophila C virus (DCV) was determined. The DCV sequence predicts a genome organization different to that of other RNA virus families whose sequences are known. The single-stranded positive-sense genomic RNA is 9264 nucleotides in length and contains two large open reading frames (ORFs) which are separated by 191 nucleotides. The 5' ORF contains regions of similarities with the RNA-dependent RNA polymerase, helicase and protease domains of viruses from the picornavirus, comovirus and sequivirus families. The 3' ORF encodes the capsid proteins as confirmed by N-terminal sequence analysis of these proteins. The capsid protein coding region is unusual in two ways: firstly the cistron appears to lack an initiating methionine and secondly no subgenomic RNA is produced, suggesting that the proteins may be translated through internal initiation of translation from the genomic length RNA. The finding of this novel genome organization for DCV shows that this virus is not a member of the Picornaviridae as previously thought, but belongs to a distinct and hitherto unrecognized virus family.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

An analysis of the relationships of the major arthropod groups Was undertaken using mitochondrial genome data to examine the hypotheses that Hexapoda is polyphyletic and that Collembola is more closely related to branchiopod crustaceans than insects. We sought to examine the sensitivity of this relationship to outgroup choice, data treatment. gene choice and optimality criteria used in the phylogenetic analysis of mitochondrial genome data. Additionally we sequenced the mitochondrial genome of ail archaeognathan, Nesomachilis australica. to improve taxon selection in the apterygote insects, a group poorly represented in previous mitochondrial phylogenies. The sister group of the Collembola was rarely resolved in our analyses with a significant level of support. The use of different outgroups (myriapods, nematodes, or annelids + mollusks) resulted in many different placements of Collembola. The way in which the dataset was coded for analysis (DNA, DNA with the exclusion of third codon position and as amino acids) also had marked affects on tree topology. We found that nodal Support was spread evenly throughout the 13 mitochondrial genes and the exclusion of genes resulted in significantly less resolution in the inferred trees. Optimality criteria had a much lesser effect on topology than the preceding factors; parsimony and Bayesian trees for a given data set and treatment were quite similar. We therefore conclude that the relationships of the extant arthropod groups as inferred by mitochondrial genomes are highly vulnerable to outgroup choice, data treatment and gene choice, and no consistent alternative hypothesis of Collembola's relationships is supported. Pending the resolution of these identified problems with the application of mitogenomic data to basal arthropod relationships, it is difficult to justify the rejection of hexapod monophyly, which is well supported on morphological grounds. (c) The Willi Hennig Society 2004.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Rapid evolution and high intrahost sequence diversity are hallmarks of human and simian immunodeficiency virus (HIV/SIV) infection. Minor viral variants have important implications for drug resistance, receptor tropism, and immune evasion. Here, we used ultradeep pyrosequencing to sequence complete HIV/SIV genomes, detecting variants present at a frequency as low as 1%. This approach provides a more complete characterization of the viral population than is possible with conventional methods, revealing low-level drug resistance and detecting previously hidden changes in the viral population. While this work applies pyrosequencing to immunodeficiency viruses, this approach could be applied to virtually any viral pathogen.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Although patterns of somatic alterations have been reported for tumor genomes, little is known on how they compare with alterations present in non-tumor genomes. A comparison of the two would be crucial to better characterize the genetic alterations driving tumorigenesis. We sequenced the genomes of a lymphoblastoid (HCC1954BL) and a breast tumor (HCC1954) cell line derived from the same patient and compared the somatic alterations present in both. The lymphoblastoid genome presents a comparable number and similar spectrum of nucleotide substitutions to that found in the tumor genome. However, a significant difference in the ratio of non-synonymous to synonymous substitutions was observed between both genomes (P = 0.031). Protein-protein interaction analysis revealed that mutations in the tumor genome preferentially affect hub-genes (P = 0.0017) and are co-selected to present synergistic functions (P < 0.0001). KEGG analysis showed that in the tumor genome most mutated genes were organized into signaling pathways related to tumorigenesis. No such organization or synergy was observed in the lymphoblastoid genome. Our results indicate that endogenous mutagens and replication errors can generate the overall number of mutations required to drive tumorigenesis and that it is the combination rather than the frequency of mutations that is crucial to complete tumorigenic transformation.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The complete arrangement of genes in the mitochondrial (mt) genome is known for 12 species of insects, and part of the gene arrangement in the mt genome is known for over 300 other species of insects. The arrangement of genes in the mt genome is very conserved in insects studied, since all of the protein-coding and rRNA genes and most of the tRNA genes are arranged in the same way. We sequenced the entire mt genome of the wallaby louse, Heterodoxus macropus, which is 14,670 bp long and has the 37 genes typical of animals and some noncoding regions. The largest noncoding region is 73 bp long (93% A+T), and the second largest is 47 bp long (92% AST). Both of these noncoding regions seem to be able to form stem-loop structures. The arrangement of genes in the mt genome of this louse is unlike that of any other animal studied. All tRNA genes have moved and/or inverted relative to the ancestral gene arrangement of insects, which is present in the fruit fly Drosophila yakuba. At least nine protein-coding genes (atp6, atp8, cox2, cob, nad1-nad3, nad5, and nad6) have moved; moreover, four of these genes (atp6, atp8, nad1, and nad3) have inverted. The large number of gene rearrangements in the mt genome of H. macropus is unprecedented for an arthropod.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The main focus of the human genome sequencing project has been gene discovery, but a great additional benefit is that it offers the chance to examine the large proportion of the genome that does not contain human genes. The nature of this ‘noncoding’ DNA is poorly understood, both as an evolutionary question (how did it get there?) and in the functional sense (what is it doing now?). Much of the noncoding DNA is derived from retroviruses that have inserted their DNA into the genome. The availability of complete genomic sequences will revolutionize studies of the number and location of endogenous retroviruses, their role in genome evolution, and their contribution to human disease.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Unlike other members of the genus, Echinococcus granulosus is known to exhibit considerable levels of variation in biology, physiology and molecular genetics. Indeed, some of the taxa regarded as 'genotypes' within E. granulosus might be sufficiently distinct as to merit specific status. Here, complete mitochondrial genomes are presented of 2 genotypes of E. granulosus (G1-sheep-dog strain: G4-horse-dog strain) and of another taeniid cestode, Taenia crassiceps. These genomes are characterized and compared with those of Echinococcus multilocularis and Hymenolepis diminuta. Genomes of all the species are very similar in structure, length and base-composition. Pairwise comparisons of concatenated protein-coding genes indicate that the G1 and G4 genotypes of E. granulosus are almost as distant from each other as each is from a distinct species, E. multilocularis. Sequences for the variable genes atp6 and nad3 were obtained from additional genotypes of E. granulosus, from E. vogeli and E. oligarthrus. Again, pairwise comparisons showed the distinctiveness of the G1 and G4 genotypes. Phylogenetic analyses of concatenated atp6, nad1 (partial) and cox1 (partial) genes from E. multilocularis, E. vogeli, E. oligarthrus, 5 genotypes of E. granulosus, and using T. crassiceps as an outgroup, yielded the same results. We conclude that the sheep-dog and horse-dog strains of E. granulosus should be regarded as distinct at the specific level.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Our previous studies have shown that two distinct genotypes of Sindbis (SIN) virus occur in Australia. One of these, the Oriental/Australian type, circulates throughout most of the Australian continent, whereas the recently identified south-west (SW) genetic type appears to be restricted to a distinct geographic region located in the temperate south-west of Australia. We have now determined the complete nucleotide and translated amino acid sequences of a SW isolate of SIN virus (SW6562) and performed comparative analyses with other SIN viruses at the genomic level. The genome of SW6562 is 11,569 nucleotides in length, excluding the cap nucleotide and poly (A) tail. Overall this virus differs from the prototype SIN virus (strain AR339) by 23% in nucleotide sequence and 12.5% in amino acid sequence. Partial sequences of four regions of the genome of four SW isolates were determined and compared with the corresponding sequences from a number of SIN isolates from different regions of the World. These regions are the non-structural protein (nsP3), the E2 gene, the capsid gene, and the repeated sequence elements (RSE) of the 3'UTR. These comparisons revealed that the SW SIN viruses were more closely related to South African and European strains than to other Australian isolates of SIN virus. Thus the SW genotype of SIN virus may have been introduced into this region of Australia by viremic humans or migratory birds and subsequently evolved independently in the region. The sequence data also revealed that the SW genotype contains a unique deletion in the RSE of the 3'UTR region of the genome. Previous studies have shown that deletions in this region of the SIN genome can have significant effects on virus replication in mosquito and avian cells, which may explain the restricted distribution of this genotype of SIN virus.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The work presented in this thesis describes the functional characterization of hydrogenases in the overall energy metabolism of the sulfate reducing bacterium Desulfovibrio gigas. With the complete annotation of the D. gigas genome, we were able to verify that only the two previously described hydrogenases are present in this organism, the periplasmic [NiFe] HynAB and the cytoplasmic membrane-bound [NiFe] Ech.(...)

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Rigorous organization and quality control (QC) are necessary to facilitate successful genome-wide association meta-analyses (GWAMAs) of statistics aggregated across multiple genome-wide association studies. This protocol provides guidelines for (i) organizational aspects of GWAMAs, and for (ii) QC at the study file level, the meta-level across studies and the meta-analysis output level. Real-world examples highlight issues experienced and solutions developed by the GIANT Consortium that has conducted meta-analyses including data from 125 studies comprising more than 330,000 individuals. We provide a general protocol for conducting GWAMAs and carrying out QC to minimize errors and to guarantee maximum use of the data. We also include details for the use of a powerful and flexible software package called EasyQC. Precise timings will be greatly influenced by consortium size. For consortia of comparable size to the GIANT Consortium, this protocol takes a minimum of about 10 months to complete.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Clone CL Brener is the reference organism used in the Trypanosoma cruzi Genome Project. Some biological parameters of CL Brener were determined: (a) the doubling time of epimastigote forms cultured in liver infusion-tryptose (LIT) medium at 28oC is 58±13 hr; (b) differentiation of epimastigotes to metacyclic trypomastigotes is obtained by incubation in LIT-20% Grace´s medium; (c) trypomastigotes infect mammalian cultured cells and perform the complete intracellular cycle at 33 and 37oC; (d) blood forms are highly infective to mice; (e) blood forms are susceptible to nifurtimox and benznidazole. The molecular typing of CL Brener has been determined: (a) isoenzymatic profiles are characteristic of zymodeme ZB; (b) PCR amplification of a 24Sa ribosomal RNA sequence indicates it belongs to T. cruzi lineage 1; (c) schizodeme, randomly amplified polymorphic DNA (RAPD) and DNA fingerprinting analyses were performed

Relevância:

30.00% 30.00%

Publicador:

Resumo:

"The host-parasite relationship" is a vast and diverse research field which, despite huge human and financial input over many years, remains largely shrouded in mystery. Clearly, the adaptation of parasites to their different host species, and to the different environmental stresses that they represent, depends on interactions with, and responses to, various molecules of host and/or parasite origin. The schistosome genome project is a primary strategy to reach the goal; this systematic research project has successfully developed novel technologies for qualitative and quantitative characterization of schistosome genes and genome organization by extensive international collaboration between top quality laboratories. Schistosomes are a family of parasitic blood flukes (Phylum Platyhelminthes), which have seven pairs of autosomal chromosomes and one pair of sex chromosomes (ZZ for a male worm and ZW for a female), of a haploid genome size of 2.7x108 base pairs (Simpson et al. 1982). Schistosomes are ideal model organisms for the development of genome mapping strategies since they have a small genome size comparable to that of well-characterized model organisms such as Caenorhabditis elegans (100 Mb) and Drosophila (165 Mb), and contain functional genes with a high level of homology to the host mammalian genes. Here we summarize the current progress in the schistosome genome project, the information of 3,047 transcribed genes (Expressed Sequence Tags; EST), complete sets of cDNA and genomic DNA libraries (including YAC and cosmid libraries) with a mapping technique to the well defined schistosome chromosomes. The schistosome genome project will further identify and characterize the key molecules that are responsible for host-parasite adaptation, i.e., successful growth, development, maturation and reproduction of the parasite within its host in the near future

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Ants are some of the most abundant and familiar animals on Earth, and they play vital roles in most terrestrial ecosystems. Although all ants are eusocial, and display a variety of complex and fascinating behaviors, few genomic resources exist for them. Here, we report the draft genome sequence of a particularly widespread and well-studied species, the invasive Argentine ant (Linepithema humile), which was accomplished using a combination of 454 (Roche) and Illumina sequencing and community-based funding rather than federal grant support. Manual annotation of >1,000 genes from a variety of different gene families and functional classes reveals unique features of the Argentine ant's biology, as well as similarities to Apis mellifera and Nasonia vitripennis. Distinctive features of the Argentine ant genome include remarkable expansions of gustatory (116 genes) and odorant receptors (367 genes), an abundance of cytochrome P450 genes (>110), lineage-specific expansions of yellow/major royal jelly proteins and desaturases, and complete CpG DNA methylation and RNAi toolkits. The Argentine ant genome contains fewer immune genes than Drosophila and Tribolium, which may reflect the prominent role played by behavioral and chemical suppression of pathogens. Analysis of the ratio of observed to expected CpG nucleotides for genes in the reproductive development and apoptosis pathways suggests higher levels of methylation than in the genome overall. The resources provided by this genome sequence will offer an abundance of tools for researchers seeking to illuminate the fascinating biology of this emerging model organism.