64 resultados para genome
Resumo:
We characterized the consensus sequence and structure of a long terminal repeat (LTR) retrotransposon from the genome of the human blood fluke, Schistosoma japonicum, and have earned this element, Gulliver. The full length, consensus Gulliver LTR retrotransposon was 4788 bp, and it was flanked at its 5'- and 3'-ends by LTRs of 259 bp. Each LTR included RNA polymerase II promoter sequences, a CAAT signal and a TATA box, Gulliver exhibited features characteristic of a functional LTR retrotransposon including two read through (termination) ORFs encoding retroviral gag and pol proteins of 312 and 1071 amino acid residues, respectively. The gag ORF encoded motifs conserved in nucleic acid binding proteins, while the pol ORF encoded conserved domains of aspartic protease, reverse transcriptase (RT), RNaseH and integrase, in that order, a pol pattern conserved in the gypsy lineage of LTR retrotransposons. Whereas the sequence and structure of Gulliver was similar to that of gypsy, phylogenetic analysis revealed that Gulliver did not group particularly closely with the gypsy family. Rather, its closest relatives were a LTR retrotransposon from Caenorhabditis elegans, mag from Bombyx mori and, to a lesser extent, easel from the salmon Oncorhynchus keta. Dot blot hybridizations indicated that Gulliver was present at between 100 and several thousand copies in the S. japonicum genome, and Southern hybridization analysis suggested its probable presence in the genome of Schistosoma mansoni. Transcripts encoding the RT domain of Gulliver were detected by RT-PCR in larval and adult stages of S. japonicum, indicating that (at least) the RT domain of Gulliver is transcribed. This is the first report of the sequence and structure of an LTR retrotransposon from any schistosome or indeed from any species belonging to the phylum Platyhelminthes. (C) 2001 Elsevier Science B.V. All rights reserved.
Resumo:
The complete nucleotide sequence of the mitochondrial (mt) DNA molecule of the liverfluke, Fasciola hepatica (phylum Platyhelminthes, class Trematoda, family Fasciolidae), was determined, It comprises 14462 bp, contains 12 protein-encoding, 2 ribosomal and 22 transfer RNA genes, and is the second complete flatworm (and the first trematode) mitochondrial sequence to be described in detail. All of the genes are transcribed from the same strand. Of the genes typically found in mitochondrial genomes of eumetazoans, only atp8 is absent. The nad4L and nad4 genes overlap by 40 nt. Most intergenic sequences are very short. Two larger non-coding regions are present. The longer one (817 nt) is located between trnG and cox3 and consists of 8 identical tandem repeats of 85 nt, rich in G and C, followed by 1 imperfect repeat. The shorter non-coding region (187 nt) exhibits no special features and is separated from the longer region by trnG. The gene arrangement resembles that of some other trematodes including the eastern Asian Schistosoma species (and cyclophyllidean cestode species) but it is strikingly different from that of the African schistosomes, represented by Schistosoma mansoni. The genetic code is as inferred previously for flatworms. Transfer RNA genes range in length from 58 to 70 nt, their products producing characteristic 'clover leaf' structures, except for tRNA(S-VON) and tRNA(S-AGN) lacking the DHU arm.
Resumo:
The first chordates appear in the fossil record at the time of the Cambrian explosion, nearly 550 million years ago. The modern ascidian tadpole represents a plausible approximation to these ancestral chordates. To illuminate the origins of chordate and vertebrates, we generated a draft of the protein-coding portion of the genome of the most studied ascidian, Ciona intestinalis. The Ciona genome contains similar to16,000 protein-coding genes, similar to the number in other invertebrates, but only half that found in vertebrates. Vertebrate gene families are typically found in simplified form in Ciona, suggesting that ascidians contain the basic ancestral complement of genes involved in cell signaling and development. The ascidian genome has also acquired a number of lineage-specific innovations, including a group of genes engaged in cellulose metabolism that are related to those in bacteria and fungi.
Resumo:
The main focus of the human genome sequencing project has been gene discovery, but a great additional benefit is that it offers the chance to examine the large proportion of the genome that does not contain human genes. The nature of this ‘noncoding’ DNA is poorly understood, both as an evolutionary question (how did it get there?) and in the functional sense (what is it doing now?). Much of the noncoding DNA is derived from retroviruses that have inserted their DNA into the genome. The availability of complete genomic sequences will revolutionize studies of the number and location of endogenous retroviruses, their role in genome evolution, and their contribution to human disease.
Resumo:
Darwin's paradigm holds that the diversity of present-day organisms has arisen via a process of genetic descent with modification, as on a bifurcating tree. Evidence is accumulating that genes are sometimes transferred not along lineages but rather across lineages. To the extent that this is so, Darwin's paradigm can apply only imperfectly to genomes, potentially complicating or perhaps undermining attempts to reconstruct historical relationships among genomes (i.e., a genome tree). Whether most genes in a genome have arisen via treelike (vertical) descent or by lateral transfer across lineages can be tested if enough complete genome sequences are used. We define a phylogenetically discordant sequence (PDS) as an open reading frame (ORF) that exhibits patterns of similarity relationships statistically distinguishable from those of most other ORFs in the same genome. PDSs represent between 6.0 and 16.8% (mean, 10.8%) of the analyzable ORFs in the genomes of 28 bacteria, eight archaea, and one eukaryote (Saccharomyces cerevisiae). In this study we developed and assessed a distance-based approach, based on mean pairwise sequence similarity, for generating genome trees. Exclusion of PDSs improved bootstrap support for basal nodes but altered few topological features, indicating that there is little systematic bias among PDSs. Many but not all features of the genome tree from which PDSs were excluded are consistent with the 16S rRNA tree.
Resumo:
A newly described non-long terminal repeat (non-LTR) retrotransposon element was isolated from the genome of the Oriental schistosome, Schistosoma japonicum. At least 1000 partial copies of the element, which was named pido, were dispersed throughout the genome of S. japonicum. As is usual with non-LTR retrotransposons, it is expected that many pido elements will be 5'-truncated. A consensus sequence of 3564 bp of the truncated pido element was assembled from several genomic fragments that contained pido-hybridizing sequences. The sequence encoded part of the first open reading frame (ORF), the entire second ORF and, at its 3'-terminus, a tandemly repetitive, A-rich (TA(6)TA(5)TA(8)) tail, The ORF1 of pido encoded a nucleic acid binding protein and ORF2 encoded a retroviral-like polyprotein that included apurinic/apyrimidinic endonuclease (EN) and reverse transcriptase (RT) domains, in that order. Based on its sequence and structure, and phylogenetic analyses of both the RT and EN domains, pido belongs to the chicken repeat 1 (CR1)-like lineage of elements known from the chicken, turtle, puffer fish, mosquitoes and other taxa. pido shared equal similarity with CRI from chicken, an uncharacterized retrotransposon from Caenorhabditis elegans and SR1 (a non-LTR retrotransposon) from the related blood fluke Schistosoma mansoni; the level of similarity between pido and SR1 indicated that these two schistosome retrotransposons were related but not orthologous. The findings indicate that schistosomes have been colonized by at least two discrete CRI-like elements. Whereas pido did not appear to have a tight target site specificity, at least one copy of pido has inserted into the 3'-untranslated region of a protein-encoding gene (GeriBank AW736757) of as yet unknown identity. mRNA encoding the RT of pido was detected by reverse transcription-polymerase chain reaction in the egg, miracidium. and adult developmental stages of S. japonicum, indicating that the RT domain was transcribed and suggesting that pido was replicating actively and mobile within the S. japonicum genome. (C) 2002 Elsevier Science B.V. All rights reserved.
Resumo:
Point mutations that resulted in a substitution of the conserved 3'-penultimate cytidine in genomic RNA or the RNA negative strand of the self-amplifying replicon of the Flavivirus Kunjin virus completely blocked in vivo replication. Similarly, substitutions of the conserved 3'-terminal uridine in the RNA negative or positive strand completely blocked replication or caused much-reduced replication, respectively. The same preference for cytidine in the 3'-terminal dinucleotide was noted in reports of the in vitro activity of the RNA-dependent RNA polymerase (RdRp) for the other genera of Flaviviridae that also employ a double-stranded RNA (dsRNA) template to initiate asymmetric semiconservative RNA positive-strand synthesis. The Kunjin virus replicon results were interpreted in the context of a proposed model for initiation of RNA synthesis based on the solved crystal structure of the RdRp of phi6 bacteriophage, which also replicates efficiently using a dsRNA template with conserved 3'-penultimate cytidines and a 3'-terminal pyrimidine. A previously untested substitution of the conserved pentanucleotide at the top of the 3'-terminal stem-loop of all Flavivirus species also blocked detectable in vivo replication of the Kunjin virus replicon RNA.
Resumo:
Idiosyncratic markers are features of genes and genomes that are so unusual that it is unlikely that they evolved more than once in a lineage of organisms. Here we explore further the potential of idiosyncratic markers and changes to typically conserved tRNA sequences for phylogenetic inference. Hard ticks were chosen as the model group because their phylogeny has been studied extensively. Fifty-eight candidate markers from hard ticks ( family Ixodidae) and 22 markers from the subfamily Rhipicephalinae sensu lato were mapped onto phylogenies of these groups. Two of the most interesting markers, features of the secondary structure of two different tRNAs, gave strong support to the hypothesis that species of the Prostriata ( Ixodes spp.) are monophyletic. Previous analyses of genes and morphology did not strongly support this relationship, instead suggesting that the Prostriata is paraphyletic with respect to the Metastriata ( the rest of the hard ticks). Parallel or convergent evolution was not found in the arrangements of mitochondrial genes in ticks nor were there any reversals to the ancestral arthropod character state. Many of the markers identified were phylogenetically informative, whereas others should be informative with study of additional taxa. Idiosyncratic markers and changes to typically conserved nucleotides in tRNAs that are phylogenetically informative were common in this data set, and thus these types of markers might be found in other organisms.
Resumo:
To help understand the mechanisms of gene rearrangement in the mitochondrial (mt) genomes of hemipteroid insects, we sequenced the mt genome of the plague thrips, Thrips imaginis (Thysanoptera). This genome is circular, 15,407 by long, and has many unusual features, including (1) rRNA genes inverted and distant from one another, (2) an extra gene for tRNA-Ser, (3) a tRNA-Val lacking a D-arm, (4) two pseudo-tRNA genes, (5) duplicate control regions, and (6) translocations and/or inversions of 24 of the 37 genes. The mechanism of rRNA gene transcription in T. imaginis may be different from that of other arthropods since the two rRNA genes have inverted and are distant from one another. Further, the rRNA genes are not adjacent or even close to either of the two control regions. Tandem duplication and deletion is a plausible model for the evolution of duplicate control regions and for the gene translocations, but intramitochondrial recombination may account for the gene inversions in T. imaginis. All the 18 genes between control regions #1 and #2 have translocated and/or inverted, whereas only six of the 20 genes outside this region have translocated and/or inverted. Moreover, the extra tRNA gene and the two pseudo-tRNA genes are either in this region or immediately adjacent to one of the control regions. These observations suggest that tandem duplication and deletion may be facilitated by the duplicate control regions and may have occurred a number of times in the lineage leading to T. imaginis. T. imaginis shares two novel gene boundaries with a lepidopsocid species from another order of hemipteroid insects, the Psocoptera. The evidence available suggests that these shared gene boundaries evolved by convergence and thus are not informative for the interordinal phylogeny of hemipteroid insects. We discuss the potential of hemipteroid insects as a model system for studies of the evolution of animal rut genomes and outline some fundamental questions that may be addressed with this system.
Resumo:
Linkage disequilibrium (LD) mapping is commonly used as a fine mapping tool in human genome mapping and has been used with some success for initial disease gene isolation in certain isolated inbred human populations. An understanding of the population history of domestic dog breeds suggests that LID mapping could be routinely utilized in this species for initial genome-wide scans. Such an approach offers significant advantages over traditional linkage analysis. Here, we demonstrate, using canine copper toxicosis in the Bedlington terrier as the model, that LID mapping could be reasonably expected to be a useful strategy in low-resolution, genome-wide scans in pure-bred dogs. Significant LID was demonstrated over distances up to 33.3 cM. It is very unlikely, for a number of reasons discussed, that this result could be extrapolated to the rest of the genome. It is, however, consistent with the expectation given the population structure of canine breeds and, in this breed at least, with the hypothesis that it may be possible to utilize LID in a genome-wide scan. In this study, LD mapping confirmed the location of the copper toxicosis in Bedlington terrier gene (CT-BT) and was able to do so in a population that was refractory to traditional linkage analysis.
Resumo:
Schizophrenia is a common disorder with high heritability and a 10-fold increase in risk to siblings of probands. Replication has been inconsistent for reports of significant genetic linkage. To assess evidence for linkage across studies, rank-based genome scan meta-analysis (GSMA) was applied to data from 20 schizophrenia genome scans. Each marker for each scan was assigned to 1 of 120 30-cM bins, with the bins ranked by linkage scores (1 = most significant) and the ranks averaged across studies (R-avg) and then weighted for sample size (rootN[affected cases]). A permutation test was used to compute the probability of observing, by chance, each bin's average rank (P-AvgRnk) or of observing it for a bin with the same place (first, second, etc.) in the order of average ranks in each permutation (P-ord). The GSMA produced significant genomewide evidence for linkage on chromosome 2q (P-AvgRnk
Resumo:
We have rated eye color on a 3-point scale (1=blue/grey, 2=hazel/green, 3=brown) in 502 twin families and carried out a 5-10 cM genome scan (400-757 markers). We analyzed eye color as a threshold trait and performed multipoint sib pair linkage analysis using variance components analysis in Mx. A lod of 19.2 was found at the marker D15S1002, less than 1 cM from OCA2, which has been previously implicated in eye color variation. We estimate that 74% of variance in eye color liability is due to this QTL and a further 18% due to polygenic effects. However, a large shoulder on this peak suggests that other loci affecting eye color may be telomeric of OCA2 and inflating the QTL estimate. No other peaks reached genome-wide significance, although lods >2 were seen on 5p and 14q and lods >1 were additionally seen on chromosomes 2, 3, 6, 7, 8, 9, 17 and 18. Most of these secondary peaks were reduced or eliminated when we repeated the scan as a two locus analysis with the 15q linkage included, although this does not necessarily exclude them as false positives. We also estimated the interaction between the 15q QTL and the other marker locus but there was only minor evidence for additive x additive epistasis. Elaborating the analysis to the full two-locus model including non-additive main effects and interactions did not strengthen the evidence for epistasis. We conclude that most variation in eye color in Europeans is due to polymorphism in OCA2 but that there may be modifiers at several other loci.