34 resultados para GENOMES

em National Center for Biotechnology Information - NCBI


Relevância:

20.00% 20.00%

Publicador:

Resumo:

Increasingly, studies of genes and genomes are indicating that considerable horizontal transfer has occurred between prokaryotes. Extensive horizontal transfer has occurred for operational genes (those involved in housekeeping), whereas informational genes (those involved in transcription, translation, and related processes) are seldomly horizontally transferred. Through phylogenetic analysis of six complete prokaryotic genomes and the identification of 312 sets of orthologous genes present in all six genomes, we tested two theories describing the temporal flow of horizontal transfer. We show that operational genes have been horizontally transferred continuously since the divergence of the prokaryotes, rather than having been exchanged in one, or a few, massive events that occurred early in the evolution of prokaryotes. In agreement with earlier studies, we found that differences in rates of evolution between operational and informational genes are minimal, suggesting that factors other than rate of evolution are responsible for the observed differences in horizontal transfer. We propose that a major factor in the more frequent horizontal transfer of operational genes is that informational genes are typically members of large, complex systems, whereas operational genes are not, thereby making horizontal transfer of informational gene products less probable (the complexity hypothesis).

Relevância:

20.00% 20.00%

Publicador:

Resumo:

A satellite DNA sequence, As120a, specific to the A-genome chromosomes in the hexaploid oat, Avena sativa L., was isolated by subcloning a fragment with internal tandem repeats from a plasmid, pAs120, that had been obtained from an Avena strigosa (As genome) genomic library. Southern and in situ hybridization showed that sequences with homology to sequences within pAs120 were dispersed throughout the genome of diploid (A and C genomes), tetraploid (AC genomes), and hexaploid (ACD genomes) Avena species. In contrast, sequences homologous to As120a were found in two A-genome species (A. strigosa and Avena longiglumis) and in the hexaploid A. sativa whereas this sequence was little amplified in the tetraploid Avena murphyi and was absent in the remaining A- and C-genome diploid species. In situ hybridization of pAs120a to hexaploid oat species revealed the distribution of elements of the As120a repeated family over both arms of 14 of 42 chromosomes of this species. By using double in situ hybridization with pAs120a and a C genome-specific probe, three sets of 14 chromosomes were revealed corresponding to the A, C, and D genomes of the hexaploid species. Simultaneous in situ hybridizations with pAs120a and ribosomal probes were used to assign the SAT chromosomes of hexaploid species to their correct genomes. This work reports a sequence able to distinguish between the closely related A and D genomes of hexaploid oats. This sequence offers new opportunities to analyze the relationships of Avena species and to explore the possible evolution of various polyploid oat species.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Eukaryotic genome similarity relationships are inferred using sequence information derived from large aggregates of genomic sequences. Comparisons within and between species sample sequences are based on the profile of dinucleotide relative abundance values (The profile is ρ*XY = f*XY/f*Xf*Y for all XY, where f*X denotes the frequency of the nucleotide X and f*XY denotes the frequency of the dinucleotide XY, both computed from the sequence concatenated with its inverted complement). Previous studies with respect to prokaryotes and this study document that profiles of different DNA sequence samples (sample size ≥50 kb) from the same organism are generally much more similar to each other than they are to profiles from other organisms, and that closely related organisms generally have more similar profiles than do distantly related organisms. On this basis we refer to the collection {ρ*XY} as the genome signature. This paper identifies ρ*XY extremes and compares genome signature differences for a diverse range of eukaryotic species. Interpretations on the mechanisms maintaining these profile differences center on genome-wide replication, repair, DNA structures, and context-dependent mutational biases. It is also observed that mitochondrial genome signature differences between species parallel the corresponding nuclear genome signature differences despite large differences between corresponding mitochondrial and nuclear signatures. The genome signature differences also have implications for contrasts between rodents and other mammals, and between monocot and dicot plants, as well as providing evidence for similarities among fungi and the diversity of protists.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The rice genus, Oryza, which comprises 23 species and 9 recognized genome types, represents an enormous gene pool for genetic improvement of rice cultivars. Clarification of phylogenetic relationships of rice genomes is critical for effective utilization of the wild rice germ plasm. By generating and comparing two nuclear gene (Adh1 and Adh2) trees and a chloroplast gene (matK) tree of all rice species, phylogenetic relationships among the rice genomes were inferred. Origins of the allotetraploid species, which constitute more than one-third of rice species diversity, were reconstructed based on the Adh gene phylogenies. Genome types of the maternal parents of allotetraploid species were determined based on the matK gene tree. The phylogenetic reconstruction largely supports the previous recognition of rice genomes. It further revealed that the EE genome species is most closely related to the DD genome progenitor that gave rise to the CCDD genome. Three species of the CCDD genome may have originated through a single hybridization event, and their maternal parent had the CC genome. The BBCC genome species had different origins, and their maternal parents had either a BB or CC genome. An additional genome type, HHKK, was recognized for Oryza schlechteri and Porteresia coarctata, suggesting that P. coarctata is an Oryza species. The AA genome lineage, which contains cultivated rice, is a recently diverged and rapidly radiated lineage within the rice genus.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Using computer programs developed for this purpose, we searched for various repeated sequences including inverted, direct tandem, and homopurine–homopyrimidine mirror repeats in various prokaryotes, eukaryotes, and an archaebacterium. Comparison of observed frequencies with expectations revealed that in bacterial genomes and organelles the frequency of different repeats is either random or enriched for inverted and/or direct tandem repeats. By contrast, in all eukaryotic genomes studied, we observed an overrepresentation of all repeats, especially homopurine–homopyrimidine mirror repeats. Analysis of the genomic distribution of all abundant repeats showed that they are virtually excluded from coding sequences. Unexpectedly, the frequencies of abundant repeats normalized for their expectations were almost perfect exponential functions of their size, and for a given repeat this function was indistinguishable between different genomes.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We created a simulation based on experimental data from bacteriophage T7 that computes the developmental cycle of the wild-type phage and also of mutants that have an altered genome order. We used the simulation to compute the fitness of more than 105 mutants. We tested these computations by constructing and experimentally characterizing T7 mutants in which we repositioned gene 1, coding for T7 RNA polymerase. Computed protein synthesis rates for ectopic gene 1 strains were in moderate agreement with observed rates. Computed phage-doubling rates were close to observations for two of four strains, but significantly overestimated those of the other two. Computations indicate that the genome organization of wild-type T7 is nearly optimal for growth: only 2.8% of random genome permutations were computed to grow faster, the highest 31% faster, than wild type. Specific discrepancies between computations and observations suggest that a better understanding of the translation efficiency of individual mRNAs and the functions of qualitatively “nonessential” genes will be needed to improve the T7 simulation. In silico representations of biological systems can serve to assess and advance our understanding of the underlying biology. Iteration between computation, prediction, and observation should increase the rate at which biological hypotheses are formulated and tested.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

To determine whether pathogenic mutations in mtDNA are involved in phenotypic expression of Alzheimer’s disease (AD), the transfer of mtDNA from elderly patients with AD into mtDNA-less (ρ0) HeLa cells was carried out by fusion of platelets or synaptosomal fractions of autopsied brain tissues with ρ0 HeLa cells. The results showed that mtDNA in postmortem brain tissue survives for a long time without degradation and could be rescued in ρ0 HeLa cells. Next, the cybrid clones repopulated with exogenously imported mtDNA from patients with AD were used for examination of respiratory enzyme activity and transfer of mtDNA with the pathogenic mutations that induce mitochondrial dysfunction. The presence of the mutated mtDNA was restricted to brain tissues and their cybrid clones that formed with synaptosomes as mtDNA donors, whereas no cybrid clones that isolated with platelets as mtDNA donors had detectable mutated mtDNA. However, biochemical analyses showed that all cybrid clones with mtDNA imported from platelets or brain tissues of patients with AD restored mitochondrial respiration activity to almost the same levels as those of cybrid clones with mtDNA from age-matched normal controls, suggesting functional integrity of mtDNA in both platelets and brain tissues of elderly patients with AD. These observations warrant the reassessment of the conventional concept that the accumulation of pathogenic mutations in mtDNA throughout the aging process is responsible for the decrease of mitochondrial respiration capacity with age and with the development of age-associated neurodegenerative diseases.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Isopentenyl diphosphate (IPP) is the central intermediate in the biosynthesis of isoprenoids, the most ancient and diverse class of natural products. Two distinct routes of IPP biosynthesis occur in nature: the mevalonate pathway and the recently discovered deoxyxylulose 5-phosphate (DXP) pathway. The evolutionary history of the enzymes involved in both routes and the phylogenetic distribution of their genes across genomes suggest that the mevalonate pathway is germane to archaebacteria, that the DXP pathway is germane to eubacteria, and that eukaryotes have inherited their genes for IPP biosynthesis from prokaryotes. The occurrence of genes specific to the DXP pathway is restricted to plastid-bearing eukaryotes, indicating that these genes were acquired from the cyanobacterial ancestor of plastids. However, the individual phylogenies of these genes, with only one exception, do not provide evidence for a specific affinity between the plant genes and their cyanobacterial homologues. The results suggest that lateral gene transfer between eubacteria subsequent to the origin of plastids has played a major role in the evolution of this pathway.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The availability of complete genome sequences and mRNA expression data for all genes creates new opportunities and challenges for identifying DNA sequence motifs that control gene expression. An algorithm, “MobyDick,” is presented that decomposes a set of DNA sequences into the most probable dictionary of motifs or words. This method is applicable to any set of DNA sequences: for example, all upstream regions in a genome or all genes expressed under certain conditions. Identification of words is based on a probabilistic segmentation model in which the significance of longer words is deduced from the frequency of shorter ones of various lengths, eliminating the need for a separate set of reference data to define probabilities. We have built a dictionary with 1,200 words for the 6,000 upstream regulatory regions in the yeast genome; the 500 most significant words (some with as few as 10 copies in all of the upstream regions) match 114 of 443 experimentally determined sites (a significance level of 18 standard deviations). When analyzing all of the genes up-regulated during sporulation as a group, we find many motifs in addition to the few previously identified by analyzing the subclusters individually to the expression subclusters. Applying MobyDick to the genes derepressed when the general repressor Tup1 is deleted, we find known as well as putative binding sites for its regulatory partners.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Cloned PCR products containing hepatitis C virus (HCV) genomic fragments have been used for analyses of HCV genomic heterogeneity and protein expression. These studies assume that the clones derived are representative of the entire virus population and that subsets are not inadvertently selected. The aim of the present study was to express HCV structural proteins. However, we found that there was a strong cloning selection for defective genomes and that most clones generated initially were incapable of expressing the HCV proteins. The HCV structural region (C-E1-E2-p7) was directly amplified by long reverse transcription–PCR from the plasma of an HCV-infected patient or from a control plasmid containing a viable full-length cDNA of HCV derived from the same patient but cloned in a different vector. The PCR products were cloned into a mammalian expression vector, amplified in Escherichia coli, and tested for their ability to produce HCV structural proteins. Twenty randomly picked clones derived from the HCV-infected patient all contained nucleotide mutations leading to absence or truncation of the expected HCV products. Of 25 clones derived from the control plasmid, only 8% were fully functional for polyprotein synthesis. The insertion of extra nucleotides in the region just upstream of the start codon of the HCV insert led to a statistically significant increase in the number of fully functional clones derived from the patient (42%) and from the control plasmid (72–92%). Nonrandom selection of clones during the cloning procedure has enormous implications for the study of viral heterogeneity, because it can produce a false spectrum of genomic diversity. It can also be an impediment to the construction of infectious viral clones.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

A plastid-derived signal plays an important role in the coordinated expression of both nuclear- and chloroplast-localized genes that encode photosynthesis-related proteins. Arabidopsis GUN (genomes uncoupled) loci have been identified as components of plastid-to-nucleus signal transduction. Unlike wild-type plants, gun mutants have nuclear Lhcb1 expression in the absence of chloroplast development. We observed a synergistic phenotype in some gun double-mutant combinations, suggesting there are at least two independent pathways in plastid-to-nucleus signal transduction. There is a reduction of chlorophyll accumulation in gun4 and gun5 mutant plants, and a gun4gun5 double mutant shows an albino phenotype. We cloned the GUN5 gene, which encodes the ChlH subunit of Mg-chelatase. We also show that gun2 and gun3 are alleles of the known photomorphogenic mutants, hy1 and hy2, which are required for phytochromobilin synthesis from heme. These findings suggest that certain perturbations of the tetrapyrrole biosynthetic pathway generate a signal from chloroplasts that causes transcriptional repression of nuclear genes encoding plastid-localized proteins. The comparison of mutant phenotypes of gun5 and another Mg-chelatase subunit (ChlI) mutant suggests a specific function for ChlH protein in the plastid-signaling pathway.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Operon structure is an important organization feature of bacterial genomes. Many sets of genes occur in the same order on multiple genomes; these conserved gene groupings represent candidate operons. This study describes a computational method to estimate the likelihood that such conserved gene sets form operons. The method was used to analyze 34 bacterial and archaeal genomes, and yielded more than 7600 pairs of genes that are highly likely (P ≥ 0.98) to belong to the same operon. The sensitivity of our method is 30–50% for the Escherichia coli genome. The predicted gene pairs are available from our World Wide Web site http://www.tigr.org/tigr-scripts/operons/operons.cgi.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The database of Clusters of Orthologous Groups of proteins (COGs), which represents an attempt on a phylogenetic classification of the proteins encoded in complete genomes, currently consists of 2791 COGs including 45 350 proteins from 30 genomes of bacteria, archaea and the yeast Saccharomyces cerevisiae (http://www.ncbi.nlm.nih.gov/COG). In addition, a supplement to the COGs is available, in which proteins encoded in the genomes of two multicellular eukaryotes, the nematode Caenorhabditis elegans and the fruit fly Drosophila melanogaster, and shared with bacteria and/or archaea were included. The new features added to the COG database include information pages with structural and functional details on each COG and literature references, improvements of the COGNITOR program that is used to fit new proteins into the COGs, and classification of genomes and COGs constructed by using principal component analysis.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The SWISS-PROT group at EBI has developed the Proteome Analysis Database utilising existing resources and providing comparative analysis of the predicted protein coding sequences of the complete genomes of bacteria, archaea and eukaryotes (http://www.ebi.ac.uk/proteome/). The two main projects used, InterPro and CluSTr, give a new perspective on families, domains and sites and cover 31–67% (InterPro statistics) of the proteins from each of the complete genomes. CluSTr covers the three complete eukaryotic genomes and the incomplete human genome data. The Proteome Analysis Database is accompanied by a program that has been designed to carry out InterPro proteome comparisons for any one proteome against any other one or more of the proteomes in the database.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

GOLD is a comprehensive resource for accessing information related to completed and ongoing genome projects world-wide. The database currently provides information on 350 genome projects, of which 48 have been completely sequenced and their analysis published. GOLD was created in 1997 and since April 2000 it has been licensed to Integrated Genomics. The database is freely available through the URL: http://igweb.integratedgenomics.com/GOLD/.