987 resultados para Eukaryotic Genomes


Relevância:

20.00% 20.00%

Publicador:

Resumo:

In eukaryotic cells the TATA-binding protein (TBP) associates with other proteins known as TBP-associated factors (TAFs) to form multisubunit transcription factors important for gene expression by all three nuclear RNA polymerases. Computer searching of the complete Saccharomyces cerevisiae genome revealed five previously unidentified yeast genes with significant sequence similarity to known human and Drosophila RNA polymerase II TAFs. Each of these genes is essential for viability. A sixth essential gene (FUN81) has previously been noted to be similar to human TAFII18. Coimmunoprecipitation experiments show that all six proteins are associated with TBP, demonstrating that they are true TAFs. Furthermore, these proteins are present in complexes containing the TAFII130 subunit, indicating that they are components of TFIID. Based on their predicted molecular weights, these genes have been designated TAF67, TAF61(68), TAF40, TAF23(25), TAF19(FUN81), and TAF17. Yeast TAF61 is significantly larger than its higher eukaryotic homologues, and deletion analysis demonstrates that the evolutionarily conserved, histone-like domain is sufficient and necessary to support viability.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Cells of several major algal groups are evolutionary chimeras of two radically different eukaryotic cells. Most of these “cells within cells” lost the nucleus of the former algal endosymbiont. But after hundreds of millions of years cryptomonads still retain the nucleus of their former red algal endosymbiont as a tiny relict organelle, the nucleomorph, which has three minute linear chromosomes, but their function and the nature of their ends have been unclear. We report extensive cryptomonad nucleomorph sequences (68.5 kb), from one end of each of the three chromosomes of Guillardia theta. Telomeres of the nucleomorph chromosomes differ dramatically from those of other eukaryotes, being repeats of the 23-mer sequence (AG)7AAG6A, not a typical hexamer (commonly TTAGGG). The subterminal regions comprising the rRNA cistrons and one protein-coding gene are exactly repeated at all three chromosome ends. Gene density (one per 0.8 kb) is the highest for any cellular genome. None of the 38 protein-coding genes has spliceosomal introns, in marked contrast to the chlorarachniophyte nucleomorph. Most identified nucleomorph genes are for gene expression or protein degradation; histone, tubulin, and putatively centrosomal ranbpm genes are probably important for chromosome segregation. No genes for primary or secondary metabolism have been found. Two of the three tRNA genes have introns, one in a hitherto undescribed location. Intergenic regions are exceptionally short; three genes transcribed by two different RNA polymerases overlap their neighbors. The reported sequences encode two essential chloroplast proteins, FtsZ and rubredoxin, thus explaining why cryptomonad nucleomorphs persist.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

To determine whether pathogenic mutations in mtDNA are involved in phenotypic expression of Alzheimer’s disease (AD), the transfer of mtDNA from elderly patients with AD into mtDNA-less (ρ0) HeLa cells was carried out by fusion of platelets or synaptosomal fractions of autopsied brain tissues with ρ0 HeLa cells. The results showed that mtDNA in postmortem brain tissue survives for a long time without degradation and could be rescued in ρ0 HeLa cells. Next, the cybrid clones repopulated with exogenously imported mtDNA from patients with AD were used for examination of respiratory enzyme activity and transfer of mtDNA with the pathogenic mutations that induce mitochondrial dysfunction. The presence of the mutated mtDNA was restricted to brain tissues and their cybrid clones that formed with synaptosomes as mtDNA donors, whereas no cybrid clones that isolated with platelets as mtDNA donors had detectable mutated mtDNA. However, biochemical analyses showed that all cybrid clones with mtDNA imported from platelets or brain tissues of patients with AD restored mitochondrial respiration activity to almost the same levels as those of cybrid clones with mtDNA from age-matched normal controls, suggesting functional integrity of mtDNA in both platelets and brain tissues of elderly patients with AD. These observations warrant the reassessment of the conventional concept that the accumulation of pathogenic mutations in mtDNA throughout the aging process is responsible for the decrease of mitochondrial respiration capacity with age and with the development of age-associated neurodegenerative diseases.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Isopentenyl diphosphate (IPP) is the central intermediate in the biosynthesis of isoprenoids, the most ancient and diverse class of natural products. Two distinct routes of IPP biosynthesis occur in nature: the mevalonate pathway and the recently discovered deoxyxylulose 5-phosphate (DXP) pathway. The evolutionary history of the enzymes involved in both routes and the phylogenetic distribution of their genes across genomes suggest that the mevalonate pathway is germane to archaebacteria, that the DXP pathway is germane to eubacteria, and that eukaryotes have inherited their genes for IPP biosynthesis from prokaryotes. The occurrence of genes specific to the DXP pathway is restricted to plastid-bearing eukaryotes, indicating that these genes were acquired from the cyanobacterial ancestor of plastids. However, the individual phylogenies of these genes, with only one exception, do not provide evidence for a specific affinity between the plant genes and their cyanobacterial homologues. The results suggest that lateral gene transfer between eubacteria subsequent to the origin of plastids has played a major role in the evolution of this pathway.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Analyses of complete genomes indicate that a massive prokaryotic gene transfer (or transfers) preceded the formation of the eukaryotic cell. In comparisons of the entire set of Methanococcus jannaschii genes with their orthologs from Escherichia coli, Synechocystis 6803, and the yeast Saccharomyces cerevisiae, it is shown that prokaryotic genomes consist of two different groups of genes. The deeper, diverging informational lineage codes for genes which function in translation, transcription, and replication, and also includes GTPases, vacuolar ATPase homologs, and most tRNA synthetases. The more recently diverging operational lineage codes for amino acid synthesis, the biosynthesis of cofactors, the cell envelope, energy metabolism, intermediary metabolism, fatty acid and phospholipid biosynthesis, nucleotide biosynthesis, and regulatory functions. In eukaryotes, the informational genes are most closely related to those of Methanococcus, whereas the majority of operational genes are most closely related to those of Escherichia, but some are closest to Methanococcus or to Synechocystis.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The availability of complete genome sequences and mRNA expression data for all genes creates new opportunities and challenges for identifying DNA sequence motifs that control gene expression. An algorithm, “MobyDick,” is presented that decomposes a set of DNA sequences into the most probable dictionary of motifs or words. This method is applicable to any set of DNA sequences: for example, all upstream regions in a genome or all genes expressed under certain conditions. Identification of words is based on a probabilistic segmentation model in which the significance of longer words is deduced from the frequency of shorter ones of various lengths, eliminating the need for a separate set of reference data to define probabilities. We have built a dictionary with 1,200 words for the 6,000 upstream regulatory regions in the yeast genome; the 500 most significant words (some with as few as 10 copies in all of the upstream regions) match 114 of 443 experimentally determined sites (a significance level of 18 standard deviations). When analyzing all of the genes up-regulated during sporulation as a group, we find many motifs in addition to the few previously identified by analyzing the subclusters individually to the expression subclusters. Applying MobyDick to the genes derepressed when the general repressor Tup1 is deleted, we find known as well as putative binding sites for its regulatory partners.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Eukaryotic translation initiation factor 5A (eIF-5A) is a ubiquitous protein found in all eukaryotic cells. The protein is closely associated with cell proliferation in the G1–S stage of the cell cycle. Recent findings show that the eIF-5A proteins are highly expressed in tumor cells and act as a cofactor of the Rev protein in HIV-1-infected cells. The mature eIF is the only protein known to have the unusual amino acid hypusine, a post-translationally modified lysine. The crystal structure of eIF-5A from Methanococcus jannaschii (MJ eIF-5A) has been determined at 1.9 Å and 1.8 Å resolution in two crystal forms by using the multiple isomorphous replacement method and the multiwavelength anomalous diffraction method for the first crystal form and the molecular replacement method for the second crystal form. The structure consists of two folding domains, one of which is similar to the oligonucleotide-binding domain found in the prokaryotic cold shock protein and the translation initiation factor IF1 despite the absence of any significant sequence similarities. The 12 highly conserved amino acid residues found among eIF-5As include the hypusine site and form a long protruding loop at one end of the elongated molecule.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Cloned PCR products containing hepatitis C virus (HCV) genomic fragments have been used for analyses of HCV genomic heterogeneity and protein expression. These studies assume that the clones derived are representative of the entire virus population and that subsets are not inadvertently selected. The aim of the present study was to express HCV structural proteins. However, we found that there was a strong cloning selection for defective genomes and that most clones generated initially were incapable of expressing the HCV proteins. The HCV structural region (C-E1-E2-p7) was directly amplified by long reverse transcription–PCR from the plasma of an HCV-infected patient or from a control plasmid containing a viable full-length cDNA of HCV derived from the same patient but cloned in a different vector. The PCR products were cloned into a mammalian expression vector, amplified in Escherichia coli, and tested for their ability to produce HCV structural proteins. Twenty randomly picked clones derived from the HCV-infected patient all contained nucleotide mutations leading to absence or truncation of the expected HCV products. Of 25 clones derived from the control plasmid, only 8% were fully functional for polyprotein synthesis. The insertion of extra nucleotides in the region just upstream of the start codon of the HCV insert led to a statistically significant increase in the number of fully functional clones derived from the patient (42%) and from the control plasmid (72–92%). Nonrandom selection of clones during the cloning procedure has enormous implications for the study of viral heterogeneity, because it can produce a false spectrum of genomic diversity. It can also be an impediment to the construction of infectious viral clones.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

A plastid-derived signal plays an important role in the coordinated expression of both nuclear- and chloroplast-localized genes that encode photosynthesis-related proteins. Arabidopsis GUN (genomes uncoupled) loci have been identified as components of plastid-to-nucleus signal transduction. Unlike wild-type plants, gun mutants have nuclear Lhcb1 expression in the absence of chloroplast development. We observed a synergistic phenotype in some gun double-mutant combinations, suggesting there are at least two independent pathways in plastid-to-nucleus signal transduction. There is a reduction of chlorophyll accumulation in gun4 and gun5 mutant plants, and a gun4gun5 double mutant shows an albino phenotype. We cloned the GUN5 gene, which encodes the ChlH subunit of Mg-chelatase. We also show that gun2 and gun3 are alleles of the known photomorphogenic mutants, hy1 and hy2, which are required for phytochromobilin synthesis from heme. These findings suggest that certain perturbations of the tetrapyrrole biosynthetic pathway generate a signal from chloroplasts that causes transcriptional repression of nuclear genes encoding plastid-localized proteins. The comparison of mutant phenotypes of gun5 and another Mg-chelatase subunit (ChlI) mutant suggests a specific function for ChlH protein in the plastid-signaling pathway.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Operon structure is an important organization feature of bacterial genomes. Many sets of genes occur in the same order on multiple genomes; these conserved gene groupings represent candidate operons. This study describes a computational method to estimate the likelihood that such conserved gene sets form operons. The method was used to analyze 34 bacterial and archaeal genomes, and yielded more than 7600 pairs of genes that are highly likely (P ≥ 0.98) to belong to the same operon. The sensitivity of our method is 30–50% for the Escherichia coli genome. The predicted gene pairs are available from our World Wide Web site http://www.tigr.org/tigr-scripts/operons/operons.cgi.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Expressed sequence tags (ESTs) are randomly sequenced cDNA clones. Currently, nearly 3 million human and 2 million mouse ESTs provide valuable resources that enable researchers to investigate the products of gene expression. The EST databases have proven to be useful tools for detecting homologous genes, for exon mapping, revealing differential splicing, etc. With the increasing availability of large amounts of poorly characterised eukaryotic (notably human) genomic sequence, ESTs have now become a vital tool for gene identification, sometimes yielding the only unambiguous evidence for the existence of a gene expression product. However, BLAST-based Web servers available to the general user have not kept pace with these developments and do not provide appropriate tools for querying EST databases with large highly spliced genes, often spanning 50 000–100 000 bases or more. Here we describe Gene2EST (http://woody.embl-heidelberg.de/gene2est/), a server that brings together a set of tools enabling efficient retrieval of ESTs matching large DNA queries and their subsequent analysis. RepeatMasker is used to mask dispersed repetitive sequences (such as Alu elements) in the query, BLAST2 for searching EST databases and Artemis for graphical display of the findings. Gene2EST combines these components into a Web resource targeted at the researcher who wishes to study one or a few genes to a high level of detail.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

While genome sequencing projects are advancing rapidly, EST sequencing and analysis remains a primary research tool for the identification and categorization of gene sequences in a wide variety of species and an important resource for annotation of genomic sequence. The TIGR Gene Indices (http://www.tigr.org/tdb/tgi.shtml) are a collection of species-specific databases that use a highly refined protocol to analyze EST sequences in an attempt to identify the genes represented by that data and to provide additional information regarding those genes. Gene Indices are constructed by first clustering, then assembling EST and annotated gene sequences from GenBank for the targeted species. This process produces a set of unique, high-fidelity virtual transcripts, or Tentative Consensus (TC) sequences. The TC sequences can be used to provide putative genes with functional annotation, to link the transcripts to mapping and genomic sequence data, to provide links between orthologous and paralogous genes and as a resource for comparative sequence analysis.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The database of Clusters of Orthologous Groups of proteins (COGs), which represents an attempt on a phylogenetic classification of the proteins encoded in complete genomes, currently consists of 2791 COGs including 45 350 proteins from 30 genomes of bacteria, archaea and the yeast Saccharomyces cerevisiae (http://www.ncbi.nlm.nih.gov/COG). In addition, a supplement to the COGs is available, in which proteins encoded in the genomes of two multicellular eukaryotes, the nematode Caenorhabditis elegans and the fruit fly Drosophila melanogaster, and shared with bacteria and/or archaea were included. The new features added to the COG database include information pages with structural and functional details on each COG and literature references, improvements of the COGNITOR program that is used to fit new proteins into the COGs, and classification of genomes and COGs constructed by using principal component analysis.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

GOLD is a comprehensive resource for accessing information related to completed and ongoing genome projects world-wide. The database currently provides information on 350 genome projects, of which 48 have been completely sequenced and their analysis published. GOLD was created in 1997 and since April 2000 it has been licensed to Integrated Genomics. The database is freely available through the URL: http://igweb.integratedgenomics.com/GOLD/.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Toward the goal of identifying complete sets of transcription factor (TF)-binding sites in the genomes of several gamma proteobacteria, and hence describing their transcription regulatory networks, we present a phylogenetic footprinting method for identifying these sites. Probable transcription regulatory sites upstream of Escherichia coli genes were identified by cross-species comparison using an extended Gibbs sampling algorithm. Close examination of a study set of 184 genes with documented transcription regulatory sites revealed that when orthologous data were available from at least two other gamma proteobacterial species, 81% of our predictions corresponded with the documented sites, and 67% corresponded when data from only one other species were available. That the remaining predictions included bona fide TF-binding sites was proven by affinity purification of a putative transcription factor (YijC) bound to such a site upstream of the fabA gene. Predicted regulatory sites for 2097 E.coli genes are available at http://www.wadsworth.org/resnres/bioinfo/.