927 resultados para genome rearrangement
Resumo:
The ARKdb genome databases provide comprehensive public repositories for genome mapping data from farmed species and other animals (http://www.thearkdb.org) providing a resource similar in function to that offered by GDB or MGD for human or mouse genome mapping data, respectively. Because we have attempted to build a generic mapping database, the system has wide utility, particularly for those species for which development of a specific resource would be prohibitive. The ARKdb genome database model has been implemented for 10 species to date. These are pig, chicken, sheep, cattle, horse, deer, tilapia, cat, turkey and salmon. Access to the ARKdb databases is effected via the World Wide Web using the ARKdb browser and Anubis map viewer. The information stored includes details of loci, maps, experimental methods and the source references. Links to other information sources such as PubMed and EMBL/GenBank are provided. Responsibility for data entry and curation is shared amongst scientists active in genome research in the species of interest. Mirror sites in the United States are maintained in addition to the central genome server at Roslin.
Resumo:
GOBASE (http://megasun.bch.umontreal.ca/gobase/) is a network-accessible biological database, which is unique in bringing together diverse biological data on organelles with taxonomically broad coverage, and in furnishing data that have been exhaustively verified and completed by experts. So far, we have focused on mitochondrial data: GOBASE contains all published nucleotide and protein sequences encoded by mitochondrial genomes, selected RNA secondary structures of mitochondria-encoded molecules, genetic maps of completely sequenced genomes, taxonomic information for all species whose sequences are present in the database and organismal descriptions of key protistan eukaryotes. All of these data have been integrated and organized in a formal database structure to allow sophisticated biological queries using terms that are inherent in biological concepts. Most importantly, data have been validated, completed, corrected and standardized, a prerequisite of meaningful analysis. In addition, where critical data are lacking, such as genetic maps and RNA secondary structures, they are generated by the GOBASE team and collaborators, and added to the database. The database is implemented in a relational database management system, but features an object-oriented view of the biological data through a Web/Genera-generated World Wide Web interface. Finally, we have developed software for database curation (i.e. data updates, validation and correction), which will be described in some detail in this paper.
Resumo:
VIDA is a new virus database that organizes open reading frames (ORFs) from partial and complete genomic sequences from animal viruses. Currently VIDA includes all sequences from GenBank for Herpesviridae, Coronaviridae and Arteriviridae. The ORFs are organized into homologous protein families, which are identified on the basis of sequence similarity relationships. Conserved sequence regions of potential functional importance are identified and can be retrieved as sequence alignments. We use a controlled taxonomical and functional classification for all the proteins and protein families in the database. When available, protein structures that are related to the families have also been included. The database is available for online search and sequence information retrieval at http://www.biochem.ucl.ac.uk/bsm/virus_database/VIDA.html.
Resumo:
The Medicago Genome Initiative (MGI) is a database of EST sequences of the model legume Medicago truncatula. The database is available to the public and has resulted from a collaborative research effort between the Samuel Roberts Noble Foundation and the National Center for Genome Resources to investigate the genome of M.truncatula. MGI is part of the greater integrated Medicago functional genomics program at the Noble Foundation (http://www.noble .org), which is taking a global approach in studying the genetic and biochemical events associated with the growth, development and environmental interactions of this model legume. Our approach will include: large-scale EST sequencing, gene expression profiling, the generation of M.truncatula activation-tagged and promoter trap insertion mutants, high-throughput metabolic profiling, and proteome studies. These multidisciplinary information pools will be interfaced with one another to provide scientists with an integrated, holistic set of tools to address fundamental questions pertaining to legume biology. The public interface to the MGI database can be accessed at http://www.ncgr.org/research/mgi.
Resumo:
The Plasmodium falciparum Genome Database (http://PlasmoDB.org) integrates sequence information, automated analyses and annotation data emerging from the P.falciparum genome sequencing consortium. To date, raw sequence coverage is available for >90% of the genome, and two chromosomes have been finished and annotated. Data in PlasmoDB are organized by chromosome (1–14), and can be accessed using a variety of tools for graphical and text-based browsing or downloaded in various file formats. The GUS (Genomics Unified Schema) implementation of PlasmoDB provides a multi-species genomic relational database, incorporating data from human and mouse, as well as P.falciparum. The relational schema uses a highly structured format to accommodate diverse data sets related to genomic sequence and gene expression. Tools have been designed to facilitate complex biological queries, including many that are specific to Plasmodium parasites and malaria as a disease. Additional projects seek to integrate genomic information with the rich data sets now becoming available for RNA transcription, protein expression, metabolic pathways, genetic and physical mapping, antigenic and population diversity, and phylogenetic relationships with other apicomplexan parasites. The overall goal of PlasmoDB is to facilitate Internet- and CD-ROM-based access to both finished and unfinished sequence information by the global malaria research community.
Resumo:
GOLD is a comprehensive resource for accessing information related to completed and ongoing genome projects world-wide. The database currently provides information on 350 genome projects, of which 48 have been completely sequenced and their analysis published. GOLD was created in 1997 and since April 2000 it has been licensed to Integrated Genomics. The database is freely available through the URL: http://igweb.integratedgenomics.com/GOLD/.
Resumo:
The abundant chromosome abnormalities in most carcinomas are probably a reflection of genomic instability present in the tumor, so the pattern and variability of chromosome abnormalities will reflect the mechanism of instability combined with the effects of selection. Chromosome rearrangement was investigated in 17 colorectal carcinoma-derived cell lines. Comparative genomic hybridization showed that the chromosome changes were representative of those found in primary tumors. Spectral karyotyping (SKY) showed that translocations were very varied and mostly unbalanced, with no translocation occurring in more than three lines. At least three karyotype patterns could be distinguished. Some lines had few chromosome abnormalities: they all showed microsatellite instability, the replication error (RER)+ phenotype. Most lines had many chromosome abnormalities: at least seven showed a surprisingly consistent pattern, characterized by multiple unbalanced translocations and intermetaphase variation, with chromosome numbers around triploid, 6–16 structural aberrations, and similarities in gains and losses. Almost all of these were RER−, but one, LS411, was RER+. The line HCA7 showed a novel pattern, suggesting a third kind of genomic instability: multiple reciprocal translocations, with little numerical change or variability. This line was also RER+. The coexistence in one tumor of two kinds of genomic instability is to be expected if the underlying defects are selected for in tumor evolution.
Resumo:
Viruses with RNA genomes often capture and redirect host cell components to assist in mechanisms particular to RNA-dependent RNA synthesis. The nidoviruses are an order of positive-stranded RNA viruses, comprising coronaviruses and arteriviruses, that employ a unique strategy of discontinuous transcription, producing a series of subgenomic mRNAs linking a 5′ leader to distal portions of the genome. For the prototype coronavirus mouse hepatitis virus (MHV), heterogeneous nuclear ribonucleoprotein (hnRNP) A1 has been shown to be able to bind in vitro to the negative strand of the intergenic sequence, a cis-acting element found in the leader RNA and preceding each downstream ORF in the genome. hnRNP A1 thus has been proposed as a host factor in MHV transcription. To test this hypothesis genetically, we initially constructed MHV mutants with a very high-affinity hnRNP A1 binding site inserted in place of, or adjacent to, an intergenic sequence in the MHV genome. This inserted hnRNP A1 binding site was not able to functionally replace, or enhance transcription from, the intergenic sequence. This finding led us to test more directly the role of hnRNP A1 by analysis of MHV replication and RNA synthesis in a murine cell line that does not express this protein. The cellular absence of hnRNP A1 had no detectable effect on the production of infectious virus, the synthesis of genomic RNA, or the quantity or quality of subgenomic mRNAs. These results strongly suggest that hnRNP A1 is not a required host factor for MHV discontinuous transcription or genome replication.
Resumo:
Pseudogenes are non-functioning copies of genes in genomic DNA, which may either result from reverse transcription from an mRNA transcript (processed pseudogenes) or from gene duplication and subsequent disablement (non-processed pseudogenes). As pseudogenes are apparently ‘dead’, they usually have a variety of obvious disablements (e.g., insertions, deletions, frameshifts and truncations) relative to their functioning homologs. We have derived an initial estimate of the size, distribution and characteristics of the pseudogene population in the Caenorhabditis elegans genome, performing a survey in ‘molecular archaeology’. Corresponding to the 18 576 annotated proteins in the worm (i.e., in Wormpep18), we have found an estimated total of 2168 pseudogenes, about one for every eight genes. Few of these appear to be processed. Details of our pseudogene assignments are available from http://bioinfo.mbb.yale.edu/genome/worm/pseudogene. The population of pseudogenes differs significantly from that of genes in a number of respects: (i) pseudogenes are distributed unevenly across the genome relative to genes, with a disproportionate number on chromosome IV; (ii) the density of pseudogenes is higher on the arms of the chromosomes; (iii) the amino acid composition of pseudogenes is midway between that of genes and (translations of) random intergenic DNA, with enrichment of Phe, Ile, Leu and Lys, and depletion of Asp, Ala, Glu and Gly relative to the worm proteome; and (iv) the most common protein folds and families differ somewhat between genes and pseudogenes—whereas the most common fold found in the worm proteome is the immunoglobulin fold and the most common ‘pseudofold’ is the C-type lectin. In addition, the size of a gene family bears little overall relationship to the size of its corresponding pseudogene complement, indicating a highly dynamic genome. There are in fact a number of families associated with large populations of pseudogenes. For example, one family of seven-transmembrane receptors (represented by gene B0334.7) has one pseudogene for every four genes, and another uncharacterized family (represented by gene B0403.1) is approximately two-thirds pseudogenic. Furthermore, over a hundred apparent pseudogenic fragments do not have any obvious homologs in the worm.
Resumo:
Mitochondrial dysfunction can lead to diverse cellular and organismal responses. We used DNA microarrays to characterize the transcriptional responses to different mitochondrial perturbations in Saccharomyces cerevisiae. We examined respiratory-deficient petite cells and respiratory-competent wild-type cells treated with the inhibitors of oxidative phosphorylation antimycin, carbonyl cyanide m-chlorophenylhydrazone, or oligomycin. We show that respiratory deficiency, but not inhibition of mitochondrial ATP synthesis per se, induces a suite of genes associated with both peroxisomal activities and metabolite-restoration (anaplerotic) pathways that would mitigate the loss of a complete tricarboxylic acid cycle. The array data suggested, and direct microscopic observation of cells expressing a derivative of green fluorescent protein with a peroxisomal matrix-targeting signal confirmed, that respiratory deficiency dramatically induces peroxisome biogenesis. Transcript profiling of cells harboring null alleles of RTG1, RTG2, or RTG3, genes known to control signaling from mitochondria to the nucleus, suggests that there are multiple pathways of cross-talk between these organelles in yeast.
Resumo:
We used genome-wide expression analysis to explore how gene expression in Saccharomyces cerevisiae is remodeled in response to various changes in extracellular environment, including changes in temperature, oxidation, nutrients, pH, and osmolarity. The results demonstrate that more than half of the genome is involved in various responses to environmental change and identify the global set of genes induced and repressed by each condition. These data implicate a substantial number of previously uncharacterized genes in these responses and reveal a signature common to environmental responses that involves ∼10% of yeast genes. The results of expression analysis with MSN2/MSN4 mutants support the model that the Msn2/Msn4 activators induce the common response to environmental change. These results provide a global description of the transcriptional response to environmental change and extend our understanding of the role of activators in effecting this response.
Resumo:
We report here that the DNA-dependent protein kinase (DNA-PK) affects the molecular fate of the recombinant adeno-associated virus (rAAV) genome in skeletal muscle. rAAV-human α1-antitrypsin (rAAV-hAAT) vectors were delivered by intramuscular injection to either C57BL/6 (DNA-PKcs+) or C57BL/6-SCID [severe combined immunodeficient (SCID), DNA-PKcs−] mice. In both strains, high levels of transgene expression were sustained for up to 1 year after a single injection. Southern blot analysis showed that rAAV genomes persisted as linear episomes for more than 1 year in SCID mice, whereas only circular episomal forms were observed in the C57BL/6 strain. These results indicate that DNA-PK is involved in the formation of circular rAAV episomes.
Resumo:
Reovirus genome segment S1 encodes protein σ1, which is the receptor binding protein, modulates tissue tropism, and specifies the nature of the antiviral immune response. It makes up less than 2% of reovirus particles and is synthesized in very small amounts in infected cells. Any antiviral strategy aimed at reducing specifically the expression of this genome segment should, in principle, reduce the infectivity of the virus. To test this hypothesis, we have assembled two hammer-head motif-containing ribozymes (Rzs) targeted to cleave at the conserved B and C domains of the reovirus s1 RNA. Protein-independent but Mg2+-dependent sequence-specific cleavage of s1 RNA was achieved by both the Rzs in trans. Cells that transiently express these Rzs, when challenged with reovirus, were protected against the cytopathic effects caused by the virus. This protection correlated with the specific intracellular reduction of s1 transcripts that was due to their cleavage by the Rzs. Rz-treated cells that were challenged with reovirus showed almost complete disappearance of protein σ1 without significantly altering the levels of the other reovirus structural proteins. Thus, Rzs, besides acting as antiviral agents, could be exploited as biological tools to delineate specific functions of target genes.
Resumo:
Ehrlichiae are responsible for important tick-transmitted diseases, including anaplasmosis, the most prevalent tick-borne infection of livestock worldwide, and the emerging human diseases monocytic and granulocytic ehrlichiosis. Antigenic variation of major surface proteins is a key feature of these pathogens that allows persistence in the mammalian host, a requisite for subsequent tick transmission. In Anaplasma marginale pseudogenes for two antigenically variable gene families, msp2 and msp3, appear in concert. These pseudogenes can be recombined into the functional expression site to generate new antigenic variants. Coordinated control of the recombination of these genes would allow these two gene families to act synergistically to evade the host immune response.
Resumo:
The complete genome sequence of Caulobacter crescentus was determined to be 4,016,942 base pairs in a single circular chromosome encoding 3,767 genes. This organism, which grows in a dilute aquatic environment, coordinates the cell division cycle and multiple cell differentiation events. With the annotated genome sequence, a full description of the genetic network that controls bacterial differentiation, cell growth, and cell cycle progression is within reach. Two-component signal transduction proteins are known to play a significant role in cell cycle progression. Genome analysis revealed that the C. crescentus genome encodes a significantly higher number of these signaling proteins (105) than any bacterial genome sequenced thus far. Another regulatory mechanism involved in cell cycle progression is DNA methylation. The occurrence of the recognition sequence for an essential DNA methylating enzyme that is required for cell cycle regulation is severely limited and shows a bias to intergenic regions. The genome contains multiple clusters of genes encoding proteins essential for survival in a nutrient poor habitat. Included are those involved in chemotaxis, outer membrane channel function, degradation of aromatic ring compounds, and the breakdown of plant-derived carbon sources, in addition to many extracytoplasmic function sigma factors, providing the organism with the ability to respond to a wide range of environmental fluctuations. C. crescentus is, to our knowledge, the first free-living α-class proteobacterium to be sequenced and will serve as a foundation for exploring the biology of this group of bacteria, which includes the obligate endosymbiont and human pathogen Rickettsia prowazekii, the plant pathogen Agrobacterium tumefaciens, and the bovine and human pathogen Brucella abortus.