926 resultados para Genomic data integration
Resumo:
Nineteen benign [World Health Organization (WHO) grade I; MI], 21 atypical (WHO grade II; MII), and 19 anaplastic (WHO grade III; MIII) sporadic meningiomas were screened for chromosomal imbalances by comparative genomic hybridization (CGH). These data were supplemented by molecular genetic analyses of selected chromosomal regions and genes. With increasing malignancy grade, a marked accumulation of genomic aberrations was observed; i.e., the numbers (mean ± SEM) of total alterations detected per tumor were 2.9 ± 0.7 for MI, 9.2 ± 1.2 for MII, and 13.3 ± 1.9 for MIII. The most frequent alteration detected in MI was loss on 22q (58%). In MII, aberrations most commonly identified were losses on 1p (76%), 22q (71%), 14q (43%), 18q (43%), 10 (38%), and 6q (33%), as well as gains on 20q (48%), 12q (43%), 15q (43%), 1q (33%), 9q (33%), and 17q (33%). In MIII, most of these alterations were found at similar frequencies. However, an increase in losses on 6q (53%), 10 (68%), and 14q (63%) was observed. In addition, 32% of MIII demonstrated loss on 9p. Homozygous deletions in the CDKN2A gene at 9p21 were found in 4 of 16 MIII (25%). Highly amplified DNA sequences were mapped to 12q13–q15 by CGH in 1 MII. Southern blot analysis of this tumor revealed amplification of CDK4 and MDM2. By CGH, DNA sequences from 17q were found to be amplified in 1 MII and 8 MIII, involving 17q23 in all cases. Despite the high frequency of chromosomal aberrations in the MII and MIII investigated, none of these tumors showed mutations in exons 5–8 of the TP53 gene. On the basis of the most common aberrations identified in the various malignancy grades, a model for the genomic alterations associated with meningioma progression is proposed.
Resumo:
The structures of the genes encoding the α1 and β1 subunits of murine soluble guanylyl cyclase (sGC) were determined. Full-length cDNAs isolated from mouse lungs encoding the α1 (2.5 kb) and β1 (3.3 kb) subunits are presented in this report. The α1 sGC gene is approximately 26.4 kb and contains nine exons, whereas the β1 sGC gene spans 22 kb and consists of 14 exons. The positions of exon/intron boundaries and the sizes of introns for both genes are described. Comparison of mouse genomic organization with the Human Genome Database predicted the exon/intron boundaries of the human genes and revealed that human and mouse α1 and β1 sGC genes have similar structures. Both mouse genes are localized on the third chromosome, band 3E3-F1, and are separated by a fragment that is 2% of the chromosomal length. The 5′ untranscribed regions of α1 and β1 subunit genes were subcloned into luciferase reporter constructs, and the functional analysis of promoter activity was performed in murine neuroblastoma N1E-115 cells. Our results indicate that the 5′ untranscribed regions for both genes possess independent promoter activities and, together with the data on chromosomal localization, suggest independent regulation of both genes.
Resumo:
Plant phylogenetic estimates are most likely to be reliable when congruent evidence is obtained independently from the mitochondrial, plastid, and nuclear genomes with all methods of analysis. Here, results are presented from separate and combined genomic analyses of new and previously published data, including six and nine genes (8,911 bp and 12,010 bp, respectively) for different subsets of taxa that suggest Amborella + Nymphaeales (water lilies) are the first-branching angiosperm lineage. Before and after tree-independent noise reduction, most individual genomic compartments and methods of analysis estimated the Amborella + Nymphaeales basal topology with high support. Previous phylogenetic estimates placing Amborella alone as the first extant angiosperm branch may have been misled because of a series of specific problems with paralogy, suboptimal outgroups, long-branch taxa, and method dependence. Ancestral character state reconstructions differ between the two topologies and affect inferences about the features of early angiosperms.
Resumo:
The Mouse Genome Database (MGD) is the community database resource for the laboratory mouse, a key model organism for interpreting the human genome and for understanding human biology and disease (http://www.informatics.jax.org). MGD provides standard nomenclature and consensus map positions for mouse genes and genetic markers; it provides a curated set of mammalian homology records, user-defined chromosomal maps, experimental data sets and the definitive mouse ‘gene to sequence’ reference set for the research community. The integration and standardization of these data sets facilitates the transition between mouse DNA sequence, gene and phenotype annotations. A recent focus on allele and phenotype representations enhances the ability of MGD to organize and present data for exploring the relationship between genotype and phenotype. This link between the genome and the biology of the mouse is especially important as phenotype information grows from large mutagenesis projects and genotype information grows from large-scale sequencing projects.
Resumo:
Upon the completion of the Saccharomyces cerevisiae genomic sequence in 1996 [Goffeau,A. et al. (1997) Nature, 387, 5], several creative and ambitious projects have been initiated to explore the functions of gene products or gene expression on a genome-wide scale. To help researchers take advantage of these projects, the Saccharomyces Genome Database (SGD) has created two new tools, Function Junction and Expression Connection. Together, the tools form a central resource for querying multiple large-scale analysis projects for data about individual genes. Function Junction provides information from diverse projects that shed light on the role a gene product plays in the cell, while Expression Connection delivers information produced by the ever-increasing number of microarray projects. WWW access to SGD is available at genome-www.stanford.edu/Saccharomyces/.
The Zebrafish Information Network (ZFIN): a resource for genetic, genomic and developmental research
Resumo:
The Zebrafish Information Network, ZFIN, is a WWW community resource of zebrafish genetic, genomic and developmental research information (http://zfin.org). ZFIN provides an anatomical atlas and dictionary, developmental staging criteria, research methods, pathology information and a link to the ZFIN relational database (http://zfin.org/ZFIN/). The database, built on a relational, object-oriented model, provides integrated information about mutants, genes, genetic markers, mapping panels, publications and contact information for the zebrafish research community. The database is populated with curated published data, user submitted data and large dataset uploads. A broad range of data types including text, images, graphical representations and genetic maps supports the data. ZFIN incorporates links to other genomic resources that provide sequence and ortholog data. Zebrafish nomenclature guidelines and an automated registration mechanism for new names are provided. Extensive usability testing has resulted in an easy to learn and use forms interface with complex searching capabilities.
Resumo:
The amount of genomic and proteomic data that is entered each day into databases and the experimental literature is outstripping the ability of experimental scientists to keep pace. While generic databases derived from automated curation efforts are useful, most biological scientists tend to focus on a class or family of molecules and their biological impact. Consequently, there is a need for molecular class-specific or other specialized databases. Such databases collect and organize data around a single topic or class of molecules. If curated well, such systems are extremely useful as they allow experimental scientists to obtain a large portion of the available data most relevant to their needs from a single source. We are involved in the development of two such databases with substantial pharmacological relevance. These are the GPCRDB and NucleaRDB information systems, which collect and disseminate data related to G protein-coupled receptors and intra-nuclear hormone receptors, respectively. The GPCRDB was a pilot project aimed at building a generic molecular class-specific database capable of dealing with highly heterogeneous data. A first version of the GPCRDB project has been completed and it is routinely used by thousands of scientists. The NucleaRDB was started recently as an application of the concept for the generalization of this technology. The GPCRDB is available via the WWW at http://www.gpcr.org/7tm/ and the NucleaRDB at http://www.receptors.org/NR/.
Resumo:
The Plasmodium falciparum Genome Database (http://PlasmoDB.org) integrates sequence information, automated analyses and annotation data emerging from the P.falciparum genome sequencing consortium. To date, raw sequence coverage is available for >90% of the genome, and two chromosomes have been finished and annotated. Data in PlasmoDB are organized by chromosome (1–14), and can be accessed using a variety of tools for graphical and text-based browsing or downloaded in various file formats. The GUS (Genomics Unified Schema) implementation of PlasmoDB provides a multi-species genomic relational database, incorporating data from human and mouse, as well as P.falciparum. The relational schema uses a highly structured format to accommodate diverse data sets related to genomic sequence and gene expression. Tools have been designed to facilitate complex biological queries, including many that are specific to Plasmodium parasites and malaria as a disease. Additional projects seek to integrate genomic information with the rich data sets now becoming available for RNA transcription, protein expression, metabolic pathways, genetic and physical mapping, antigenic and population diversity, and phylogenetic relationships with other apicomplexan parasites. The overall goal of PlasmoDB is to facilitate Internet- and CD-ROM-based access to both finished and unfinished sequence information by the global malaria research community.
Resumo:
The Homeodomain Resource is an annotated collection of non-redundant protein sequences, three-dimensional structures and genomic information for the homeodomain protein family. Release 3.0 contains 795 full-length homeodomain-containing sequences, 32 experimentally-derived structures and 143 homeobox loci implicated in human genetic disorders. Entries are fully hyperlinked to facilitate easy retrieval of the original records from source databases. A simple search engine with a graphical user interface is provided to query the component databases and assemble customized data sets. A new feature for this release is the addition of DNA recognition sites for all human homeodomain proteins described in the literature. The Homeodomain Resource is freely available through the World Wide Web at http://genome.nhgri.nih.gov/homeodomain.
Resumo:
Gene expression profiling provides powerful analyses of transcriptional responses to cellular perturbation. In contrast to DNA array-based methods, reporter gene technology has been underused for this application. Here we describe a genomewide, genome-registered collection of Escherichia coli bioluminescent reporter gene fusions. DNA sequences from plasmid-borne, random fusions of E. coli chromosomal DNA to a Photorhabdus luminescens luxCDABE reporter allowed precise mapping of each fusion. The utility of this collection covering about 30% of the transcriptional units was tested by analyzing individual fusions representative of heat shock, SOS, OxyR, SoxRS, and cya/crp stress-responsive regulons. Each fusion strain responded as anticipated to environmental conditions known to activate the corresponding regulatory circuit. Thus, the collection mirrors E. coli's transcriptional wiring diagram. This genomewide collection of gene fusions provides an independent test of results from other gene expression analyses. Accordingly, a DNA microarray-based analysis of mitomycin C-treated E. coli indicated elevated expression of expected and unanticipated genes. Selected luxCDABE fusions corresponding to these up-regulated genes were used to confirm or contradict the DNA microarray results. The power of partnering gene fusion and DNA microarray technology to discover promoters and define operons was demonstrated when data from both suggested that a cluster of 20 genes encoding production of type I extracellular polysaccharide in E. coli form a single operon.
Resumo:
Molecular and morphological data have important roles in illuminating evolutionary history. DNA data often yield well resolved phylogenies for living taxa, but are generally unattainable for fossils. A distinct advantage of morphology is that some types of morphological data may be collected for extinct and extant taxa. Fossils provide a unique window on evolutionary history and may preserve combinations of primitive and derived characters that are not found in extant taxa. Given their unique character complexes, fossils are critical in documenting sequences of character transformation over geologic time and may elucidate otherwise ambiguous patterns of evolution that are not revealed by molecular data alone. Here, we employ a methodological approach that allows for the integration of molecular and paleontological data in deciphering one of the most innovative features in the evolutionary history of mammals—laryngeal echolocation in bats. Molecular data alone, including an expanded data set that includes new sequences for the A2AB gene, suggest that microbats are paraphyletic but do not resolve whether laryngeal echolocation evolved independently in different microbat lineages or evolved in the common ancestor of bats and was subsequently lost in megabats. When scaffolds from molecular phylogenies are incorporated into parsimony analyses of morphological characters, including morphological characters for the Eocene taxa Icaronycteris, Archaeonycteris, Hassianycteris, and Palaeochiropteryx, the resulting trees suggest that laryngeal echolocation evolved in the common ancestor of fossil and extant bats and was subsequently lost in megabats. Molecular dating suggests that crown-group bats last shared a common ancestor 52 to 54 million years ago.
Resumo:
Microsatellites are tandem repeat sequences abundant in the genomes of higher eukaryotes and hitherto considered as "junk DNA." Analysis of a human genome representative data base (2.84 Mb) reveals a distinct juxtaposition of A-rich microsatellites and retroposons and suggests their coevolution. The analysis implies that most microsatellites were generated by a 3'-extension of retrotranscripts, similar to mRNA polyadenylylation, and that they serve in turn as "retroposition navigators," directing the retroposons via homology-driven integration into defined sites. Thus, they became instrumental in the preservation and extension of primordial genomic patterns. A role is assigned to these reiterating A-rich loci in the higher-order organization of the chromatin. The disease-associated triplet repeats are mostly found in coding regions and do not show an association with retroposons, constituting a unique set within the family of microsatellite sequences.
Resumo:
Genomic mapping has been used to identify a region of the host genome that determines resistance to fusiform rust disease in loblolly pine where no discrete, simply inherited resistance factors had been previously found by conventional genetic analysis over four decades. A resistance locus, behaving as a single dominant gene, was mapped by association with genetic markers, even though the disease phenotype deviated from the expected Mendelian ratio. The complexity of forest pathosystems and the limitations of genetic analysis, based solely on phenotype, had led to an assumption that effective long-term disease resistance in trees should be polygenic. However, our data show that effective long-term resistance can be obtained from a single qualitative resistance gene, despite the presence of virulence in the pathogen population. Therefore, disease resistance in this endemic coevolved forest pathosystem is not exclusively polygenic. Genomic mapping now provides a powerful tool for characterizing the genetic basis of host pathogen interactions in forest trees and other undomesticated, organisms, where conventional genetic analysis often is limited or not feasible.
Resumo:
Arabidopsis thaliana mutants originally isolated as hypersensitive to irradiation were screened for the ability to be transformed by Agrobacterium transferred DNA (T-DNA). One of four UV-hypersensitive mutants and one of two gamma-hypersensitive mutants tested showed a significant reduction in the frequency of stable transformants compared with radioresistant controls. In a transient assay for T-DNA transfer independent of genomic integration, both mutant lines took up and expressed T-DNA as efficiently as parental lines. These lines are therefore deficient specifically in stable T-DNA integration and thus provide direct evidence for the role of a plant function in that process. As radiation hypersensitivity suggests a deficiency in repair of DNA damage, that plant function may be one that is also involved in DNA repair, possibly, from other evidence, in repair of double-strand DNA breaks.
Resumo:
The mechanisms involved in the integration of proteins into the thylakoid membrane are largely unknown. However, many of the steps of this process for the light-harvesting chlorophyll a/b protein (LHCP) have been described and reconstituted in vitro. LHCP is synthesized as a precursor in the cytosol and posttranslationally imported into chloroplasts. Upon translocation across the envelope membranes, the N-terminal transit peptide is cleaved, and the apoprotein is assembled into a soluble "transit complex" and then integrated into the thylakoid membrane via three transmembrane helices. Here we show that 54CP, a chloroplast homologue of the 54-kDa subunit of the mammalian signal recognition particle (SRP54), is essential for transit complex formation, is present in the complex, and is required for LHCP integration into the thylakoid membrane. Our data indicate that 54CP functions posttranslationally as a molecular chaperone and potentially pilots LHCP to the thylakoids. These results demonstrate that one of several pathways for protein routing to the thylakoids is homologous to the SRP pathway and point to a common evolutionary origin for the protein transport systems of the endoplasmic reticulum and the thylakoid membrane.