Biblioteca Digital

20 resultados para Genomic data integration

em National Center for Biotechnology Information - NCBI

Transgene organization in rice engineered through direct DNA transfer supports a two-phase integration mechanism mediated by the establishment of integration hot spots

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Organization of transgenes in rice transformed through direct DNA transfer strongly suggests a two-phase integration mechanism. In the “preintegration” phase, transforming plasmid molecules (either intact or partial) are spliced together. This gives rise to rearranged transgenic sequences, which upon integration do not contain any interspersed plant genomic sequences. Subsequently, integration of transgenic DNA into the host genome is initiated. Our experiments suggest that the original site of integration acts as a hot spot, facilitating subsequent integration of successive transgenic molecules at the same locus. The resulting transgenic locus may have plant DNA separating the transgenic sequences. Our data indicate that transformation through direct DNA transfer, specifically particle bombardment, generally results in a single transgenic locus as a result of this two-phase integration mechanism. Transgenic plants generated through such processes may, therefore, be more amenable to breeding programs as the single transgenic locus will be easier to characterize genetically. Results from direct DNA transfer experiments suggest that in the absence of protein factors involved in exogenous DNA transfer through Agrobacterium, the qualitative and/or quantitative efficiency of transformation events is not compromised. Our results cast doubt on the role of Agrobacterium vir genes in the integration process.

Genomic evolution during a 10,000-generation experiment with bacteria

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Molecular methods are used widely to measure genetic diversity within populations and determine relationships among species. However, it is difficult to observe genomic evolution in action because these dynamics are too slow in most organisms. To overcome this limitation, we sampled genomes from populations of Escherichia coli evolving in the laboratory for 10,000 generations. We analyzed the genomes for restriction fragment length polymorphisms (RFLP) using seven insertion sequences (IS) as probes; most polymorphisms detected by this approach reflect rearrangements (including transpositions) rather than point mutations. The evolving genomes became increasingly different from their ancestor over time. Moreover, tremendous diversity accumulated within each population, such that almost every individual had a different genetic fingerprint after 10,000 generations. As has been often suggested, but not previously shown by experiment, the rates of phenotypic and genomic change were discordant, both across replicate populations and over time within a population. Certain pivotal mutations were shared by all descendants in a population, and these are candidates for beneficial mutations, which are rare and difficult to find. More generally, these data show that the genome is highly dynamic even over a time scale that is, from an evolutionary perspective, very brief.

Integration of multiple instructive cues by neural crest stem cells reveals cell-intrinsic biases in relative growth factor responsiveness

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Growth factors can influence lineage determination of neural crest stem cells (NCSCs) in an instructive manner, in vitro. Because NCSCs are likely exposed to multiple signals in vivo, these findings raise the question of how stem cells would integrate such combined influences. Bone morphogenetic protein 2 (BMP2) promotes neuronal differentiation and glial growth factor 2 (GGF2) promotes glial differentiation; if NCSCs are exposed to saturating concentrations of both factors, BMP2 appears dominant. By contrast, if the cells are exposed to saturating concentrations of both BMP2 and transforming growth factor β1 (which promotes smooth muscle differentiation), the two factors appear codominant. Sequential addition experiments indicate that NCSCs require 48–96 hrs in GGF2 before they commit to a glial fate, whereas the cells commit to a smooth muscle fate within 24 hr in transforming growth factor β1. The delayed response to GGF2 does not reflect a lack of functional receptors; however, because the growth factor induces rapid mitogen-activated protein kinase phosphorylation in naive cells. Furthermore, GGF2 can attenuate induction of the neurogenic transcription factor mammalian achaete-scute homolog 1, by low doses of BMP2. This short-term antineurogenic influence of GGF2 is not sufficient for glial lineage commitment, however. These data imply that NCSCs exhibit cell-intrinsic biases in the timing and relative dosage sensitivity of their responses to instructive factors that influence the outcome of lineage decisions in the presence of multiple factors. The relative delay in glial lineage commitment, moreover, apparently reflects successive short-term and longer-term actions of GGF2. Such a delay may help to explain why glia normally differentiate after neurons, in vivo.

Genomic analysis of human and mouse TCL1 loci reveals a complex of tightly clustered genes

Relevância:

30.00% 30.00%

Publicador:

Resumo:

TCL1 and TCL1b genes on human chromosome 14q23.1 are activated in T cell leukemias by translocations and inversions at 14q32.1, juxtaposing them to regulatory elements of T cell receptor genes. In this report we present the cloning, mapping, and expression analysis of the human and murine TCL1/Tcl1 locus. In addition to TCL1 and TCL1b, the human locus contains two additional genes, TCL1-neighboring genes (TNG) 1 and 2, encoding proteins of 141 and 110 aa, respectively. Both genes show no homology to any known genes, but their expression profiles are very similar to those of TCL1 and TCL1b. TNG1 and TNG2 also are activated in T cell leukemias with rearrangements at 14q32.1. To aid in the development of a mouse model we also have characterized the murine Tcl1 locus and found five genes homologous to human TCL1b. Tcl1b1–Tcl1b5 proteins range from 117 to 123 aa and are 65–80% similar, but they show only a 30–40% similarity to human TCL1b. All five mouse Tcl1b and murine Tcl1 mRNAs are abundant in mouse oocytes and two-cell embryos but rare in various adult tissues and lymphoid cell lines. These data suggest a similar or complementary function of these proteins in early embryogenesis.

Genomic and cDNA sequence analysis of the cell matrix adhesion regulator gene

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The cell matrix adhesion regulator (CMAR) gene has been suggested to be a signal transduction molecule influencing cell adhesion to collagen and, through this, possibly involved in tumor suppression. The originally reported CMAR cDNA was 464 bp long with a tyrosine phosphorylation site at the extreme 3′ end, which mutagenesis studies had shown to be central to the function of this gene. Since the discovery of a 4-bp insertion polymorphism within the originally reported coding region, further sequence information has been obtained. The cDNA has been extended 5′ by ≈2 kb revealing a 559-bp region showing strong homology to the proposed 5′ untranslated sequence of a murine protein kinase receptor family member, variant in kinase (vik). CMAR genomic sequencing has shown the presence of an intron, the intron/exon boundary lying within this region of homology. An RNA transcript for CMAR of ≈2.5 kb has also been identified. The data suggest complex mechanisms for control of expression of two closely associated genes, CMAR and the vik- associated sequence.

Analysis of genomic alterations in benign, atypical, and anaplastic meningiomas: Toward a genetic model of meningioma progression

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Nineteen benign [World Health Organization (WHO) grade I; MI], 21 atypical (WHO grade II; MII), and 19 anaplastic (WHO grade III; MIII) sporadic meningiomas were screened for chromosomal imbalances by comparative genomic hybridization (CGH). These data were supplemented by molecular genetic analyses of selected chromosomal regions and genes. With increasing malignancy grade, a marked accumulation of genomic aberrations was observed; i.e., the numbers (mean ± SEM) of total alterations detected per tumor were 2.9 ± 0.7 for MI, 9.2 ± 1.2 for MII, and 13.3 ± 1.9 for MIII. The most frequent alteration detected in MI was loss on 22q (58%). In MII, aberrations most commonly identified were losses on 1p (76%), 22q (71%), 14q (43%), 18q (43%), 10 (38%), and 6q (33%), as well as gains on 20q (48%), 12q (43%), 15q (43%), 1q (33%), 9q (33%), and 17q (33%). In MIII, most of these alterations were found at similar frequencies. However, an increase in losses on 6q (53%), 10 (68%), and 14q (63%) was observed. In addition, 32% of MIII demonstrated loss on 9p. Homozygous deletions in the CDKN2A gene at 9p21 were found in 4 of 16 MIII (25%). Highly amplified DNA sequences were mapped to 12q13–q15 by CGH in 1 MII. Southern blot analysis of this tumor revealed amplification of CDK4 and MDM2. By CGH, DNA sequences from 17q were found to be amplified in 1 MII and 8 MIII, involving 17q23 in all cases. Despite the high frequency of chromosomal aberrations in the MII and MIII investigated, none of these tumors showed mutations in exons 5–8 of the TP53 gene. On the basis of the most common aberrations identified in the various malignancy grades, a model for the genomic alterations associated with meningioma progression is proposed.

Genomic organization of α1 and β1 subunits of the mammalian soluble guanylyl cyclase genes

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The structures of the genes encoding the α1 and β1 subunits of murine soluble guanylyl cyclase (sGC) were determined. Full-length cDNAs isolated from mouse lungs encoding the α1 (2.5 kb) and β1 (3.3 kb) subunits are presented in this report. The α1 sGC gene is approximately 26.4 kb and contains nine exons, whereas the β1 sGC gene spans 22 kb and consists of 14 exons. The positions of exon/intron boundaries and the sizes of introns for both genes are described. Comparison of mouse genomic organization with the Human Genome Database predicted the exon/intron boundaries of the human genes and revealed that human and mouse α1 and β1 sGC genes have similar structures. Both mouse genes are localized on the third chromosome, band 3E3-F1, and are separated by a fragment that is 2% of the chromosomal length. The 5′ untranscribed regions of α1 and β1 subunit genes were subcloned into luciferase reporter constructs, and the functional analysis of promoter activity was performed in murine neuroblastoma N1E-115 cells. Our results indicate that the 5′ untranscribed regions for both genes possess independent promoter activities and, together with the data on chromosomal localization, suggest independent regulation of both genes.

Independent and combined analyses of sequences from all three genomic compartments converge on the root of flowering plant phylogeny

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Plant phylogenetic estimates are most likely to be reliable when congruent evidence is obtained independently from the mitochondrial, plastid, and nuclear genomes with all methods of analysis. Here, results are presented from separate and combined genomic analyses of new and previously published data, including six and nine genes (8,911 bp and 12,010 bp, respectively) for different subsets of taxa that suggest Amborella + Nymphaeales (water lilies) are the first-branching angiosperm lineage. Before and after tree-independent noise reduction, most individual genomic compartments and methods of analysis estimated the Amborella + Nymphaeales basal topology with high support. Previous phylogenetic estimates placing Amborella alone as the first extant angiosperm branch may have been misled because of a series of specific problems with paralogy, suboptimal outgroups, long-branch taxa, and method dependence. Ancestral character state reconstructions differ between the two topologies and affect inferences about the features of early angiosperms.

The Mouse Genome Database (MGD): integration nexus for the laboratory mouse

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The Mouse Genome Database (MGD) is the community database resource for the laboratory mouse, a key model organism for interpreting the human genome and for understanding human biology and disease (http://www.informatics.jax.org). MGD provides standard nomenclature and consensus map positions for mouse genes and genetic markers; it provides a curated set of mammalian homology records, user-defined chromosomal maps, experimental data sets and the definitive mouse ‘gene to sequence’ reference set for the research community. The integration and standardization of these data sets facilitates the transition between mouse DNA sequence, gene and phenotype annotations. A recent focus on allele and phenotype representations enhances the ability of MGD to organize and present data for exploring the relationship between genotype and phenotype. This link between the genome and the biology of the mouse is especially important as phenotype information grows from large mutagenesis projects and genotype information grows from large-scale sequencing projects.

Saccharomyces Genome Database provides tools to survey gene expression and functional analysis data

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Upon the completion of the Saccharomyces cerevisiae genomic sequence in 1996 [Goffeau,A. et al. (1997) Nature, 387, 5], several creative and ambitious projects have been initiated to explore the functions of gene products or gene expression on a genome-wide scale. To help researchers take advantage of these projects, the Saccharomyces Genome Database (SGD) has created two new tools, Function Junction and Expression Connection. Together, the tools form a central resource for querying multiple large-scale analysis projects for data about individual genes. Function Junction provides information from diverse projects that shed light on the role a gene product plays in the cell, while Expression Connection delivers information produced by the ever-increasing number of microarray projects. WWW access to SGD is available at genome-www.stanford.edu/Saccharomyces/.

The Zebrafish Information Network (ZFIN): a resource for genetic, genomic and developmental research

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The Zebrafish Information Network, ZFIN, is a WWW community resource of zebrafish genetic, genomic and developmental research information (http://zfin.org). ZFIN provides an anatomical atlas and dictionary, developmental staging criteria, research methods, pathology information and a link to the ZFIN relational database (http://zfin.org/ZFIN/). The database, built on a relational, object-oriented model, provides integrated information about mutants, genes, genetic markers, mapping panels, publications and contact information for the zebrafish research community. The database is populated with curated published data, user submitted data and large dataset uploads. A broad range of data types including text, images, graphical representations and genetic maps supports the data. ZFIN incorporates links to other genomic resources that provide sequence and ortholog data. Zebrafish nomenclature guidelines and an automated registration mechanism for new names are provided. Extensive usability testing has resulted in an easy to learn and use forms interface with complex searching capabilities.

Collecting and harvesting biological data: the GPCRDB and NucleaRDB information systems

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The amount of genomic and proteomic data that is entered each day into databases and the experimental literature is outstripping the ability of experimental scientists to keep pace. While generic databases derived from automated curation efforts are useful, most biological scientists tend to focus on a class or family of molecules and their biological impact. Consequently, there is a need for molecular class-specific or other specialized databases. Such databases collect and organize data around a single topic or class of molecules. If curated well, such systems are extremely useful as they allow experimental scientists to obtain a large portion of the available data most relevant to their needs from a single source. We are involved in the development of two such databases with substantial pharmacological relevance. These are the GPCRDB and NucleaRDB information systems, which collect and disseminate data related to G protein-coupled receptors and intra-nuclear hormone receptors, respectively. The GPCRDB was a pilot project aimed at building a generic molecular class-specific database capable of dealing with highly heterogeneous data. A first version of the GPCRDB project has been completed and it is routinely used by thousands of scientists. The NucleaRDB was started recently as an application of the concept for the generalization of this technology. The GPCRDB is available via the WWW at http://www.gpcr.org/7tm/ and the NucleaRDB at http://www.receptors.org/NR/.

PlasmoDB: An integrative database of the Plasmodium falciparum genome. Tools for accessing and analyzing finished and unfinished sequence data

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The Plasmodium falciparum Genome Database (http://PlasmoDB.org) integrates sequence information, automated analyses and annotation data emerging from the P.falciparum genome sequencing consortium. To date, raw sequence coverage is available for >90% of the genome, and two chromosomes have been finished and annotated. Data in PlasmoDB are organized by chromosome (1–14), and can be accessed using a variety of tools for graphical and text-based browsing or downloaded in various file formats. The GUS (Genomics Unified Schema) implementation of PlasmoDB provides a multi-species genomic relational database, incorporating data from human and mouse, as well as P.falciparum. The relational schema uses a highly structured format to accommodate diverse data sets related to genomic sequence and gene expression. Tools have been designed to facilitate complex biological queries, including many that are specific to Plasmodium parasites and malaria as a disease. Additional projects seek to integrate genomic information with the rich data sets now becoming available for RNA transcription, protein expression, metabolic pathways, genetic and physical mapping, antigenic and population diversity, and phylogenetic relationships with other apicomplexan parasites. The overall goal of PlasmoDB is to facilitate Internet- and CD-ROM-based access to both finished and unfinished sequence information by the global malaria research community.

The Homeodomain Resource: sequences, structures, DNA binding sites and genomic information

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The Homeodomain Resource is an annotated collection of non-redundant protein sequences, three-dimensional structures and genomic information for the homeodomain protein family. Release 3.0 contains 795 full-length homeodomain-containing sequences, 32 experimentally-derived structures and 143 homeobox loci implicated in human genetic disorders. Entries are fully hyperlinked to facilitate easy retrieval of the original records from source databases. A simple search engine with a graphical user interface is provided to query the component databases and assemble customized data sets. A new feature for this release is the addition of DNA recognition sites for all human homeodomain proteins described in the literature. The Homeodomain Resource is freely available through the World Wide Web at http://genome.nhgri.nih.gov/homeodomain.

A genomic approach to gene fusion technology

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Gene expression profiling provides powerful analyses of transcriptional responses to cellular perturbation. In contrast to DNA array-based methods, reporter gene technology has been underused for this application. Here we describe a genomewide, genome-registered collection of Escherichia coli bioluminescent reporter gene fusions. DNA sequences from plasmid-borne, random fusions of E. coli chromosomal DNA to a Photorhabdus luminescens luxCDABE reporter allowed precise mapping of each fusion. The utility of this collection covering about 30% of the transcriptional units was tested by analyzing individual fusions representative of heat shock, SOS, OxyR, SoxRS, and cya/crp stress-responsive regulons. Each fusion strain responded as anticipated to environmental conditions known to activate the corresponding regulatory circuit. Thus, the collection mirrors E. coli's transcriptional wiring diagram. This genomewide collection of gene fusions provides an independent test of results from other gene expression analyses. Accordingly, a DNA microarray-based analysis of mitomycin C-treated E. coli indicated elevated expression of expected and unanticipated genes. Selected luxCDABE fusions corresponding to these up-regulated genes were used to confirm or contradict the DNA microarray results. The power of partnering gene fusion and DNA microarray technology to discover promoters and define operons was demonstrated when data from both suggested that a cluster of 20 genes encoding production of type I extracellular polysaccharide in E. coli form a single operon.

«
1
2
»