943 resultados para GENOMIC ANCESTRY


Relevância:

20.00% 20.00%

Publicador:

Resumo:

Transcriptional regulation has been studied intensively in recent decades. One important aspect of this regulation is the interaction between regulatory proteins, such as transcription factors (TF) and nucleosomes, and the genome. Different high-throughput techniques have been invented to map these interactions genome-wide, including ChIP-based methods (ChIP-chip, ChIP-seq, etc.), nuclease digestion methods (DNase-seq, MNase-seq, etc.), and others. However, a single experimental technique often only provides partial and noisy information about the whole picture of protein-DNA interactions. Therefore, the overarching goal of this dissertation is to provide computational developments for jointly modeling different experimental datasets to achieve a holistic inference on the protein-DNA interaction landscape.

We first present a computational framework that can incorporate the protein binding information in MNase-seq data into a thermodynamic model of protein-DNA interaction. We use a correlation-based objective function to model the MNase-seq data and a Markov chain Monte Carlo method to maximize the function. Our results show that the inferred protein-DNA interaction landscape is concordant with the MNase-seq data and provides a mechanistic explanation for the experimentally collected MNase-seq fragments. Our framework is flexible and can easily incorporate other data sources. To demonstrate this flexibility, we use prior distributions to integrate experimentally measured protein concentrations.

We also study the ability of DNase-seq data to position nucleosomes. Traditionally, DNase-seq has only been widely used to identify DNase hypersensitive sites, which tend to be open chromatin regulatory regions devoid of nucleosomes. We reveal for the first time that DNase-seq datasets also contain substantial information about nucleosome translational positioning, and that existing DNase-seq data can be used to infer nucleosome positions with high accuracy. We develop a Bayes-factor-based nucleosome scoring method to position nucleosomes using DNase-seq data. Our approach utilizes several effective strategies to extract nucleosome positioning signals from the noisy DNase-seq data, including jointly modeling data points across the nucleosome body and explicitly modeling the quadratic and oscillatory DNase I digestion pattern on nucleosomes. We show that our DNase-seq-based nucleosome map is highly consistent with previous high-resolution maps. We also show that the oscillatory DNase I digestion pattern is useful in revealing the nucleosome rotational context around TF binding sites.

Finally, we present a state-space model (SSM) for jointly modeling different kinds of genomic data to provide an accurate view of the protein-DNA interaction landscape. We also provide an efficient expectation-maximization algorithm to learn model parameters from data. We first show in simulation studies that the SSM can effectively recover underlying true protein binding configurations. We then apply the SSM to model real genomic data (both DNase-seq and MNase-seq data). Through incrementally increasing the types of genomic data in the SSM, we show that different data types can contribute complementary information for the inference of protein binding landscape and that the most accurate inference comes from modeling all available datasets.

This dissertation provides a foundation for future research by taking a step toward the genome-wide inference of protein-DNA interaction landscape through data integration.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Copyright © Taylor & Francis Group, LLC 2015.Type 2 diabetes is a major health burden in the United States, and population trends suggest this burden will increase. High interest in, and increased availability of, testing for genetic risk of type 2 diabetes presents a new opportunity for reducing type 2 diabetes risk for many patients; however, to date, there is little evidence that genetic testing positively affects type 2 diabetes prevention. Genetic information may not fit patients illness representations, which may reduce the chances of risk-reducing behavior changes. The present study aimed to examine illness representations in a clinical sample who are at risk for type 2 diabetes and interested in genetic testing. The authors used the Common Sense Model to analyze survey responses of 409 patients with type 2 diabetes risk factors. Patients were interested in genetic testing for type 2 diabetes risk and believed in its importance. Most patients believed that genetic factors are important to developing type 2 diabetes (67%), that diet and exercise are effective in preventing type 2 diabetes (95%), and that lifestyle changes are more effective than drugs (86%). Belief in genetic causality was not related to poorer self-reported health behaviors. These results suggest that patients interest in genetic testing for type 2 diabetes might produce a teachable moment that clinicians can use to counsel behavior change.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

BACKGROUND: Patients, clinicians, researchers and payers are seeking to understand the value of using genomic information (as reflected by genotyping, sequencing, family history or other data) to inform clinical decision-making. However, challenges exist to widespread clinical implementation of genomic medicine, a prerequisite for developing evidence of its real-world utility. METHODS: To address these challenges, the National Institutes of Health-funded IGNITE (Implementing GeNomics In pracTicE; www.ignite-genomics.org ) Network, comprised of six projects and a coordinating center, was established in 2013 to support the development, investigation and dissemination of genomic medicine practice models that seamlessly integrate genomic data into the electronic health record and that deploy tools for point of care decision making. IGNITE site projects are aligned in their purpose of testing these models, but individual projects vary in scope and design, including exploring genetic markers for disease risk prediction and prevention, developing tools for using family history data, incorporating pharmacogenomic data into clinical care, refining disease diagnosis using sequence-based mutation discovery, and creating novel educational approaches. RESULTS: This paper describes the IGNITE Network and member projects, including network structure, collaborative initiatives, clinical decision support strategies, methods for return of genomic test results, and educational initiatives for patients and providers. Clinical and outcomes data from individual sites and network-wide projects are anticipated to begin being published over the next few years. CONCLUSIONS: The IGNITE Network is an innovative series of projects and pilot demonstrations aiming to enhance translation of validated actionable genomic information into clinical settings and develop and use measures of outcome in response to genome-based clinical interventions using a pragmatic framework to provide early data and proofs of concept on the utility of these interventions. Through these efforts and collaboration with other stakeholders, IGNITE is poised to have a significant impact on the acceleration of genomic information into medical practice.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

info:eu-repo/semantics/published

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The E1AF protein belongs to the family of Ets transcription factors and is involved in the regulation of metastasis gene expression. It has recently been reported in an undifferentiated child sarcoma that part of this gene could be fused by translocation to the ews gene. We show here that the human e1af gene, which is located in the q21 region of chromosome 17, is organized in 13 exons distributed along 19 kb of genomic DNA. Its two main functional domains, the acidic domain and the DNA-binding ETS domain, are each encoded by three different exons. The 3'-untranslated region of e1af is 0.7 kb. The 5'-untranslated region is about 0.3 kb and is composed of a first exon upstream from the exon containing the first methionine. These data could possibly accelerate an understanding of the molecular basis of putative inherited diseases linked to E1AF. (C) 1999 Elsevier Science B.V. All rights reserved.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Coccolithoviruses are giant dsDNA viruses that infect Emiliania huxleyi, the most ubiquitous marine microalga. Here, we present the genome of the latest coccolithovirus strain to be sequenced, EhV-99B1, and compare it with two other coccolithovirus genomes (EhV-86 and EhV-163). EhV-99B1 shares a pairwise nucleotide identity of 98% with EhV-163 (the two strains were isolated from the same Norwegian fjord but in different years), and just 96.5% with EhV-86 (isolated in the same spring as EhV-99B1 but in the English Channel). We confirmed and extended the list of relevant genomic differences between these EhVs from the Norwegian fjord and EhVs from the English Channel, namely the removal/insertions of: a phosphate permease, an endonuclease, a transposase, and two specific tRNAs. As a whole, this study provided new clues and insights into the diversity and mechanisms driving the evolution of these large oceanic viruses, in particular those processes involving selfish genetic elements.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The Ov/Br septin gene, which is also a fusion partner of MLL in acute myeloid leukaemia, is a member of a family of novel GTP binding proteins that have been implicated in cytokinesis and exocytosis. In this study, we describe the genomic and transcriptional organization of this gene, detailing seventeen exons distributed over 240 kb of sequence. Extensive database analyses identified orthologous rodent cDNAs that corresponded to new, unidentified 5' splice variants of the Ov/Br septin gene, increasing the total number of such variants to six. We report that splicing events, occurring at non-canonical sites within the body of the 3' terminal exon, remove either 1801 bp or 1849 bp of non-coding sequence and facilitate access to a secondary open reading frame of 44 amino acids maintained near the end of the 3' UTR. These events constitute a novel coding arrangement and represent the first report of such a design being implemented by a eukaryotic gene. The various Ov/Br proteins either differ minimally at their amino and carboxy termini or are equivalent to truncated versions of larger isoforms. Northern analysis with an Ov/Br septin 3' UTR probe reveals three transcripts of 4.4, 4 and 3 kb, the latter being restricted to a sub-set of the tissues tested. Investigation of the identified Ov/Br septin isoforms by RT-PCR confirms a complex transcriptional pattern, with several isoforms showing tissue-specific distribution. To date, none of the other human septins have demonstrated such transcriptional complexity.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Phylloxin is a novel prototype antimicrobial peptide from the skin of Phyllomedusa bicolor. Here, we describe parallel identification and sequencing of phylloxin precursor transcript (mRNA) and partial gene structure (genomic DNA) from the same sample of lyophilized skin secretion using our recently-described cloning technique. The open-reading frame of the phylloxin precursor was identical in nucleotide sequence to that previously reported and alignment with the nucleotide sequence derived from genomic DNA indicated the presence of a 175 bp intron located in a near identical position to that found in the dermaseptins. The highly-conserved structural organization of skin secretion peptide genes in P. bicolor can thus be extended to include that encoding phylloxin (plx). These data further reinforce our assertion that application of the described methodology can provide robust genomic/transcriptomic/peptidomic data without the need for specimen sacrifice.