909 resultados para Genetic population data
Resumo:
Background - Recent studies have implicated variants of the transcription factor 7-like 2 (TCF7L2) gene in genetic susceptibility to type 2 diabetes mellitus in several different populations. The aim of this study was to determine whether variants of this gene are also risk factors for type 2 diabetes development in a UK-resident South Asian cohort of Punjabi ancestry. Methods - We genotyped four single nucleotide polymorphisms (SNPs) of TCF7L2 (rs7901695, rs7903146, rs11196205 and rs12255372) in 831 subjects with diabetes and 437 control subjects. Results - The minor allele of each variant was significantly associated with type 2 diabetes; the greatest risk of developing the disease was conferred by rs7903146, with an allelic odds ratio (OR) of 1.31 (95% CI: 1.11 – 1.56, p = 1.96 × 10-3). For each variant, disease risk associated with homozygosity for the minor allele was greater than that for heterozygotes, with the exception of rs12255372. To determine the effect on the observed associations of including young control subjects in our data set, we reanalysed the data using subsets of the control group defined by different minimum age thresholds. Increasing the minimum age of our control subjects resulted in a corresponding increase in OR for all variants of the gene (p ≤ 1.04 × 10-7). Conclusion - Our results support recent findings that TCF7L2 is an important genetic risk factor for the development of type 2 diabetes in multiple ethnic groups.
Resumo:
In the global strategy for preservation genetic resources of farm animals the implementation of information technology is of great importance. In this regards platform independent information tools and approaches for data exchange are needed in order to obtain aggregate values for regions and countries of spreading a separate breed. The current paper presents a XML based solution for data exchange in management genetic resources of farm animals’ small populations. There are specific requirements to the exchanged documents that come from the goal of data analysis. Three main types of documents are distinguished and their XML formats are discussed. DTD and XML Schema for each type are suggested. Some examples of XML documents are given also.
Resumo:
2000 Mathematics Subject Classification: 62H12, 62P99
Resumo:
Examining complete gene knockouts within a viable organism can inform on gene function. We sequenced the exomes of 3222 British Pakistani-heritage adults with high parental relatedness, discovering 1111 rare-variant homozygous genotypes with predicted loss of gene function (knockouts) in 781 genes. We observed 13.7% fewer than expected homozygous knockout genotypes, implying an average load of 1.6 recessive-lethal-equivalent LOF variants per adult. Linking genetic data to lifelong health records, knockouts were not associated with clinical consultation or prescription rate. In this dataset we identified a healthy PRDM9 knockout mother, and performed phased genome sequencing on her, her child and controls, which showed meiotic recombination sites localized away from PRDM9-dependent hotspots. Thus, natural LOF variants inform upon essential genetic loci, and demonstrate PRDM9 redundancy in humans.
Resumo:
The investigations of human mitochondrial DNA (mtDNA) have considerably contributed to human evolution and migration. The Middle East is considered to be an essential geographic area for human migrations out of Africa since it is located at the crossroads of Africa, and the rest of the world. United Arab Emirates (UAE) population inhabits the eastern part of Arabian Peninsula and was investigated in this study. Published data of 18 populations were included in the statistical analysis. The diversity indices showed (1) high genetic distance among African populations and (2) high genetic distance between African populations and non-African populations. Asian populations clustered together in the NJ tree between the African and European populations. MtDNA haplotypes database of the UAE population was generated. By incorporating UAE mtDNA dataset into the existing worldwide mtDNA database, UAE Forensic Laboratories will be able to analyze future mtDNA evidence in a more significant and consistent manner. ^
Resumo:
Genetic diversity can be used to describe patterns of gene flow within and between local and regional populations. The Florida Everglades experiences seasonal fluctuations in water level that can influence local population extinction and recolonization dynamics. In addition, this expansive wetland has been divided into water management regions by canals and levees. These combined factors can affect genetic diversity and population structure of aquatic organisms in the Everglades. We analyzed allelic variation at six DNA microsatellite loci to examine the population structure of spotted sunfish (Lepomis punctatus) from the Everglades. We tested the hypothesis that recurrent local extinction and recent regional divisions have had an effect on patterns of genetic diversity. No marked differences were observed in comparisons of the heterozygosity values of sites within and among water management units. No evidence of isolation by distance was detected in a gene flow and distance correlation between subpopulations. Confidence intervals for the estimated F-statistic values crossed zero, indicating that there was no significant genetic difference between subpopulations within a region or between regions. Notably, the genetic variation among subpopulations in a water conservation area was greater than variation among regions (Fsp>FPT). These data indicate that the spatial scale of recolonization following local extinction appears to be most important within water management units.
Resumo:
We analyzed the effect of periodic drying in the Florida Everglades on spatiotemporal population genetic structure of eastern mosquitofish (Gambusia holbrooki). Severe periodic drying events force individuals from disparate sources to mix in dry season relatively deep-water refuges. In 1996 (a wet year) and 1999 (a dry year), we sampled mosquitofish at 20 dry-season refuges distributed in 3 water management regions and characterized genetic variation for 10 allozyme and 3 microsatellite loci. In 1996, most of the ecosystem did not dry, whereas in 1999, many of our sampling locations were isolated by expanses of dried marsh surface. In 1996, most spatial genetic variation was attributed to heterogeneity within regions. In 1999, spatial genetic variation within regions was not significant. In both years, a small but significant amount of variation (less than 1% of the total variation) was partitioned among regions. Variance was consistently greater than zero among long-hydroperiod sites within a region, but not among short-hydroperiod sites within a region, where hydroperiod was measured as time since last marsh surface dry-down forcing fishes into local refuges. In 1996, all sites were in Hardy–Weinberg equilibrium. In 1999, we observed fewer heterozygotes than expected for most loci and sites suggesting a Wahlund effect arising from fish leaving areas that dried and mixing in deep-water refuges.
Resumo:
The primary goal of this dissertation is the study of patterns of viral evolution inferred from serially-sampled sequence data, i.e., sequence data obtained from strains isolated at consecutive time points from a single patient or host. RNA viral populations have an extremely high genetic variability, largely due to their astronomical population sizes within host systems, high replication rate, and short generation time. It is this aspect of their evolution that demands special attention and a different approach when studying the evolutionary relationships of serially-sampled sequence data. New methods that analyze serially-sampled data were developed shortly after a groundbreaking HIV-1 study of several patients from which viruses were isolated at recurring intervals over a period of 10 or more years. These methods assume a tree-like evolutionary model, while many RNA viruses have the capacity to exchange genetic material with one another using a process called recombination. ^ A genealogy involving recombination is best described by a network structure. A more general approach was implemented in a new computational tool, Sliding MinPD, one that is mindful of the sampling times of the input sequences and that reconstructs the viral evolutionary relationships in the form of a network structure with implicit representations of recombination events. The underlying network organization reveals unique patterns of viral evolution and could help explain the emergence of disease-associated mutants and drug-resistant strains, with implications for patient prognosis and treatment strategies. In order to comprehensively test the developed methods and to carry out comparison studies with other methods, synthetic data sets are critical. Therefore, appropriate sequence generators were also developed to simulate the evolution of serially-sampled recombinant viruses, new and more through evaluation criteria for recombination detection methods were established, and three major comparison studies were performed. The newly developed tools were also applied to "real" HIV-1 sequence data and it was shown that the results represented within an evolutionary network structure can be interpreted in biologically meaningful ways. ^
Resumo:
Lognormal distribution has abundant applications in various fields. In literature, most inferences on the two parameters of the lognormal distribution are based on Type-I censored sample data. However, exact measurements are not always attainable especially when the observation is below or above the detection limits, and only the numbers of measurements falling into predetermined intervals can be recorded instead. This is the so-called grouped data. In this paper, we will show the existence and uniqueness of the maximum likelihood estimators of the two parameters of the underlying lognormal distribution with Type-I censored data and grouped data. The proof was first established under the case of normal distribution and extended to the lognormal distribution through invariance property. The results are applied to estimate the median and mean of the lognormal population.
Resumo:
The exponential growth of studies on the biological response to ocean acidification over the last few decades has generated a large amount of data. To facilitate data comparison, a data compilation hosted at the data publisher PANGAEA was initiated in 2008 and is updated on a regular basis (doi:10.1594/PANGAEA.149999). By January 2015, a total of 581 data sets (over 4 000 000 data points) from 539 papers had been archived. Here we present the developments of this data compilation five years since its first description by Nisumaa et al. (2010). Most of study sites from which data archived are still in the Northern Hemisphere and the number of archived data from studies from the Southern Hemisphere and polar oceans are still relatively low. Data from 60 studies that investigated the response of a mix of organisms or natural communities were all added after 2010, indicating a welcomed shift from the study of individual organisms to communities and ecosystems. The initial imbalance of considerably more data archived on calcification and primary production than on other processes has improved. There is also a clear tendency towards more data archived from multifactorial studies after 2010. For easier and more effective access to ocean acidification data, the ocean acidification community is strongly encouraged to contribute to the data archiving effort, and help develop standard vocabularies describing the variables and define best practices for archiving ocean acidification data.
Resumo:
Copyright © 2015 Royal College of Surgeons of Edinburgh (Scottish charity number SC005317) and Royal College of Surgeons in Ireland. Published by Elsevier Ltd. All rights reserved. Acknowledgements We would like to thank the Scottish Intensive Care Society Audit Group (SICSAG) for providing the data for this study. Mr Jan Jansen is in receipt of an NHS Research Scotland fellowship which includes salary funding.
Resumo:
Copyright © 2015 Royal College of Surgeons of Edinburgh (Scottish charity number SC005317) and Royal College of Surgeons in Ireland. Published by Elsevier Ltd. All rights reserved. Acknowledgements We would like to thank the Scottish Intensive Care Society Audit Group (SICSAG) for providing the data for this study. Mr Jan Jansen is in receipt of an NHS Research Scotland fellowship which includes salary funding.
Resumo:
Funded by UK Government's Overseas Territories Environmental Programme (OTEP)
Resumo:
© The Author 2016. Published by Oxford University Press on behalf of the European Society of Human Reproduction and Embryology. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Resumo:
The advent of next-generation sequencing, now nearing a decade in age, has enabled, among other capabilities, measurement of genome-wide sequence features at unprecedented scale and resolution.
In this dissertation, I describe work to understand the genetic underpinnings of non-Hodgkin’s lymphoma through exploration of the epigenetics of its cell of origin, initial characterization and interpretation of driver mutations, and finally, a larger-scale, population-level study that incorporates mutation interpretation with clinical outcome.
In the first research chapter, I describe genomic characteristics of lymphomas through the lens of their cells of origin. Just as many other cancers, such as breast cancer or lung cancer, are categorized based on their cell of origin, lymphoma subtypes can be examined through the context of their normal B Cells of origin, Naïve, Germinal Center, and post-Germinal Center. By applying integrative analysis of the epigenetics of normal B Cells of origin through chromatin-immunoprecipitation sequencing, we find that differences in normal B Cell subtypes are reflected in the mutational landscapes of the cancers that arise from them, namely Mantle Cell, Burkitt, and Diffuse Large B-Cell Lymphoma.
In the next research chapter, I describe our first endeavor into understanding the genetic heterogeneity of Diffuse Large B Cell Lymphoma, the most common form of non-Hodgkin’s lymphoma, which affects 100,000 patients in the world. Through whole-genome sequencing of 1 case as well as whole-exome sequencing of 94 cases, we characterize the most recurrent genetic features of DLBCL and lay the groundwork for a larger study.
In the last research chapter, I describe work to characterize and interpret the whole exomes of 1001 cases of DLBCL in the largest single-cancer study to date. This highly-powered study enabled sub-gene, gene-level, and gene-network level understanding of driver mutations within DLBCL. Moreover, matched genomic and clinical data enabled the connection of these driver mutations to clinical features such as treatment response or overall survival. As sequencing costs continue to drop, whole-exome sequencing will become a routine clinical assay, and another diagnostic dimension in addition to existing methods such as histology. However, to unlock the full utility of sequencing data, we must be able to interpret it. This study undertakes a first step in developing the understanding necessary to uncover the genomic signals of DLBCL hidden within its exomes. However, beyond the scope of this one disease, the experimental and analytical methods can be readily applied to other cancer sequencing studies.
Thus, this dissertation leverages next-generation sequencing analysis to understand the genetic underpinnings of lymphoma, both by examining its normal cells of origin as well as through a large-scale study to sensitively identify recurrently mutated genes and their relationship to clinical outcome.