941 resultados para Genome scans
Resumo:
The identification of signatures of natural selection in genomic surveys has become an area of intense research, stimulated by the increasing ease with which genetic markers can be typed. Loci identified as subject to selection may be functionally important, and hence (weak) candidates for involvement in disease causation. They can also be useful in determining the adaptive differentiation of populations, and exploring hypotheses about speciation. Adaptive differentiation has traditionally been identified from differences in allele frequencies among different populations, summarised by an estimate of F-ST. Low outliers relative to an appropriate neutral population-genetics model indicate loci subject to balancing selection, whereas high outliers suggest adaptive (directional) selection. However, the problem of identifying statistically significant departures from neutrality is complicated by confounding effects on the distribution of F-ST estimates, and current methods have not yet been tested in large-scale simulation experiments. Here, we simulate data from a structured population at many unlinked, diallelic loci that are predominantly neutral but with some loci subject to adaptive or balancing selection. We develop a hierarchical-Bayesian method, implemented via Markov chain Monte Carlo (MCMC), and assess its performance in distinguishing the loci simulated under selection from the neutral loci. We also compare this performance with that of a frequentist method, based on moment-based estimates of F-ST. We find that both methods can identify loci subject to adaptive selection when the selection coefficient is at least five times the migration rate. Neither method could reliably distinguish loci under balancing selection in our simulations, even when the selection coefficient is twenty times the migration rate.
Resumo:
The genetic etiology of stroke likely reflects the influence of multiple loci with small effects, each modulating different pathophysiological processes. This research project utilized three analytical strategies to address the paucity of information related to the identification and characterization of genetic variation associated with stroke in the general population. ^ First, the general contribution of familial factors to stroke susceptibility was evaluated in a population-based sample of unrelated individuals. Increased risk of subclinical cerebral infarction was observed among individuals with a positive parental history of stroke. This association did not appear to be mediated by established stroke risk factors, specifically blood pressure levels or hypertension status. ^ The need to identify specific gene variation associated with stroke in the general population was addressed by evaluating seven candidate gene polymorphisms in a population-based sample of unrelated individuals. Three polymorphisms were significantly associated with increased subclinical cerebral infarction or incident clinical ischemic stroke risk. These relationships include the G-protein β3 subunit 825C/T polymorphism and clinical stroke in Whites, the lipoprotein lipase S/X447 polymorphism and subclinical and clinical stroke in men, and the angiotensin I-converting enzyme Ins/Del polymorphism and subclinical stroke in White men. These associations did not appear to be obfuscated by the stroke risk factors adjusted for in the analysis models specifically blood pressure levels or anti-hypertensive medication use. ^ The final research strategy considered, on a genome-wide scale, the idea that genetic variation may contribute to the occurrence of hypertension or stroke through a common etiologic pathway. Genomic regions were identified for which significant evidence of heterogeneity was observed among hypertensive sibpairs stratified by family history of stroke information. Regions identified on chromosome 15 in African Americans, and chromosome 13 in Whites and African Americans, suggest the presence of genes influencing hypertension and stroke susceptibility. ^ Insight into the role of genetics in stroke is useful for the potential early identification of individuals at increased risk for stroke and improved understanding of the etiology of the disease. The ultimate goal of these endeavors is to guide the development of therapeutic intervention and informed prevention to provide a lasting and positive impact on public health. ^
Resumo:
Objective. Ankylosing spondylitis (AS) is a debilitating chronic inflammatory condition with a high degree of familiality (λs=82) and heritability (>90%) that primarily affects spinal and sacroiliac joints. Whole genome scans for linkage to AS phenotypes have been conducted, although results have been inconsistent between studies and all have had modest sample sizes. One potential solution to these issues is to combine data from multiple studies in a retrospective meta-analysis. Methods: The International Genetics of Ankylosing Spondylitis Consortium combined data from three whole genome linkage scans for AS (n=3744 subjects) to determine chromosomal markers that show evidence of linkage with disease. Linkage markers typed in different centres were integrated into a consensus map to facilitate effective data pooling. We performed a weighted meta-analysis to combine the linkage results, and compared them with the three individual scans and a combined pooled scan. Results: In addition to the expected region surrounding the HLA-B27 gene on chromosome 6, we determined that several marker regions showed significant evidence of linkage with disease status. Regions on chromosome 10q and 16q achieved 'suggestive' evidence of linkage, and regions on chromosomes 1q, 3q, 5q, 6q, 9q, 17q and 19q showed at least nominal linkage in two or more scans and in the weighted meta-analysis. Regions previously associated with AS on chromosome 2q (the IL-1 gene cluster) and 22q (CYP2D6) exhibited nominal linkage in the meta-analysis, providing further statistical support for their involvement in susceptibility to AS. Conclusion: These findings provide a useful guide for future studies aiming to identify the genes involved in this highly heritable condition. . Published by on behalf of the British Society for Rheumatology.
Resumo:
From our linkage study of Irish families with a high density of schizophrenia, we have previously reported evidence for susceptibility genes in regions 5q21-31, 6p24-21, 8p22-21, and 10p15-p11. In this report, we describe the cumulative results from independent genome scans of three a priori random subsets of 90 families each, and from multipoint analysis of all 270 families in ten regions. Of these ten regions, three (13q32, 18p11-q11, and 18q22-23) did not generate scores above the empirical baseline pairwise scan results, and one (6q13-26) generated a weak signal. Six other regions produced more positive pairwise and multipoint results. They showed the following maximum multipoint H-LOD (heterogeneity LOD) and NPL scores: 2p14-13: 0.89 (P = 0.06) and 2.08 (P = 0.02), 4q24-32: 1.84 (P = 0.007) and 1.67 (P = 0.03), 5q21-31: 2.88 (P= 0.0007), and 2.65 (P = 0.002), 6p25-24: 2.13 (P = 0.005) and 3.59 (P = 0.0005), 6p23: 2.42 (P = 0.001) and 3.07 (P = 0.001), 8p22-21: 1.57 (P = 0.01) and 2.56 (P = 0.005), 10p15-11: 2.04 (P = 0.005) and 1.78 (P = 0.03). The degree of 'internal replication' across subsets differed, with 5q, 6p, and 8p being most consistent and 2p and 10p being least consistent. On 6p, the data suggested the presence of two susceptibility genes, in 6p25-24 and 6p23-22. Very few families were positive on more than one region, and little correlation between regions was evident, suggesting substantial locus heterogeneity. The levels of statistical significance were modest, as expected from loci contributing to complex traits. However, our internal replications, when considered along with the positive results obtained in multiple other samples, suggests that most of these six regions are likely to contain genes that influence liability to schizophrenia.
Resumo:
Schizophrenia is a common disorder with high heritability and a 10-fold increase in risk to siblings of probands. Replication has been inconsistent for reports of significant genetic linkage. To assess evidence for linkage across studies, rank-based genome scan meta-analysis (GSMA) was applied to data from 20 schizophrenia genome scans. Each marker for each scan was assigned to 1 of 120 30-cM bins, with the bins ranked by linkage scores (1 = most significant) and the ranks averaged across studies (R(avg)) and then weighted for sample size (N(sqrt)[affected casess]). A permutation test was used to compute the probability of observing, by chance, each bin's average rank (P(AvgRnk)) or of observing it for a bin with the same place (first, second, etc.) in the order of average ranks in each permutation (P(ord)). The GSMA produced significant genomewide evidence for linkage on chromosome 2q (PAvgRnk
Resumo:
Schizophrenia is a common disorder with high heritability and a 10-fold increase in risk to siblings of probands. Replication has been inconsistent for reports of significant genetic linkage. To assess evidence for linkage across studies, rank-based genome scan meta-analysis (GSMA) was applied to data from 20 schizophrenia genome scans. Each marker for each scan was assigned to 1 of 120 30-cM bins, with the bins ranked by linkage scores (1 = most significant) and the ranks averaged across studies (R-avg) and then weighted for sample size (rootN[affected cases]). A permutation test was used to compute the probability of observing, by chance, each bin's average rank (P-AvgRnk) or of observing it for a bin with the same place (first, second, etc.) in the order of average ranks in each permutation (P-ord). The GSMA produced significant genomewide evidence for linkage on chromosome 2q (P-AvgRnk
Resumo:
The study of continuously varying, quantitative traits is important in evolutionary biology, agriculture, and medicine. Variation in such traits is attributable to many, possibly interacting, genes whose expression may be sensitive to the environment, which makes their dissection into underlying causative factors difficult. An important population parameter for quantitative traits is heritability, the proportion of total variance that is due to genetic factors. Response to artificial and natural selection and the degree of resemblance between relatives are all a function of this parameter. Following the classic paper by R. A. Fisher in 1918, the estimation of additive and dominance genetic variance and heritability in populations is based upon the expected proportion of genes shared between different types of relatives, and explicit, often controversial and untestable models of genetic and non-genetic causes of family resemblance. With genome-wide coverage of genetic markers it is now possible to estimate such parameters solely within families using the actual degree of identity-by-descent sharing between relatives. Using genome scans on 4,401 quasi-independent sib pairs of which 3,375 pairs had phenotypes, we estimated the heritability of height from empirical genome-wide identity-by-descent sharing, which varied from 0.374 to 0.617 (mean 0.498, standard deviation 0.036). The variance in identity-by-descent sharing per chromosome and per genome was consistent with theory. The maximum likelihood estimate of the heritability for height was 0.80 with no evidence for non-genetic causes of sib resemblance, consistent with results from independent twin and family studies but using an entirely separate source of information. Our application shows that it is feasible to estimate genetic variance solely from within- family segregation and provides an independent validation of previously untestable assumptions. Given sufficient data, our new paradigm will allow the estimation of genetic variation for disease susceptibility and quantitative traits that is free from confounding with non-genetic factors and will allow partitioning of genetic variation into additive and non-additive components.
Resumo:
Genome-wide association studies show strong evidence of association with endometriosis for markers on chromosome 1p36 spanning the potential candidate genes WNT4, CDC42 and LINC00339. WNT4 is involved in development of the uterus, and the expression of CDC42 and LINC00339 are altered in women with endometriosis. We conducted fine mapping to examine the role of coding variants in WNT4 and CDC42 and determine the key SNPs with strongest evidence of association in this region. We identified rare coding variants in WNT4 and CDC42 present only in endometriosis cases. The frequencies were low and cannot account for the common signal associated with increased risk of endometriosis. Genotypes for five common SNPs in the region of chromosome 1p36 show stronger association signals when compared with rs7521902 reported in published genome scans. Of these, three SNPs rs12404660, rs3820282, and rs55938609 were located in DNA sequences with potential functional roles including overlap with transcription factor binding sites for FOXA1, FOXA2, ESR1, and ESR2. Functional studies will be required to identify the gene or genes implicated in endometriosis risk.
Resumo:
Knowing the chromosomal areas or actual genes affecting the traits under selection would add more information to be used in the selection decisions which would potentially lead to higher genetic response. The first objective of this study was to map quantitative trait loci (QTL) affecting economically important traits in the Finnish Ayrshire population. The second objective was to investigate the effects of using QTL information in marker-assisted selection (MAS) on the genetic response and the linkage disequilibrium between the different parts of the genome. Whole genome scans were carried out on a grand-daughter design with 12 half-sib families and a total of 493 sons. Twelve different traits were studied: milk yield, protein yield, protein content, fat yield, fat content, somatic cell score (SCS), mastitis treatments, other veterinary treatments, days open, fertility treatments, non-return rate, and calf mortality. The average spacing of the typed markers was 20 cM with 2 to 14 markers per chromosome. Associations between markers and traits were analyzed with multiple marker regression. Significance was determined by permutation and genome-wise P-values obtained by Bonferroni correction. The benefits from MAS were investigated by simulation: a conventional progeny testing scheme was compared to a scheme where QTL information was used within families to select among full-sibs in the male path. Two QTL on different chromosomes were modelled. The effects of different starting frequencies of the favourable alleles and different size of the QTL effects were evaluated. A large number of QTL, 48 in total, were detected at 5% or higher chromosome-wise significance. QTL for milk production were found on 8 chromosomes, for SCS on 6, for mastitis treatments on 1, for other veterinary treatments on 5, for days open on 7, for fertility treatments on 7, for calf mortality on 6, and for non-return rate on 2 chromosomes. In the simulation study the total genetic response was faster with MAS than with conventional selection and the advantage of MAS persisted over the studied generations. The rate of response and the difference between the selection schemes reflected clearly the changes in allele frequencies of the favourable QTL. The disequilibrium between the polygenes and QTL was always negative and it was larger with larger QTL size. The disequilibrium between the two QTL was larger with QTL of large effect and it was somewhat larger with MAS for scenarios with starting frequencies below 0.5 for QTL of moderate size and below 0.3 for large QTL. In conclusion, several QTL affecting economically important traits of dairy cattle were detected. Further studies are needed to verify these QTL, check their presence in the present breeding population, look for pleiotropy and fine map the most interesting QTL regions. The results of the simulation studies show that using MAS together with embryo transfer to pre-select young bulls within families is a useful approach to increase the genetic merit of the AI-bulls compared to conventional selection.
Resumo:
Background:Bacterial non-coding small RNAs (sRNAs) have attracted considerable attention due to their ubiquitous nature and contribution to numerous cellular processes including survival, adaptation and pathogenesis. Existing computational approaches for identifying bacterial sRNAs demonstrate varying levels of success and there remains considerable room for improvement. Methodology/Principal Findings: Here we have proposed a transcriptional signal-based computational method to identify intergenic sRNA transcriptional units (TUs) in completely sequenced bacterial genomes. Our sRNAscanner tool uses position weight matrices derived from experimentally defined E. coli K-12 MG1655 sRNA promoter and rho-independent terminator signals to identify intergenic sRNA TUs through sliding window based genome scans. Analysis of genomes representative of twelve species suggested that sRNAscanner demonstrated equivalent sensitivity to sRNAPredict2, the best performing bioinformatics tool available presently. However, each algorithm yielded substantial numbers of known and uncharacterized hits that were unique to one or the other tool only. sRNAscanner identified 118 novel putative intergenic sRNA genes in Salmonella enterica Typhimurium LT2, none of which were flagged by sRNAPredict2. Candidate sRNA locations were compared with available deep sequencing libraries derived from Hfq-co-immunoprecipitated RNA purified from a second Typhimurium strain (Sittka et al. (2008) PLoS Genetics 4: e1000163). Sixteen potential novel sRNAs computationally predicted and detected in deep sequencing libraries were selected for experimental validation by Northern analysis using total RNA isolated from bacteria grown under eleven different growth conditions. RNA bands of expected sizes were detected in Northern blots for six of the examined candidates. Furthermore, the 5'-ends of these six Northern-supported sRNA candidates were successfully mapped using 5'-RACE analysis. Conclusions/Significance: We have developed, computationally examined and experimentally validated the sRNAscanner algorithm. Data derived from this study has successfully identified six novel S. Typhimurium sRNA genes. In addition, the computational specificity analysis we have undertaken suggests that similar to 40% of sRNAscanner hits with high cumulative sum of scores represent genuine, undiscovered sRNA genes. Collectively, these data strongly support the utility of sRNAscanner and offer a glimpse of its potential to reveal large numbers of sRNA genes that have to date defied identification. sRNAscanner is available from: http://bicmku.in:8081/sRNAscanner or http://cluster.physics.iisc.ernet.in/sRNAscanner/.
Resumo:
The aim of this paper is to develop a flexible model for analysis of quantitative trait loci (QTL) in outbred line crosses, which includes both additive and dominance effects. Our flexible intercross analysis (FIA) model accounts for QTL that are not fixed within founder lines and is based on the variance component framework. Genome scans with FIA are performed using a score statistic, which does not require variance component estimation. RESULTS: Simulations of a pedigree with 800 F2 individuals showed that the power of FIA including both additive and dominance effects was almost 50% for a QTL with equal allele frequencies in both lines with complete dominance and a moderate effect, whereas the power of a traditional regression model was equal to the chosen significance value of 5%. The power of FIA without dominance effects included in the model was close to those obtained for FIA with dominance for all simulated cases except for QTL with overdominant effects. A genome-wide linkage analysis of experimental data from an F2 intercross between Red Jungle Fowl and White Leghorn was performed with both additive and dominance effects included in FIA. The score values for chicken body weight at 200 days of age were similar to those obtained in FIA analysis without dominance. CONCLUSION: We have extended FIA to include QTL dominance effects. The power of FIA was superior, or similar, to standard regression methods for QTL effects with dominance. The difference in power for FIA with or without dominance is expected to be small as long as the QTL effects are not overdominant. We suggest that FIA with only additive effects should be the standard model to be used, especially since it is more computationally efficient.
Resumo:
O conhecimento do genoma pode auxiliar na identificação de regiões cromossômicas e, eventualmente, de genes que controlam características quantitativas (QTLs) de importância econômica. em um experimento com 1.129 suínos resultantes do cruzamento entre machos da raça Meishan e fêmeas Large White e Landrace, foram analisadas as características gordura intramuscular (GIM), em %, e ganho dos 25 aos 90 kg de peso vivo (GP), em g/dia, em 298 animais F1 e 831 F2, e espessura de toucinho (ET), em mm, em 324 F1 e 805 F2. Os animais das gerações F1 e F2 foram tipificados com 29 marcadores microsatélites. Estudou-se a ligação entre os cromossomos 4, 6 e 7 com GIM, ET e GP. Análises de QTL utilizando-se metodologia Bayesiana foram aplicadas mediante três modelos genéticos: modelo poligênico infinitesimal (MPI); modelo poligênico finito (MPF), considerando-se três locos; e MPF combinado com MPI. O número de QTLs, suas respectivas posições nos três cromossomos e o efeito fenotípico foram estimados simultaneamente. Os sumários dos parâmetros estimados foram baseados nas distribuições marginais a posteriori, obtidas por meio do uso da Cadeia de Markov, algoritmos de Monte Carlo (MCMC). Foi possível evidenciar dois QTLs relacionados a GIM nos cromossomos 4 e 6 e dois a ET nos cromossomos 4 e 7. Somente quando se ajustou o MPI, foram observados QTLs no cromossomo 4 para ET e GIM. Não foi possível detectar QTLs para a característica GP com a aplicação dessa metodologia, o que pode ter resultado do uso de marcadores não informativos ou da ausência de QTLs segregando nos cromossomos 4, 6 e 7 desta população. Foi evidenciada a vantagem de se analisar dados experimentais ajustando diferentes modelos genéticos; essas análises ilustram a utilidade e ampla aplicabilidade do método Bayesiano.
Resumo:
Background: New challenges are rising in the animal protein market, and one of the main world challenges is to produce more in shorter time, with better quality and in a sustainable way. Brazil is the largest beef exporter in volume hence the factors affecting the beef meat chain are of major concern in countrýs economy. An emerging class of biotechnological approaches, the molecular markers, is bringing new perspectives to face these challenges, particularly after the publication of the first complete livestock genome (bovine), which has triggered a massive initiative to put in practice the benefits of the so called the Post-Genomic Era. Review: This article aimed at showing the directions and insights in the application of molecular markers on livestock genetic improvement and reproduction as well at organizing the progress so far, pointing some perspectives of these emerging technologies in Brazilian ruminant production context. An overview on the nature of the main molecular markers explored in ruminant production is provided, which describes the molecular bases and detection approaches available for microsatellites (STR) and single nucleotide polymorphisms (SNP). A topic is dedicated to review the history of association studies between markers and important trait variation in livestock, showing the timeline starting on quantitative trait loci (QTL) identification using STR markers and ending in high resolution SNP panels to proceed whole genome scans for phenotype/genotype association. Also the article organizes this information to reveal how QTL prospection using STR could open ground to the feasibility of marker-assisted selection and why this approach is quickly being replaced by studies involving the application of genome-wide association using SNP research in a new concept called genomic selection. Conclusion: The world's scientific community is dedicating effort and resources to apply SNP information in livestock selection through the development of high density panels for genomic association studies, connecting molecular genetic data with phenotypes of economic interest. Once generated, this information can be used to take decisions in genetic improvement programs by selecting animals with the assistance of molecular markers.
Resumo:
Background The European trout (Salmo trutta species complex) occurs across a very wide altitudinal range from lowland rivers to alpine streams. Historically, the major European river systems contained different, evolutionarily distinct trout lineages, and some of this genetic diversity has persisted in spite of extensive human-mediated translocations. We used AFLP-based genome scans to investigate the extent of potentially adaptive divergence among major drainages and along altitudinal gradients replicated in several rivers. Results The proportion of loci showing evidence of divergent selection was larger between drainages than along altitudinal transects within drainages. This suggests divergent selection is stronger between drainages, or adaptive divergence is constrained by gene flow among populations within drainages, although the latter could not be confirmed at a more local scale. Still, altitudinal divergence occurred and, at approximately 2% of the markers, parallel changes of the AFLP band frequencies with altitude were observed suggesting that altitude may well be an important source of divergent selection within rivers. Conclusions Our results indicate that adaptive genetic divergence is common both between major European river systems and along altitudinal gradients within drainages. Alpine trout appear to be a promising model system to investigate the relative roles of divergent selection and gene flow in promoting or preventing adaptation to climate gradients.