940 resultados para models, genetic
Object-Oriented Genetic Programming for the Automatic Inference of Graph Models for Complex Networks
Resumo:
Complex networks are systems of entities that are interconnected through meaningful relationships. The result of the relations between entities forms a structure that has a statistical complexity that is not formed by random chance. In the study of complex networks, many graph models have been proposed to model the behaviours observed. However, constructing graph models manually is tedious and problematic. Many of the models proposed in the literature have been cited as having inaccuracies with respect to the complex networks they represent. However, recently, an approach that automates the inference of graph models was proposed by Bailey [10] The proposed methodology employs genetic programming (GP) to produce graph models that approximate various properties of an exemplary graph of a targeted complex network. However, there is a great deal already known about complex networks, in general, and often specific knowledge is held about the network being modelled. The knowledge, albeit incomplete, is important in constructing a graph model. However it is difficult to incorporate such knowledge using existing GP techniques. Thus, this thesis proposes a novel GP system which can incorporate incomplete expert knowledge that assists in the evolution of a graph model. Inspired by existing graph models, an abstract graph model was developed to serve as an embryo for inferring graph models of some complex networks. The GP system and abstract model were used to reproduce well-known graph models. The results indicated that the system was able to evolve models that produced networks that had structural similarities to the networks generated by the respective target models.
Resumo:
Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
Objectives - It has long been suspected that susceptibility to ankylosing spondylitis (AS) is influenced by genes lying distant to the major histocompatibility complex. This study compares genetic models of AS to assess the most likely mode of inheritance, using recurrence risk ratios in relatives of affected subjects. Methods - Recurrence risk ratios in different degrees of relatives were determined using published data from studies specifically designed to address the question. The methods of Risch were used to determine the expected recurrence risk ratios in different degrees of relatives, assuming equal first degree relative recurrence risk between models. Goodness of fit was determined by χ2 comparison of the expected number of affected subjects with the observed number, given equal numbers of each type of relative studied. Results - The recurrence risks in different degrees of relatives were: monozygotic (MZ) twins 63% (17/27), first degree relatives 8.2% (441/5390), second degree relatives 1.0% (8/834), and third degree relatives 0.7% (7/997). Parent-child recurrence risk (7.9%, 37/466) was not significantly different from the sibling recurrence risk (8.2%, 404/4924), excluding a significant dominance genetic component to susceptibility. Poor fitting models included single gene, genetic heterogeneity, additive, two locus multiplicative, and one locus and residual polygenes (χ2 > 32 (two degrees of freedom), p < 10-6 for all models). The best fitting model studied was a five locus model with multiplicative interaction between loci (χ2 = 1.4 (two degrees of freedom), p = 0.5). Oligogenic multiplicative models were the best fitting over a range of population prevalences and first degree recurrence risk rates. Conclusions - This study suggests that of the genetic models tested, the most likely model operating in AS is an oligogenic model with predominantly multiplicative interaction between loci.
Resumo:
Endosplasmic reticulum aminopeptidase 1 (ERAP1), endoplasmic reticulum aminopeptidase 2 (ERAP2) and puromycin-sensitive aminopeptidase (NPEPPS) are key zinc metallopeptidases that belong to the oxytocinase subfamily of M1 aminopeptidase family. NPEPPS catalyzes the processing of proteosome-derived peptide repertoire followed by trimming of antigenic peptides by ERAP1 and ERAP2 for presentation on major histocompatibility complex (MHC) Class I molecules. A series of genome-wide association studies have demonstrated associations of these aminopeptidases with a range of immune-mediated diseases such as ankylosing spondylitis, psoriasis, Behçet's disease, inflammatory bowel disease and type I diabetes, and significantly, genetic interaction between some aminopeptidases and HLA Class I loci with which these diseases are strongly associated. In this review, we highlight the current state of understanding of the genetic associations of this class of genes, their functional role in disease, and potential as therapeutic targets.
Resumo:
The MFG test is a family-based association test that detects genetic effects contributing to disease in offspring, including offspring allelic effects, maternal allelic effects and MFG incompatibility effects. Like many other family-based association tests, it assumes that the offspring survival and the offspring-parent genotypes are conditionally independent provided the offspring is affected. However, when the putative disease-increasing locus can affect another competing phenotype, for example, offspring viability, the conditional independence assumption fails and these tests could lead to incorrect conclusions regarding the role of the gene in disease. We propose the v-MFG test to adjust for the genetic effects on one phenotype, e.g., viability, when testing the effects of that locus on another phenotype, e.g., disease. Using genotype data from nuclear families containing parents and at least one affected offspring, the v-MFG test models the distribution of family genotypes conditional on offspring phenotypes. It simultaneously estimates genetic effects on two phenotypes, viability and disease. Simulations show that the v-MFG test produces accurate genetic effect estimates on disease as well as on viability under several different scenarios. It generates accurate type-I error rates and provides adequate power with moderate sample sizes to detect genetic effects on disease risk when viability is reduced. We demonstrate the v-MFG test with HLA-DRB1 data from study participants with rheumatoid arthritis (RA) and their parents, we show that the v-MFG test successfully detects an MFG incompatibility effect on RA while simultaneously adjusting for a possible viability loss.
Inference of the genetic architecture underlying BMI and height with the use of 20,240 sibling pairs
Resumo:
Evidence that complex traits are highly polygenic has been presented by population-based genome-wide association studies (GWASs) through the identification of many significant variants, as well as by family-based de novo sequencing studies indicating that several traits have a large mutational target size. Here, using a third study design, we show results consistent with extreme polygenicity for body mass index (BMI) and height. On a sample of 20,240 siblings (from 9,570 nuclear families), we used a within-family method to obtain narrow-sense heritability estimates of 0.42 (SE = 0.17, p = 0.01) and 0.69 (SE = 0.14, p = 6 x 10(-)(7)) for BMI and height, respectively, after adjusting for covariates. The genomic inflation factors from locus-specific linkage analysis were 1.69 (SE = 0.21, p = 0.04) for BMI and 2.18 (SE = 0.21, p = 2 x 10(-10)) for height. This inflation is free of confounding and congruent with polygenicity, consistent with observations of ever-increasing genomic-inflation factors from GWASs with large sample sizes, implying that those signals are due to true genetic signals across the genome rather than population stratification. We also demonstrate that the distribution of the observed test statistics is consistent with both rare and common variants underlying a polygenic architecture and that previous reports of linkage signals in complex traits are probably a consequence of polygenic architecture rather than the segregation of variants with large effects. The convergent empirical evidence from GWASs, de novo studies, and within-family segregation implies that family-based sequencing studies for complex traits require very large sample sizes because the effects of causal variants are small on average.
ssSNPer: identifying statistically similar SNPs to aid interpretation of genetic association studies
Resumo:
ssSNPer is a novel user-friendly web interface that provides easy determination of the number and location of untested HapMap SNPs, in the region surrounding a tested HapMap SNP, which are statistically similar and would thus produce comparable and perhaps more significant association results. Identification of ssSNPs can have crucial implications for the interpretation of the initial association results and the design of follow-up studies. AVAILABILITY: http://fraser.qimr.edu.au/general/daleN/ssSNPer/
Resumo:
Latent class and genetic analyses were used to identify subgroups of migraine sufferers in a community sample of 6,265 Australian twins (55% female) aged 25-36 who had completed an interview based on International Headache Society (IHS) criteria. Consistent with prevalence rates from other population-based studies, 703 (20%) female and 250 (9%) male twins satisfied the IHS criteria for migraine without aura (MO), and of these, 432 (13%) female and 166 (6%) male twins satisfied the criteria for migraine with aura (MA) as indicated by visual symptoms. Latent class analysis (LCA) of IHS symptoms identified three major symptomatic classes, representing 1) a mild form of recurrent nonmigrainous headache, 2) a moderately severe form of migraine, typically without visual aura symptoms (although 40% of individuals in this class were positive for aura), and 3) a severe form of migraine typically with visual aura symptoms (although 24% of individuals were negative for aura). Using the LCA classification, many more individuals were considered affected to some degree than when using IHS criteria (35% vs. 13%). Furthermore, genetic model fitting indicated a greater genetic contribution to migraine using the LCA classification (heritability, h(2)=0.40; 95% CI, 0.29-0.46) compared with the IHS classification (h(2)=0.36; 95% CI, 0.22-0.42). Exploratory latent class modeling, fitting up to 10 classes, did not identify classes corresponding to either the IHS MO or MA classification. Our data indicate the existence of a continuum of severity, with MA more severe but not etiologically distinct from MO. In searching for predisposing genes, we should therefore expect to find some genes that may underlie all major recurrent headache subtypes, with modifying genetic or environmental factors that may lead to differential expression of the liability for migraine.
Resumo:
Population introduction is an important tool for ecosystem restoration. However, before introductions should be conducted, it is important to evaluate the genetic, phenotypic and ecological suitability of possible replacement populations. Careful genetic analysis is particularly important if it is suspected that the extirpated population was unique or genetically divergent. On the island of Martha's Vineyard, Massachusetts, the introduction of greater prairie chickens (Tympanuchus cupido pinnatus) to replace the extinct heath hen (T. cupido cupido) is being considered as part of an ecosystem restoration project. Martha's Vineyard was home to the last remaining heath hen population until its extinction in 1932. We conducted this study to aid in determining the suitability of greater prairie chickens as a possible replacement for the heath hen. We examined mitochondrial control region sequences from extant populations of all prairie grouse species (Tympanuchus) and from museum skin heath hen specimens. Our data suggest that the Martha's Vineyard heath hen population represents a divergent mitochondrial lineage. This result is attributable either to a long period of geographical isolation from other prairie grouse populations or to a population bottleneck resulting from human disturbance. The mtDNA diagnosability of the heath hen contrasts with the network of mtDNA haplotypes of other prairie grouse (T. cupido attwateri, T. pallidicinctus and T. phasianellus), which do not form distinguishable mtDNA groupings. Our findings suggest that the Martha's Vineyard heath hen was more genetically isolated than are current populations of prairie grouse and place the emphasis for future research on examining prairie grouse adaptations to different habitat types to assess ecological exchangeability between heath hens and greater prairie chickens.
Resumo:
Association studies of quantitative traits have often relied on methods in which a normal distribution of the trait is assumed. However, quantitative phenotypes from complex human diseases are often censored, highly skewed, or contaminated with outlying values. We recently developed a rank-based association method that takes into account censoring and makes no distributional assumptions about the trait. In this study, we applied our new method to age-at-onset data on ALDX1 and ALDX2. Both traits are highly skewed (skewness > 1.9) and often censored. We performed a whole genome association study of age at onset of the ALDX1 trait using Illumina single-nucleotide polymorphisms. Only slightly more than 5% of markers were significant. However, we identified two regions on chromosomes 14 and 15, which each have at least four significant markers clustering together. These two regions may harbor genes that regulate age at onset of ALDX1 and ALDX2. Future fine mapping of these two regions with densely spaced markers is warranted.
Resumo:
In 265 Irish pedigrees, with linkage analysis we find evidence for a vulnerability locus for schizophrenia in region 6p24-22. The greatest lod score, assuming locus heterogeneity, is 3.51 (P = 0.0002) with D6S296. Another test, the C test, also supported linkage, the strongest results being obtained with D6S296 (P = 0.00001), D6S274 (P = 0.004) and D6S285 (P = 0.006). Non-parametric analysis yielded suggestive, but substantially weaker, findings. This locus appears to influence the vulnerability to schizophrenia in roughly 15 to 30% of our pedigrees. Evidence for linkage was maximal using an intermediate phenotypic definition and declined when this definition was narrowed or was broadened to include other psychiatric disorders.
Resumo:
Inbreeding depression is most pronounced for traits closely associated with fitness. The traditional explanation is that natural selection eliminates deleterious mutations with additive or dominant effects more effectively than recessive mutations, leading to directional dominance for traits subject to strong directional selection. Here we report the unexpected finding that, in the butterfly Bicyclus anynana, male sterility contributes disproportionately to inbreeding depression for fitness (complete sterility in about half the sons from brother-sister matings), while female fertility is insensitive to inbreeding. The contrast between the sexes for functionally equivalent traits is inconsistent with standard selection arguments, and suggests that trait-specific developmental properties and cryptic selection play crucial roles in shaping genetic architecture. There is evidence that spermatogenesis is less developmentally stable than oogenesis, though the unusually high male fertility load in B. anynana additionally suggests the operation of complex selection maintaining male sterility recessives. Analysis of the precise causes of inbreeding depression will be needed to generate a model that reliably explains variation in directional dominance and reconciles the gap between observed and expected genetic loads carried by populations. This challenging evolutionary puzzle should stimulate work on the occurrence and causes of sex differences in fertility load.
Resumo:
Among the largest resources for biological sequence data is the large amount of expressed sequence tags (ESTs) available in public and proprietary databases. ESTs provide information on transcripts but for technical reasons they often contain sequencing errors. Therefore, when analyzing EST sequences computationally, such errors must be taken into account. Earlier attempts to model error prone coding regions have shown good performance in detecting and predicting these while correcting sequencing errors using codon usage frequencies. In the research presented here, we improve the detection of translation start and stop sites by integrating a more complex mRNA model with codon usage bias based error correction into one hidden Markov model (HMM), thus generalizing this error correction approach to more complex HMMs. We show that our method maintains the performance in detecting coding sequences.