954 resultados para Models, Genetic
Object-Oriented Genetic Programming for the Automatic Inference of Graph Models for Complex Networks
Resumo:
Complex networks are systems of entities that are interconnected through meaningful relationships. The result of the relations between entities forms a structure that has a statistical complexity that is not formed by random chance. In the study of complex networks, many graph models have been proposed to model the behaviours observed. However, constructing graph models manually is tedious and problematic. Many of the models proposed in the literature have been cited as having inaccuracies with respect to the complex networks they represent. However, recently, an approach that automates the inference of graph models was proposed by Bailey [10] The proposed methodology employs genetic programming (GP) to produce graph models that approximate various properties of an exemplary graph of a targeted complex network. However, there is a great deal already known about complex networks, in general, and often specific knowledge is held about the network being modelled. The knowledge, albeit incomplete, is important in constructing a graph model. However it is difficult to incorporate such knowledge using existing GP techniques. Thus, this thesis proposes a novel GP system which can incorporate incomplete expert knowledge that assists in the evolution of a graph model. Inspired by existing graph models, an abstract graph model was developed to serve as an embryo for inferring graph models of some complex networks. The GP system and abstract model were used to reproduce well-known graph models. The results indicated that the system was able to evolve models that produced networks that had structural similarities to the networks generated by the respective target models.
Resumo:
Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
Among the largest resources for biological sequence data is the large amount of expressed sequence tags (ESTs) available in public and proprietary databases. ESTs provide information on transcripts but for technical reasons they often contain sequencing errors. Therefore, when analyzing EST sequences computationally, such errors must be taken into account. Earlier attempts to model error prone coding regions have shown good performance in detecting and predicting these while correcting sequencing errors using codon usage frequencies. In the research presented here, we improve the detection of translation start and stop sites by integrating a more complex mRNA model with codon usage bias based error correction into one hidden Markov model (HMM), thus generalizing this error correction approach to more complex HMMs. We show that our method maintains the performance in detecting coding sequences.
Resumo:
We investigated the role of the number of loci coding for a neutral trait on the release of additive variance for this trait after population bottlenecks. Different bottleneck sizes and durations were tested for various matrices of genotypic values, with initial conditions covering the allele frequency space. We used three different types of matrices. First, we extended Cheverud and Routman's model by defining matrices of "pure" epistasis for three and four independent loci; second, we used genotypic values drawn randomly from uniform, normal, and exponential distributions; and third we used two models of simple metabolic pathways leading to physiological epistasis. For all these matrices of genotypic values except the dominant metabolic pathway, we find that, as the number of loci increases from two to three and four, an increase in the release of additive variance is occurring. The amount of additive variance released for a given set of genotypic values is a function of the inbreeding coefficient, independently of the size and duration of the bottleneck. The level of inbreeding necessary to achieve maximum release in additive variance increases with the number of loci. We find that additive-by-additive epistasis is the type of epistasis most easily converted into additive variance. For a wide range of models, our results show that epistasis, rather than dominance, plays a significant role in the increase of additive variance following bottlenecks.
Resumo:
Species range shifts in response to climate and land use change are commonly forecasted with species distribution models based on species occurrence or abundance data. Although appealing, these models ignore the genetic structure of species, and the fact that different populations might respond in different ways because of adaptation to their environment. Here, we introduced ancestry distribution models, that is, statistical models of the spatial distribution of ancestry proportions, for forecasting intra-specific changes based on genetic admixture instead of species occurrence data. Using multi-locus genotypes and extensive geographic coverage of distribution data across the European Alps, we applied this approach to 20 alpine plant species considering a global increase in temperature from 0.25 to 4 °C. We forecasted the magnitudes of displacement of contact zones between plant populations potentially adapted to warmer environments and other populations. While a global trend of movement in a north-east direction was predicted, the magnitude of displacement was species-specific. For a temperature increase of 2 °C, contact zones were predicted to move by 92 km on average (minimum of 5 km, maximum of 212 km) and by 188 km for an increase of 4 °C (minimum of 11 km, maximum of 393 km). Intra-specific turnover-measuring the extent of change in global population genetic structure-was generally found to be moderate for 2 °C of temperature warming. For 4 °C of warming, however, the models indicated substantial intra-specific turnover for ten species. These results illustrate that, in spite of unavoidable simplifications, ancestry distribution models open new perspectives to forecast population genetic changes within species and complement more traditional distribution-based approaches.
Resumo:
Female mate choice influences the maintenance of genetic variation by altering the mating success of males with different genotypes. The evolution of preferences themselves, on the other hand, depends on genetic variation present in the population. Few models have tracked this feedback between a choice gene and its effects on genetic variation, in particular when genes that determine offspring viability and attractiveness have dominance effects. Here we build a population genetic model that allows comparing the evolution of various choice rules in a single framework. We first consider preferences for good genes and show that focused preferences for homozygotes evolve more easily than broad preferences, which allow heterozygous males high mating success too. This occurs despite better maintenance of genetic diversity in the latter scenario, and we discuss why empirical findings of superior mating success of heterozygous males consequently do not immediately lead to a better understanding of the lek paradox. Our results thus suggest that the mechanisms that help maintain genetic diversity also have a flipside of making female choice an inaccurate means of producing the desired kind of offspring. We then consider preferences for heterozygosity per se, and show that these evolve only under very special conditions. Choice for compatible genotypes can evolve but its selective advantage diminishes quickly due to frequency-dependent selection. Finally, we show that our model reproduces earlier results on selfing, when the female choice strategy produces assortative mating. Overall, our model indicates that various forms of heterozygote-favouring (or variable) female choice pose a problem for the theory of sexual ornamentation based on indirect benefits, rather than a solution.
Resumo:
Using genome-wide association, we identify common variants at 2p12-p13, 6q26, 17q23 and 19q13 associated with serum creatinine, a marker of kidney function (P = 10(-10) to 10(-15)). Of these, rs10206899 (near NAT8, 2p12-p13) and rs4805834 (near SLC7A9, 19q13) were also associated with chronic kidney disease (P = 5.0 x 10(-5) and P = 3.6 x 10(-4), respectively). Our findings provide insight into metabolic, solute and drug-transport pathways underlying susceptibility to chronic kidney disease.
Resumo:
Our understanding of the distribution of worldwide human genomic diversity has greatly increased over recent years thanks to the availability of large data sets derived from short tandem repeats (STRs), insertion deletion polymorphisms (indels) and single nucleotide polymorphisms (SNPs). A concern, however, is that the current picture of worldwide human genomic diversity may be inaccurate because of biases in the selection process of genetic markers (so-called 'ascertainment bias'). To evaluate this problem, we first compared the distribution of genomic diversity between these three types of genetic markers in the populations from the HGDP-CEPH panel for evidence of bias or incongruities. In a second step, using a very relaxed set of criteria to prevent the intrusion of bias, we developed a new set of unbiased STR markers and compared the results against those from available panels. Contrarily to recent claims, our results show that the STR markers suffer from no discernible bias, and can thus be used as a baseline reference for human genetic diversity and population differentiation. The bias on SNPs is moderate compared to that on the set of indels analysed, which we recommend should be avoided for work describing the distribution of human genetic diversity or making inference on human settlement history.
Resumo:
Due to practical difficulties in obtaining direct genetic estimates of effective sizes, conservation biologists have to rely on so-called 'demographic models' which combine life-history and mating-system parameters with F-statistics in order to produce indirect estimates of effective sizes. However, for the same practical reasons that prevent direct genetic estimates, the accuracy of demographic models is difficult to evaluate. Here we use individual-based, genetically explicit computer simulations in order to investigate the accuracy of two such demographic models aimed at investigating the hierarchical structure of populations. We show that, by and large, these models provide good estimates under a wide range of mating systems and dispersal patterns. However, one of the models should be avoided whenever the focal species' breeding system approaches monogamy with no sex bias in dispersal or when a substructure within social groups is suspected because effective sizes may then be strongly overestimated. The timing during the life cycle at which F-statistics are evaluated is also of crucial importance and attention should be paid to it when designing field sampling since different demographic models assume different timings. Our study shows that individual-based, genetically explicit models provide a promising way of evaluating the accuracy of demographic models of effective size and delineate their field of applicability.
Resumo:
Genetic caste determination has been described in two populations of Pogonomyrmex harvester ants, each comprising a pair of interbreeding lineages. Queens mate with males of their own and of the alternate lineage and produce two types of diploid offspring, those fertilized by males of the queens' lineage which develop into queens and those fertilized by males of the other lineage which develop into workers. Each of the lineages has been shown to be itself of hybrid origin between the species Pogonomyrmex barbatus and Pogonomyrmex rugosus, which both have typical, environmentally determined caste differentiation. In a large scale genetic survey across 35 sites in Arizona, New Mexico and Texas, we found that genetic caste determination associated with pairs of interbreeding lineages occurred frequently (in 26 out of the 35 sites). Overall, we identified eight lineages with genetic caste determination that always co-occurred in the same complementary lineage pairs. Three of the four lineage pairs appear to have a common origin while their relationship with the fourth remains unclear. The level of genetic differentiation among these eight lineages was significantly higher than the differentiation between P. rugosus and P. barbatus, which questions the appropriate taxonomic status of these genetic lineages. In addition to being genetically isolated from one another, all lineages with genetic caste determination were genetically distinct from P. rugosus and P. barbatus, even when colonies of interbreeding lineages co-occurred with colonies of either putative parent at the same site. Such nearly complete reproductive isolation between the lineages and the species with environmental caste determination might prevent the genetic caste determination system to be swept away by gene flow.
Resumo:
A range of models describing metapopulations is surveyed and their implications for conservation biology are described. An overview of the use of both population genetic elements and demographic theory in metapopulation models is given. It would appear that most of the current models suffer from either the use of over-simplified demography or the avoidance of selectively important genetic factors. The scale for which predictions are made by the various models is often obscure. A conceptual framework for describing metapopulations by utilising the concept of fitness of local populations is provided and some examples are given. The expectation that any general theory, such as that of metapopulations, can make useful predictions for particular problems of conservation is examined and compared with the prevailing 'state of the art' recommendations.
Resumo:
Regulatory gene networks contain generic modules, like those involving feedback loops, which are essential for the regulation of many biological functions (Guido et al. in Nature 439:856-860, 2006). We consider a class of self-regulated genes which are the building blocks of many regulatory gene networks, and study the steady-state distribution of the associated Gillespie algorithm by providing efficient numerical algorithms. We also study a regulatory gene network of interest in gene therapy, using mean-field models with time delays. Convergence of the related time-nonhomogeneous Markov chain is established for a class of linear catalytic networks with feedback loops.
Resumo:
OBJECTIVE: To describe the epidemiology of cleft palate (CP) in Europe. DESIGN AND SETTING: A descriptive epidemiological study on 3852 cases of CP, identified (1980 through 1996) from more than 6 million births from the EUROCAT network of 30 registers in 16 European countries. RESULTS: Significant differences in prevalence in Europe between registries and within countries were observed. A total of 2112 (54.8%) CP cases occurred as isolated, 694 (18.0%) were associated with other defects such as multiple congenital anomalies, and 1046 (27.2%) were in recognized conditions. The study confirmed the tendency toward female prevalence (sex ratio [SR] = 0.83), particularly among isolated cases (SR = 0.78) even if SR inversion is reported in some registries. A specific association with neural tube defects (NTDs) in some registers is reported. CONCLUSION: The differences identified in Europe (prevalence, sex, associated anomalies) can be only partially explained by methodological reasons because a common methodology was shared among all registries for case ascertainment and collection, and CP is an easy detectable condition with few induced abortions. The complex model of inheritance and the frequently conflicting results in different populations on the role of genes that constitute risk factors suggest the presence of real biological differences. The association of CP/NTD in an area with a high prevalence of NTDs can identify a group of conditions that can be considered etiologically homogeneous. The epidemiological evaluation can guide genetic research to specify the role of etiological factors in each different population
Resumo:
quantiNemo is an individual-based, genetically explicit stochastic simulation program. It was developed to investigate the effects of selection, mutation, recombination and drift on quantitative traits with varying architectures in structured populations connected by migration and located in a heterogeneous habitat. quantiNemo is highly flexible at various levels: population, selection, trait(s) architecture, genetic map for QTL and/or markers, environment, demography, mating system, etc. quantiNemo is coded in C++ using an object-oriented approach and runs on any computer platform. Availability: Executables for several platforms, user's manual, and source code are freely available under the GNU General Public License at http://www2.unil.ch/popgen/softwares/quantinemo.