923 results for Population structure
Abstract:
The curse of dimensionality is a major problem in machine learning, data mining and knowledge discovery. Exhaustive search for the optimal subset of relevant features in a high-dimensional dataset is NP-hard. Suboptimal population-based stochastic algorithms such as genetic programming (GP) and genetic algorithms (GA) are good choices for searching large search spaces, and are usually more feasible than exhaustive and deterministic search algorithms. On the other hand, population-based stochastic algorithms often suffer from premature convergence on mediocre suboptimal solutions. The Age Layered Population Structure (ALPS) is a novel metaheuristic for overcoming premature convergence in evolutionary algorithms and for improving search in the fitness landscape. The ALPS paradigm uses an age measure to control breeding and competition between individuals in the population. This thesis uses a modification of the ALPS GP strategy called Feature Selection ALPS (FSALPS) for feature subset selection and classification in varied supervised learning tasks. FSALPS uses a novel frequency count system to rank features in the GP population based on evolved feature frequencies. The ranked features are translated into probabilities, which are used to control evolutionary processes such as terminal-symbol selection for the construction of GP trees and subtrees. The FSALPS metaheuristic continuously refines the feature subset selection process while simultaneously evolving efficient classifiers through a non-converging evolutionary process that favors selection of features with high discrimination of class labels. We investigated and compared the performance of canonical GP, ALPS and FSALPS on high-dimensional benchmark classification datasets, including a hyperspectral image. Using Tukey's HSD ANOVA test at a 95% confidence level, ALPS and FSALPS dominated canonical GP in evolving smaller but efficient trees with less bloat.
FSALPS significantly outperformed canonical GP, ALPS and some feature selection strategies reported in the related literature on dimensionality reduction.
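The frequency-count idea described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the thesis's implementation: the nested-tuple tree encoding, the function names, and the additive smoothing constant are all assumptions made for the example.

```python
import random
from collections import Counter


def count_features(tree, counts):
    """Recursively count feature terminals (strings such as 'x3') in a tree.

    Trees are nested tuples: (operator, child, child, ...); leaves are
    feature names (str) or numeric constants. This encoding is hypothetical.
    """
    if isinstance(tree, tuple):
        for child in tree[1:]:
            count_features(child, counts)
    elif isinstance(tree, str):
        counts[tree] += 1


def feature_probabilities(population, features, smoothing=1.0):
    """Turn feature frequencies across the population into probabilities.

    `smoothing` is an assumption (not from the source): it keeps features
    that currently appear in no tree selectable.
    """
    counts = Counter()
    for tree in population:
        count_features(tree, counts)
    weights = [counts[f] + smoothing for f in features]
    total = sum(weights)
    return {f: w / total for f, w in zip(features, weights)}


def pick_terminal(probs, rng=random):
    """Sample one feature terminal in proportion to its probability."""
    names = list(probs)
    return rng.choices(names, weights=[probs[n] for n in names], k=1)[0]


features = ["x0", "x1", "x2", "x3"]
population = [
    ("add", "x1", ("mul", "x1", "x3")),
    ("sub", "x3", 2.0),
    ("mul", "x1", "x0"),
]
probs = feature_probabilities(population, features)
# Rank features from most to least frequently used in the population.
ranked = sorted(features, key=probs.get, reverse=True)
```

In a GP run, something like `pick_terminal(probs)` would be called whenever a new leaf is needed during tree or subtree construction, so that frequently evolved features are proposed more often.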
Abstract:
Thesis digitized by the Records Management and Archives Division of the Université de Montréal
Abstract:
There is great interest in using amplified fragment length polymorphism (AFLP) markers because they are inexpensive and easy to produce. It is therefore possible to generate a large number of markers with wide coverage of species genomes. Several statistical methods have been proposed to study genetic structure using AFLPs, but they assume Hardy-Weinberg equilibrium and do not estimate the inbreeding coefficient, F_IS. A Bayesian method proposed by Holsinger and colleagues relaxes these simplifying assumptions, but we have identified two sources of bias that can influence estimates based on these markers: (i) the use of a uniform prior on ancestral allele frequencies and (ii) the ascertainment bias of AFLP markers. We present a new Bayesian method that avoids these biases by using an implementation based on the approximate Bayesian computation (ABC) algorithm. This new method estimates population-specific F_IS and F_ST values and lets users take into account the criteria used to select the markers included in the analyses. The software is available at our web site (http://www-leca.uif-grenoble.fi-/logiciels.htm). Finally, we provide advice on how to avoid the effects of ascertainment bias.
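For readers unfamiliar with ABC, the general rejection scheme can be sketched as below. This is a generic toy, not the authors' method or their model of AFLP data: it estimates a single band-presence frequency rather than F_IS or F_ST, and the Beta(2, 2) prior, the tolerance, and all names are assumptions chosen for illustration.

```python
import random


def simulate_count(p, n, rng):
    """Simulate how many of n sampled individuals show the marker band."""
    return sum(rng.random() < p for _ in range(n))


def abc_rejection(observed, n, draws, tolerance, rng, prior_a=2.0, prior_b=2.0):
    """Plain rejection ABC for a single frequency parameter p.

    1. Draw p from the prior (a Beta distribution; parameters are a toy choice).
    2. Simulate a data set of size n under p.
    3. Keep p if the simulated count is within `tolerance` of the observed one.
    The accepted values approximate the posterior distribution of p.
    """
    accepted = []
    for _ in range(draws):
        p = rng.betavariate(prior_a, prior_b)
        if abs(simulate_count(p, n, rng) - observed) <= tolerance:
            accepted.append(p)
    return accepted


rng = random.Random(42)
# Hypothetical data: the band is present in 30 of 100 individuals.
posterior = abc_rejection(observed=30, n=100, draws=5000, tolerance=2, rng=rng)
estimate = sum(posterior) / len(posterior)
```

Because the simulation step is explicit, a marker-selection rule could in principle be applied to simulated data in the same way as to real data, which is the kind of flexibility the abstract attributes to the ABC approach.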
Abstract:
Land-use changes can alter the spatial population structure of plant species, which may in turn affect the attractiveness of flower aggregations to different groups of pollinators at different spatial scales. To assess how pollinators respond to spatial heterogeneity of plant distributions and whether honeybees affect visitation by other pollinators, we used an extensive data set comprising ten plant species and their flower visitors from five European countries. In particular, we tested the hypothesis that the composition of the flower visitor community, in terms of visitation frequencies by different pollinator groups, was affected by the spatial plant population structure, viz. area and density measures, at a within-population (‘patch’) and among-population (‘population’) scale. We found that patch area and population density were the spatial variables that best explained the variation in visitation frequencies within the pollinator community. Honeybees had higher visitation frequencies in larger patches, while bumblebees and hoverflies had higher visitation frequencies in sparser populations. Solitary bees had higher visitation frequencies in sparser populations and smaller patches. We also tested the hypothesis that honeybees affect the composition of the pollinator community by altering the visitation frequencies of other groups of pollinators. There was a positive relationship between visitation frequencies of honeybees and bumblebees, while the relationship with hoverflies and solitary bees varied (positive, negative and no relationship) depending on the plant species under study. The overall conclusion is that the spatial structure of plant populations affects different groups of pollinators in contrasting ways at both the local (‘patch’) and the larger (‘population’) scales, and that honeybees affect flower visitation by other pollinator groups in various ways, depending on the plant species under study. 
These contrasting responses emphasize the need to investigate the entire pollinator community when the effects of landscape change on plant–pollinator interactions are studied.
Abstract:
BACKGROUND: The Salmonella enterica serovar Derby is frequently isolated from pigs and turkeys, whereas serovar Mbandaka is frequently isolated from cattle, chickens and animal feed in the UK. Through comparative genomics, phenomics and mutant construction we previously suggested possible mechanistic reasons why these serovars demonstrate apparently distinct host ranges. Here, we investigate the genetic and phenotypic diversity of these two serovars in the UK. We produce a phylogenetic reconstruction and perform several biochemical assays on isolates of S. Derby and S. Mbandaka acquired from sites across the UK between the years 2000 and 2010. RESULTS: We show that UK isolates of S. Mbandaka comprise one clonal lineage which is adapted to proficient utilisation of metabolites found in soya beans under ambient conditions. We also show that this clonal lineage forms a biofilm at 25 °C, suggesting that this serovar may be well adapted to survival ex vivo, growing in animal feed. Conversely, we show that S. Derby is made up of two distinct lineages, L1 and L2. These lineages differ genotypically and phenotypically, being divided by the presence and absence of SPI-23 and the ability to more proficiently invade the porcine jejunum-derived cell line IPEC-J2. CONCLUSION: The results of this study lend support to the hypothesis that the differences in host ranges of S. Derby and S. Mbandaka are adaptations to pathogenesis, environmental persistence, as well as utilisation of metabolites abundant in their respective host environments.
Abstract:
Extensive population structuring is known to occur in Anopheles darlingi, the primary malaria vector of the Neotropics. We analysed the phylogeographic structure of the species using the mitochondrial cytochrome oxidase I marker. Diversity is divided into six main population groups in South America: Colombia, central Amazonia, southern Brazil, south-eastern Brazil, and two groups in north-east Brazil. The ancestral distribution of the taxon is hypothesized to be central Amazonia, and there is evidence of expansion from this region during the late Pleistocene. The expansion was not a homogeneous front, however, with at least four subgroups being formed due to geographic barriers. As the species spread, populations became isolated from each other by the Amazon River and the coastal mountain ranges of south-eastern Brazil and the Andes. Analyses incorporating distances around these barriers suggest that the entire South American range of An. darlingi is at mutation-dispersal-drift equilibrium. Because the species is distributed throughout such a broad area, the limited dispersal across some landscape types promotes differentiation between otherwise proximate populations. Moreover, samples from the An. darlingi holotype location in Rio de Janeiro State are substantially derived from all other populations, implying that there may be additional genetic differences of epidemiological relevance. The results obtained contribute to our understanding of gene flow in this species and allow the formulation of human mosquito health protocols in light of the potential population differences in vector capacity or tolerance to control strategies. (C) 2009 The Linnean Society of London, Biological Journal of the Linnean Society, 2009, 97, 854-866.
Abstract:
The aim of this study was to describe the population structure and inbreeding of Santa Ines sheep and to quantify the effect of inbreeding on weight at different ages. To this end, 6,490 production records and a pedigree data set of 17,097 animals were used to evaluate birth weight (BW), weight at 60 days (W60) and weight at 180 days (W180). The genetic structure of the population was analysed with the software ENDOG (v4.6), which found some level of inbreeding in 21.72% of the animals in the pedigree, with a maximum of 41.02% and an average of 10.74% among the inbred individuals. The population average inbreeding was 2.33% and the average relatedness was 0.73%. The effective number of ancestors was 156 animals and the effective number of founders was 211 individuals. A significant depressive effect of inbreeding was found for all traits. The parameters related to genetic variability in this population must be monitored constantly in order to prevent a decrease in genetic progress. A directed mating program in the present flock is an appropriate way to keep the level of inbreeding under control. (C) 2010 Elsevier B.V. All rights reserved.
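The inbreeding figures in this abstract are mutually consistent, which a short calculation makes explicit. The sketch below assumes that animals not counted as inbred contribute an inbreeding coefficient of zero to the population mean; that assumption is not stated in the abstract, and the variable names are illustrative.

```python
# Figures quoted in the abstract.
inbred_fraction = 0.2172   # share of pedigree animals with some inbreeding
mean_f_inbred = 0.1074     # average inbreeding among the inbred animals

# Assumption: non-inbred animals have an inbreeding coefficient of 0,
# so the population mean is the inbred share times the inbred mean.
population_mean_f = inbred_fraction * mean_f_inbred

print(f"{population_mean_f:.2%}")  # prints 2.33%
```

The result matches the reported population average of 2.33%.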
Abstract:
1. Prochilodus lineatus (Prochilodontidae, Characiformes) is a migratory species of great economic importance both in fisheries and aquaculture that is found throughout the Jacui, Paraiba do Sul, Parana, Paraguay and Uruguay river basins in South America. Earlier population studies of P. lineatus in the rio Grande basin (Parana basin) indicated the existence of a single population; however, the range of this species has been fragmented by the construction of several dams. Such dams modified the environmental conditions and could have constrained the reproductive migration of P. lineatus, possibly leading to changes in the population genetic structure. 2. In order to evaluate how genetic diversity is allocated in the rio Grande basin, 141 specimens of P. lineatus from eight collection sites were analysed using polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP) with 15 restriction enzymes. 3. Forty-six haplotypes were detected, and 70% of them are restricted. The mean genetic variability indexes (h = 0.7721 and pi = 1.6%) were similar to those found in natural populations with a large effective size. Fst and Exact Test values indicated a lack of structuring among the samples, and the model of isolation by distance was tested and rejected. 4. The haplotype network indicated that this population of P. lineatus has been maintained as a single variable stock with some differences in the genetic composition (haplotypes) between samples. Indications of population expansion were detected, and this finding was supported by neutrality tests and mismatch distribution analyses. 5. The present study focused on regions between dams to serve as a parameter for further evaluations of genetic variability and the putative impact of dams and repopulation programmes in natural populations of P. lineatus. Copyright (C) 2011 John Wiley & Sons, Ltd.
Abstract:
The Hyacinth Macaw (Anodorhynchus hyacinthinus) is one of 14 endangered species in the family Psittacidae occurring in Brazil, with an estimated total population of 6,500 specimens. We used nuclear molecular markers (single-locus minisatellites and microsatellites) and 472 bp of the mitochondrial DNA control region to characterize levels of genetic variability in this species and to assess the degree of gene flow among three nesting sites in Brazil (Pantanal do Abobral, Pantanal de Miranda and Piaui). The origin of five apprehended specimens was also investigated. The results suggest that, in comparison to other species of parrots, Hyacinth Macaws possess relatively low genetic variation and that individuals from two different localities within the Pantanal (Abobral and Miranda) belong to a single interbreeding population and are genetically distinct at the nuclear level from birds from the state of Piaui. The analyses of the five apprehended birds suggest that the Pantanal is not the source of birds for illegal trade, but their precise origin could not be assigned. The low genetic variability detected in the Hyacinth Macaw does not seem to pose a threat to the survival of this species. Nevertheless, habitat destruction and nest poaching are the most important factors negatively affecting their populations in the wild. The observed genetic structure emphasizes the need to protect Hyacinth Macaws from different regions in order to maintain the genetic diversity of this species.
Abstract:
This thesis provides information on the grouping structure, survival, abundance, dive characteristics and habitat preferences of short-finned pilot whales occurring in the oceanic archipelago of Madeira (Portugal, NE Atlantic), based on data collected between 2001 and 2011, and contributes to their conservation. Photo-identification methods and genetic analyses demonstrated that there is a large degree of variability in site fidelity, including resident, regular visitor and transient whales, and that they may not be genetically isolated. It is proposed that the pilot whales encountered in Madeira belong to a single population encompassing several clans, possibly three clans of island-associated (i.e. resident and regular visitor) whales and others of transients, each containing two to three matrilineal pods. Mark-recapture methods estimated that the island-associated community is composed of fewer than 150 individuals, that their survival rate is within the range of other long-lived cetacean species, and that around 300 whales of different residency patterns use the southern area of the island of Madeira from mid-summer to mid-autumn. No significant trend was observed between years. Time-depth recorders deployed on adult whales during daytime revealed that they spend over ¾ of their time at the surface, that they have a low diving rate, and that transient whales also forage during their passage. Analyses of visual data collected from nautical and aerial line-transect surveys indicate a core/preferred habitat area in the south-east of the island of Madeira. That area is used for resting, socializing, foraging, breeding, calving and birthing. Thus, that area should be considered an important habitat for this species, at least seasonally (during autumn) when the species is more abundant, and included in conservation plans. 
No direct threat needing urgent measures was identified, although the impact of some activities like whale-watching or marine traffic should be assessed.
Abstract:
The hermit crab Pagurus brevidactylus (Crustacea: Anomura: Paguridea) from the infralittoral area of Anchieta Island, Ubatuba, was characterized by population structure (size, sex ratio, reproduction and recruitment) and growth. Animals were collected monthly during 1999 by SCUBA diving. A total of 1525 individuals were collected (633 males and 892 females), and 695 of the females were ovigerous. Overall sex ratio was 0.7:1 in favour of females. The crabs showed a unimodal distribution, with males significantly larger than females. Ovigerous females were collected in all months and in high percentages from 1.0 mm of shield length, demonstrating intense and continuous reproduction. Longevity was approximately 24 months for males and 18 for females; females showed a higher growth rate and reached sexual maturity earlier (two months) than males. The low number of males in this population may be due to the longer life span. Moreover, the sexual dimorphism favours males during intra- and interspecific fights for shells, food, reproduction and territory. Females demonstrated a short life cycle and intense reproduction.