9 resultados para COLONY GENETIC STRUCTURE
em Brock University, Canada
Resumo:
Three dimensional model design is a well-known and studied field, with numerous real-world applications. However, the manual construction of these models can often be time-consuming to the average user, despite the advantages o ffered through computational advances. This thesis presents an approach to the design of 3D structures using evolutionary computation and L-systems, which involves the automated production of such designs using a strict set of fitness functions. These functions focus on the geometric properties of the models produced, as well as their quantifiable aesthetic value - a topic which has not been widely investigated with respect to 3D models. New extensions to existing aesthetic measures are discussed and implemented in the presented system in order to produce designs which are visually pleasing. The system itself facilitates the construction of models requiring minimal user initialization and no user-based feedback throughout the evolutionary cycle. The genetic programming evolved models are shown to satisfy multiple criteria, conveying a relationship between their assigned aesthetic value and their perceived aesthetic value. Exploration into the applicability and e ffectiveness of a multi-objective approach to the problem is also presented, with a focus on both performance and visual results. Although subjective, these results o er insight into future applications and study in the fi eld of computational aesthetics and automated structure design.
Resumo:
The curse of dimensionality is a major problem in the fields of machine learning, data mining and knowledge discovery. Exhaustive search for the most optimal subset of relevant features from a high dimensional dataset is NP hard. Sub–optimal population based stochastic algorithms such as GP and GA are good choices for searching through large search spaces, and are usually more feasible than exhaustive and deterministic search algorithms. On the other hand, population based stochastic algorithms often suffer from premature convergence on mediocre sub–optimal solutions. The Age Layered Population Structure (ALPS) is a novel metaheuristic for overcoming the problem of premature convergence in evolutionary algorithms, and for improving search in the fitness landscape. The ALPS paradigm uses an age–measure to control breeding and competition between individuals in the population. This thesis uses a modification of the ALPS GP strategy called Feature Selection ALPS (FSALPS) for feature subset selection and classification of varied supervised learning tasks. FSALPS uses a novel frequency count system to rank features in the GP population based on evolved feature frequencies. The ranked features are translated into probabilities, which are used to control evolutionary processes such as terminal–symbol selection for the construction of GP trees/sub-trees. The FSALPS metaheuristic continuously refines the feature subset selection process whiles simultaneously evolving efficient classifiers through a non–converging evolutionary process that favors selection of features with high discrimination of class labels. We investigated and compared the performance of canonical GP, ALPS and FSALPS on high–dimensional benchmark classification datasets, including a hyperspectral image. Using Tukey’s HSD ANOVA test at a 95% confidence interval, ALPS and FSALPS dominated canonical GP in evolving smaller but efficient trees with less bloat expressions. FSALPS significantly outperformed canonical GP and ALPS and some reported feature selection strategies in related literature on dimensionality reduction.
Resumo:
The curse of dimensionality is a major problem in the fields of machine learning, data mining and knowledge discovery. Exhaustive search for the most optimal subset of relevant features from a high dimensional dataset is NP hard. Sub–optimal population based stochastic algorithms such as GP and GA are good choices for searching through large search spaces, and are usually more feasible than exhaustive and determinis- tic search algorithms. On the other hand, population based stochastic algorithms often suffer from premature convergence on mediocre sub–optimal solutions. The Age Layered Population Structure (ALPS) is a novel meta–heuristic for overcoming the problem of premature convergence in evolutionary algorithms, and for improving search in the fitness landscape. The ALPS paradigm uses an age–measure to control breeding and competition between individuals in the population. This thesis uses a modification of the ALPS GP strategy called Feature Selection ALPS (FSALPS) for feature subset selection and classification of varied supervised learning tasks. FSALPS uses a novel frequency count system to rank features in the GP population based on evolved feature frequencies. The ranked features are translated into probabilities, which are used to control evolutionary processes such as terminal–symbol selection for the construction of GP trees/sub-trees. The FSALPS meta–heuristic continuously refines the feature subset selection process whiles simultaneously evolving efficient classifiers through a non–converging evolutionary process that favors selection of features with high discrimination of class labels. We investigated and compared the performance of canonical GP, ALPS and FSALPS on high–dimensional benchmark classification datasets, including a hyperspectral image. Using Tukey’s HSD ANOVA test at a 95% confidence interval, ALPS and FSALPS dominated canonical GP in evolving smaller but efficient trees with less bloat expressions. FSALPS significantly outperformed canonical GP and ALPS and some reported feature selection strategies in related literature on dimensionality reduction.
Resumo:
Strain improvement of the insect pathogenic fungus Metarhizium anisopUae is necessary to increase its virulence towards agricultural pests and thus improve its commercial efficacy. Nevertheless, the release of genetically modified conidia in crop fields may negatively affect the ecosystem. Controlling conidiation is a potential means of limiting the release of engineered strains since conidia are the infective propagules and the means of dispersal. The purpose of this study was to research the colony development of M. anisopUae to identify potential targets for genetic manipulation to control conidiation. Following Agrobacterium tumefaciem insertional mutagenesis, phenotypic mutants were characterized using Y-shaped adaptor dependent extension PCR. Four of 1 8 colony development recombinants had T-DNA flanking sequences with high homology to genes encoding known signaling pathway proteins that regulate pathogenesis and/or asexual development in filamentous fungi. Conidial density counts and insect bioassays suggested that a Serine/Threonine protein kinase COTl homolog is not essential for conidiation or virulence. Furthermore, a choline kinase homolog is important for conidiation, but not virulence. Finally, the regulator of G protein signaling CAG8 and a NADPH oxidase NoxA homolog are necessary for conidiation and virulence. These genes are candidates for further investigation into the regulatory pathways controlling conidiation to yield insight into promising gene targets for biocontrol strain improvement.
Resumo:
Regional structural analysis of the Timmins area indicates four major periods of tectonic deformation. The DI deformation is characterized by a series of isoclinal FI folds which are outlined in the study area by bedding, pillow tops and variolitic flows. The D2 deformation developed the Porcupine Syncline and refolded the Fl folds about a NE. axis. A pervasive S2 foliation developed during low grade (greenschist) regional metamorphism associated with the D2 deformation. The S2 foliation developed south of the Destor-Porcupine Break. The third phase of tectonic D3 deformation is recognized by the development of a S3 sub-horizontal crenulation cleavage which developed on the plane of the S2 foliation. No meso scopic folds are associated with this deformation. The 8 3 crenulation cleavage is observed south of the Destor-Porcupine Break. The D4 tectonic deformation is recorded as a subvertical S4 crenulation cleavage which developed on the plane of the S2 foliation and also offsets the S3 crenulation cleavage. Macroscopic F4 folds have refolded the F2 axial plane. No metamorphic recrystallization is associated with this deformation. The S4 crenulation cleavage is observed south of the Destor-Porcupine Break. Petrographic evidence indicates that the Timmins area has been subjected to pervasive regional low grade (greenschist) metamorphism which has recrystallized the original mineralogy. South of the study are~ the Donut Lake ultramafic lavas have been subjected to contact medium grade (amphibolite facies) metamorphism associated with the intrusion of the Peterlong Lake Complex. The Archean volcanic rocks of the Timmins area have been subdivided into komatiitic, tholeiitic and calcalkaline suites based on Zr, Ti0 2 and Ni. The three elements were used because of their r e lative immobility during subsequent metamorphic events. Geochemical observations in the Timmins area indicates that the composition of the Goose Lake and Donut Lake Formations are a series of peridotitic, pyroxenitic and basaltic komatiites. The Lower Schumacher Formation is a sequence of basaltic komatiites while the upper part of the Lower Schumacher Formation is an intercalated sequence of basaltic komatiites and low Ti0 2 tholeiites. The variolitic flows are felsic tholeiites in composition and geochemical evidenc e sugg ests that they developed as a n immiscible splitting of a tholeiitic magma. The Upper Schumacher Formation is a sequence of tholeiitic rocks dis p laying a mild iron enrichment. The Krist and Boomerang Formations are the felsic calc-alkaline rocks of the study area which are characteristically pyroclastic. The Redstone Fo rmation is dominantly a calc-alkali ne sequence of volcani c rocks whose minor mafic end me mbers exposed in 1t.he study hav e basaltic komatiitic compositions. Geochemical evidence sugges ts that the Keewatin-type se dimentary rocks have a composition similar to a quartz diorite or a granodiorite. Fi e l d obs ervations and petrographic evidence suggests that they were derived fr om a distal source and now repr esent i n part a turbidite sequence. The Timiskaming-type sedimentary rocks approach the c omp osi t ion of the felsic calc-alkaline rocks of the study area . The basal conglomerate in the study are a sugge s ts that th e uni t was derived fr om a proximal source. Petrographic and ge ochemical evidence suggests that the peridotitic and pyroxenitic komatiites originated as a 35-55% partial melt within the mantle, in excess of 100 Km. depth. The melt ros e as a diapir with the subsequent effusion of the ultramafic lavas, The basaltic komatiites and tholeiitic rocks originated in the mantle from lesser degrees of partial melting and fractionated in low pressure chambers. Geochemical evidence suggests a "genetic link" between the basaltic komatiites and tholeiites, The calc-alkaline rocks developed as a result of the increa.se In PO in the magma chamber. The felsic calcalkaline rocks are a late stage effusion possibly the last major volcanic eruptions in the area.
Resumo:
The nucleotide sequence of a genomic DNA fragment thought previously to contain the dihydrofolate reductase gene (DFR1) of Saccharomyces cerevisiae by genetic criteria was determined. This DNA fragment of 1784' basepairs contains a large open reading frame from position 800 to 1432, which encodes a enzyme with a predicted molecular weight of 24,229.8 Daltons. Analysis of the amino acid sequence of this protein revealed that the yeast polypep·tide contained 211 amino acids, compared to the 186 residues commonly found in the polypeptides of other eukaryotes. The difference in size of the gene product can be attributed mainly to an insert in the yeast gene. Within this region, several consensus sequences required for processing of yeast nuclear and class II mitochondrial introns were identified, but appear not sufficient for the RNA splicing. The primary structure of the yeast DHFR protein has considerable sequence homology with analogous polypeptides from other organisms, especially in the consensus residues involved in cofactor and/or inhibitor binding. Analysis of the nucleotide sequence also revealed the presence of a number of canonical sequences identified in yeast as having some function in the regulation of gene expression. These include UAS elements (TGACTC) required for tIle amino acid general control response, and "TATA H boxes as well as several consensus sequences thought to be required for transcriptional termination and polyadenylation. Analysis of the codon usage of the yeast DFRl coding region revealed a codon bias index of 0.0083. this valve very close to zero suggestes 3 that the gene is expressed at a relatively low level under normal physiological conditions. The information concerning the organization of the DFRl were used to construct a variety of fusions of its 5' regulatory region with the coding region of the lacZ gene of E. coli. Some of such fused genes encoded a fusion product that expressed in E.coli and/or in yeast under the control of the 5' regulatory elements of the DFR1. Further studies with these fusion constructions revealed that the beta-galactosidase activity encoded on multicopy plasmids was stimulated transiently by prior exposure of yeast host cells to UV light. This suggests that the yeast PFRl gene is indu.ced by UV light and nlay in1ply a novel function of DHFR protein in the cellular responses to DNA damage. Another novel f~ature of yeast DHFR was revealed during preliminary studies of a diploid strain containing a heterozygous DFRl null allele. The strain was constructed by insertion of a URA3 gene within the coding region of DFR1. Sporulation of this diploid revealed that meiotic products segregated 2:0 for uracil prototrophy when spore clones were germinated on medium supplemented with 5-formyltetrahydrofolate (folinic acid). This finding suggests that, in addition to its catalytic activity, the DFRl gene product nlay play some role in the anabolisln of folinic acid. Alternatively, this result may indicate that Ura+ haploid segregants were inviable and suggest that the enzyme has an essential cellular function in this species.
Resumo:
Complex networks can arise naturally and spontaneously from all things that act as a part of a larger system. From the patterns of socialization between people to the way biological systems organize themselves, complex networks are ubiquitous, but are currently poorly understood. A number of algorithms, designed by humans, have been proposed to describe the organizational behaviour of real-world networks. Consequently, breakthroughs in genetics, medicine, epidemiology, neuroscience, telecommunications and the social sciences have recently resulted. The algorithms, called graph models, represent significant human effort. Deriving accurate graph models is non-trivial, time-intensive, challenging and may only yield useful results for very specific phenomena. An automated approach can greatly reduce the human effort required and if effective, provide a valuable tool for understanding the large decentralized systems of interrelated things around us. To the best of the author's knowledge this thesis proposes the first method for the automatic inference of graph models for complex networks with varied properties, with and without community structure. Furthermore, to the best of the author's knowledge it is the first application of genetic programming for the automatic inference of graph models. The system and methodology was tested against benchmark data, and was shown to be capable of reproducing close approximations to well-known algorithms designed by humans. Furthermore, when used to infer a model for real biological data the resulting model was more representative than models currently used in the literature.
Resumo:
Experimental Extended X-ray Absorption Fine Structure (EXAFS) spectra carry information about the chemical structure of metal protein complexes. However, pre- dicting the structure of such complexes from EXAFS spectra is not a simple task. Currently methods such as Monte Carlo optimization or simulated annealing are used in structure refinement of EXAFS. These methods have proven somewhat successful in structure refinement but have not been successful in finding the global minima. Multiple population based algorithms, including a genetic algorithm, a restarting ge- netic algorithm, differential evolution, and particle swarm optimization, are studied for their effectiveness in structure refinement of EXAFS. The oxygen-evolving com- plex in S1 is used as a benchmark for comparing the algorithms. These algorithms were successful in finding new atomic structures that produced improved calculated EXAFS spectra over atomic structures previously found.
Object-Oriented Genetic Programming for the Automatic Inference of Graph Models for Complex Networks
Resumo:
Complex networks are systems of entities that are interconnected through meaningful relationships. The result of the relations between entities forms a structure that has a statistical complexity that is not formed by random chance. In the study of complex networks, many graph models have been proposed to model the behaviours observed. However, constructing graph models manually is tedious and problematic. Many of the models proposed in the literature have been cited as having inaccuracies with respect to the complex networks they represent. However, recently, an approach that automates the inference of graph models was proposed by Bailey [10] The proposed methodology employs genetic programming (GP) to produce graph models that approximate various properties of an exemplary graph of a targeted complex network. However, there is a great deal already known about complex networks, in general, and often specific knowledge is held about the network being modelled. The knowledge, albeit incomplete, is important in constructing a graph model. However it is difficult to incorporate such knowledge using existing GP techniques. Thus, this thesis proposes a novel GP system which can incorporate incomplete expert knowledge that assists in the evolution of a graph model. Inspired by existing graph models, an abstract graph model was developed to serve as an embryo for inferring graph models of some complex networks. The GP system and abstract model were used to reproduce well-known graph models. The results indicated that the system was able to evolve models that produced networks that had structural similarities to the networks generated by the respective target models.