141 resultados para Algorithms genetics


Relevância:

20.00% 20.00%

Publicador:

Resumo:

This paper presents general problems and approaches for the spatial data analysis using machine learning algorithms. Machine learning is a very powerful approach to adaptive data analysis, modelling and visualisation. The key feature of the machine learning algorithms is that they learn from empirical data and can be used in cases when the modelled environmental phenomena are hidden, nonlinear, noisy and highly variable in space and in time. Most of the machines learning algorithms are universal and adaptive modelling tools developed to solve basic problems of learning from data: classification/pattern recognition, regression/mapping and probability density modelling. In the present report some of the widely used machine learning algorithms, namely artificial neural networks (ANN) of different architectures and Support Vector Machines (SVM), are adapted to the problems of the analysis and modelling of geo-spatial data. Machine learning algorithms have an important advantage over traditional models of spatial statistics when problems are considered in a high dimensional geo-feature spaces, when the dimension of space exceeds 5. Such features are usually generated, for example, from digital elevation models, remote sensing images, etc. An important extension of models concerns considering of real space constrains like geomorphology, networks, and other natural structures. Recent developments in semi-supervised learning can improve modelling of environmental phenomena taking into account on geo-manifolds. An important part of the study deals with the analysis of relevant variables and models' inputs. This problem is approached by using different feature selection/feature extraction nonlinear tools. To demonstrate the application of machine learning algorithms several interesting case studies are considered: digital soil mapping using SVM, automatic mapping of soil and water system pollution using ANN; natural hazards risk analysis (avalanches, landslides), assessments of renewable resources (wind fields) with SVM and ANN models, etc. The dimensionality of spaces considered varies from 2 to more than 30. Figures 1, 2, 3 demonstrate some results of the studies and their outputs. Finally, the results of environmental mapping are discussed and compared with traditional models of geostatistics.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

To make a comprehensive evaluation of organ-specific out-of-field doses using Monte Carlo (MC) simulations for different breast cancer irradiation techniques and to compare results with a commercial treatment planning system (TPS). Three breast radiotherapy techniques using 6MV tangential photon beams were compared: (a) 2DRT (open rectangular fields), (b) 3DCRT (conformal wedged fields), and (c) hybrid IMRT (open conformal+modulated fields). Over 35 organs were contoured in a whole-body CT scan and organ-specific dose distributions were determined with MC and the TPS. Large differences in out-of-field doses were observed between MC and TPS calculations, even for organs close to the target volume such as the heart, the lungs and the contralateral breast (up to 70% difference). MC simulations showed that a large fraction of the out-of-field dose comes from the out-of-field head scatter fluence (>40%) which is not adequately modeled by the TPS. Based on MC simulations, the 3DCRT technique using external wedges yielded significantly higher doses (up to a factor 4-5 in the pelvis) than the 2DRT and the hybrid IMRT techniques which yielded similar out-of-field doses. In sharp contrast to popular belief, the IMRT technique investigated here does not increase the out-of-field dose compared to conventional techniques and may offer the most optimal plan. The 3DCRT technique with external wedges yields the largest out-of-field doses. For accurate out-of-field dose assessment, a commercial TPS should not be used, even for organs near the target volume (contralateral breast, lungs, heart).

Relevância:

20.00% 20.00%

Publicador:

Relevância:

20.00% 20.00%

Publicador:

Resumo:

A population-genetic model indicates that if there is a gene responsible for homosexual behaviour it can readily spread in populations. The model also predicts widespread bisexuality in humans.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Plasmodium falciparum is the parasite responsible for the most acute form of malaria in humans. Recently, the serine repeat antigen (SERA) in P. falciparum has attracted attention as a potential vaccine and drug target, and it has been shown to be a member of a large gene family. To clarify the relationships among the numerous P. falciparum SERAs and to identify orthologs to SERA5 and SERA6 in Plasmodium species affecting rodents, gene trees were inferred from nucleotide and amino acid sequence data for 33 putative SERA homologs in seven different species. (A distance method for nucleotide sequences that is specifically designed to accommodate differing GC content yielded results that were largely compatible with the amino acid tree. Standard-distance and maximum-likelihood methods for nucleotide sequences, on the other hand, yielded gene trees that differed in important respects.) To infer the pattern of duplication, speciation, and gene loss events in the SERA gene family history, the resulting gene trees were then "reconciled" with two competing Plasmodium species tree topologies that have been identified by previous phylogenetic studies. Parsimony of reconciliation was used as a criterion for selecting a gene tree/species tree pair and provided (1) support for one of the two species trees and for the core topology of the amino acid-derived gene tree, (2) a basis for critiquing fine detail in a poorly resolved region of the gene tree, (3) a set of predicted "missing genes" in some species, (4) clarification of the relationship among the P. falciparum SERA, and (5) some information about SERA5 and SERA6 orthologs in the rodent malaria parasites. Parsimony of reconciliation and a second criterion--implied mutational pattern at two key active sites in the SERA proteins-were also seen to be useful supplements to standard "bootstrap" analysis for inferred topologies.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Genetically engineered bioreporters are an excellent complement to traditional methods of chemical analysis. The application of fluorescence flow cytometry to detection of bioreporter response enables rapid and efficient characterization of bacterial bioreporter population response on a single-cell basis. In the present study, intrapopulation response variability was used to obtain higher analytical sensitivity and precision. We have analyzed flow cytometric data for an arsenic-sensitive bacterial bioreporter using an artificial neural network-based adaptive clustering approach (a single-layer perceptron model). Results for this approach are far superior to other methods that we have applied to this fluorescent bioreporter (e.g., the arsenic detection limit is 0.01 microM, substantially lower than for other detection methods/algorithms). The approach is highly efficient computationally and can be implemented on a real-time basis, thus having potential for future development of high-throughput screening applications.

Relevância:

20.00% 20.00%

Publicador:

Relevância:

20.00% 20.00%

Publicador:

Resumo:

BACKGROUND: The evolutionary lineage leading to the teleost fish underwent a whole genome duplication termed FSGD or 3R in addition to two prior genome duplications that took place earlier during vertebrate evolution (termed 1R and 2R). Resulting from the FSGD, additional copies of genes are present in fish, compared to tetrapods whose lineage did not experience the 3R genome duplication. Interestingly, we find that ParaHox genes do not differ in number in extant teleost fishes despite their additional genome duplication from the genomic situation in mammals, but they are distributed over twice as many paralogous regions in fish genomes. RESULTS: We determined the DNA sequence of the entire ParaHox C1 paralogon in the East African cichlid fish Astatotilapia burtoni, and compared it to orthologous regions in other vertebrate genomes as well as to the paralogous vertebrate ParaHox D paralogons. Evolutionary relationships among genes from these four chromosomal regions were studied with several phylogenetic algorithms. We provide evidence that the genes of the ParaHox C paralogous cluster are duplicated in teleosts, just as it had been shown previously for the D paralogon genes. Overall, however, synteny and cluster integrity seems to be less conserved in ParaHox gene clusters than in Hox gene clusters. Comparative analyses of non-coding sequences uncovered conserved, possibly co-regulatory elements, which are likely to contain promoter motives of the genes belonging to the ParaHox paralogons. CONCLUSION: There seems to be strong stabilizing selection for gene order as well as gene orientation in the ParaHox C paralogon, since with a few exceptions, only the lengths of the introns and intergenic regions differ between the distantly related species examined. The high degree of evolutionary conservation of this gene cluster's architecture in particular - but possibly clusters of genes more generally - might be linked to the presence of promoter, enhancer or inhibitor motifs that serve to regulate more than just one gene. Therefore, deletions, inversions or relocations of individual genes could destroy the regulation of the clustered genes in this region. The existence of such a regulation network might explain the evolutionary conservation of gene order and orientation over the course of hundreds of millions of years of vertebrate evolution. Another possible explanation for the highly conserved gene order might be the existence of a regulator not located immediately next to its corresponding gene but further away since a relocation or inversion would possibly interrupt this interaction. Different ParaHox clusters were found to have experienced differential gene loss in teleosts. Yet the complete set of these homeobox genes was maintained, albeit distributed over almost twice the number of chromosomes. Selection due to dosage effects and/or stoichiometric disturbance might act more strongly to maintain a modal number of homeobox genes (and possibly transcription factors more generally) per genome, yet permit the accumulation of other (non regulatory) genes associated with these homeobox gene clusters.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

ABSTRACTThe Online Mendelian Inheritance in Man database (OMIM) reports about 3000 Mendelian diseases of known causal gene and about 2000 that remain to be mapped. These cases are often difficult to solve because of the rareness of the disease, the structure of the family (too big or too small) or the heterogeneity of the phenotype. The goal of this thesis is to explore the current genetic tools, before the advent of ultra high throughput sequencing, and integrate them in the attempt to map the genes behind the four studied cases. In this framework we have studied a small family with a recessive disease, a modifier gene for the penetrance of a dominant mutation, a large extended family with a cardiac phenotype and clinical and/or allelic heterogeneity and we have molecularly analyzed a balanced chromosomal translocation.RESUMELa base de données des maladies à transmission mendélienne, Online Mendelian Inheritance in Man (OMIM), contient environ 3000 affections à caractère mendélien pour lesquelles le gène responsable est connu et environ 2000 qui restent à élucider.Les cas restant à résoudre sont souvent difficiles soit par le caractère intrinsèquement rare de ces maladies soit à cause de difficultés structurelles (famille trop petite ou trop étendue) ou hétérogénéité du phénotype ou génétique. Cette thèse s'inscrit avant l'arrivée des nouveaux outils de séquençage à haut débit. Son but est d'explorer les outils génétiques actuels, et de les intégrer pour trouver les gènes impliqués dans quatre cas représentant chacun une situation génétique différente : nous avons étudié une famille de quatre individus avec une transmission récessive, recherché un gène modificateur de la pénétrance de mutations dominantes, étudié une famille étendue présentant un phénotype cardiaque cliniquement et/ou allèliquement hétérogène et nous avons fait l'analyse moléculaire d'une translocation chromosomique balancée.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Catalase is an important virulence factor for survival in macrophages and other phagocytic cells. In Chlamydiaceae, no catalase had been described so far. With the sequencing and annotation of the full genomes of Chlamydia-related bacteria, the presence of different catalase-encoding genes has been documented. However, their distribution in the Chlamydiales order and the functionality of these catalases remain unknown. Phylogeny of chlamydial catalases was inferred using MrBayes, maximum likelihood, and maximum parsimony algorithms, allowing the description of three clade 3 and two clade 2 catalases. Only monofunctional catalases were found (no catalase-peroxidase or Mn-catalase). All presented a conserved catalytic domain and tertiary structure. Enzymatic activity of cloned chlamydial catalases was assessed by measuring hydrogen peroxide degradation. The catalases are enzymatically active with different efficiencies. The catalase of Parachlamydia acanthamoebae is the least efficient of all (its catalytic activity was 2 logs lower than that of Pseudomonas aeruginosa). Based on the phylogenetic analysis, we hypothesize that an ancestral class 2 catalase probably was present in the common ancestor of all current Chlamydiales but was retained only in Criblamydia sequanensis and Neochlamydia hartmannellae. The catalases of class 3, present in Estrella lausannensis and Parachlamydia acanthamoebae, probably were acquired by lateral gene transfer from Rhizobiales, whereas for Waddlia chondrophila they likely originated from Legionellales or Actinomycetales. The acquisition of catalases on several occasions in the Chlamydiales suggests the importance of this enzyme for the bacteria in their host environment.