23 results for hierarchical tree-structure
Abstract:
The package HIERFSTAT for the statistical software R, created by the R Development Core Team, allows the estimation of hierarchical F-statistics from a hierarchy with any number of levels. In addition, it allows testing the statistical significance of population differentiation at these different levels, using a generalized likelihood-ratio test. The package HIERFSTAT is available at http://www.unil.ch/popgen/softwares/hierfstat.htm.
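To make the idea of F-statistics over a hierarchy concrete, here is a minimal Python sketch for a two-level hierarchy (subpopulations within regions within the total) at a single biallelic locus. It uses the textbook heterozygosity-based definitions and assumes equal subpopulation sizes; it is not the estimator or the likelihood-ratio test that HIERFSTAT implements, and the function names and example frequencies are hypothetical.

```python
def expected_heterozygosity(p):
    """Expected heterozygosity 2p(1-p) at a biallelic locus with allele frequency p."""
    return 2.0 * p * (1.0 - p)


def hierarchical_f(freqs_by_region):
    """Heterozygosity-based F-statistics for subpopulations nested in regions.

    freqs_by_region maps a region name to the list of allele frequencies of
    its subpopulations. Equal subpopulation sizes are assumed (illustrative
    simplification), so plain and count-weighted means are used.
    """
    all_freqs = [p for pops in freqs_by_region.values() for p in pops]
    n_pops = len(all_freqs)

    # H_S: mean expected heterozygosity within subpopulations.
    h_s = sum(expected_heterozygosity(p) for p in all_freqs) / n_pops

    # H_R: expected heterozygosity of each pooled region, weighted by the
    # number of subpopulations the region contains.
    h_r = sum(
        len(pops) * expected_heterozygosity(sum(pops) / len(pops))
        for pops in freqs_by_region.values()
    ) / n_pops

    # H_T: expected heterozygosity of the pooled total population.
    h_t = expected_heterozygosity(sum(all_freqs) / n_pops)

    return {
        "F_SR": (h_r - h_s) / h_r,  # subpopulations within regions
        "F_RT": (h_t - h_r) / h_t,  # regions within the total
        "F_ST": (h_t - h_s) / h_t,  # subpopulations within the total
    }


# Hypothetical allele frequencies for two regions with two subpopulations each.
result = hierarchical_f({"north": [0.2, 0.4], "south": [0.6, 0.8]})
print(result)
```

With these definitions the levels combine multiplicatively, (1 - F_ST) = (1 - F_SR)(1 - F_RT), which is what allows differentiation to be partitioned across the levels of a hierarchy.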
Abstract:
Crushed seeds of the Moringa oleifera tree have been used traditionally as natural flocculants to clarify drinking water. We previously showed that one of the seed peptides mediates both the sedimentation of suspended particles such as bacterial cells and a direct bactericidal activity, raising the possibility that the two activities might be related. In this study, the conformational modeling of the peptide was coupled to a functional analysis of synthetic derivatives. This indicated that partly overlapping structural determinants mediate the sedimentation and antibacterial activities. Sedimentation requires a positively charged, glutamine-rich portion of the peptide that aggregates bacterial cells. The bactericidal activity was localized to a sequence prone to form a helix-loop-helix structural motif. Amino acid substitution showed that the bactericidal activity requires hydrophobic proline residues within the protruding loop. Vital dye staining indicated that treatment with peptides containing this motif results in bacterial membrane damage. Assembly of multiple copies of this structural motif into a branched peptide enhanced antibacterial activity, since low concentrations effectively kill bacteria such as Pseudomonas aeruginosa and Streptococcus pyogenes without displaying a toxic effect on human red blood cells. This study thus identifies a synthetic peptide with potent antibacterial activity against specific human pathogens. It also suggests partly distinct molecular mechanisms for each activity. Sedimentation may result from coupled flocculation and coagulation effects, while the bactericidal activity would require bacterial membrane destabilization by a hydrophobic loop.
Abstract:
BACKGROUND AND AIMS: The study of local adaptation in plant reproductive traits has received substantial attention in short-lived species, but studies conducted on forest trees are scarce. This lack of research on long-lived species represents an important gap in our knowledge, because inferences about selection on the reproduction and life history of short-lived species cannot necessarily be extrapolated to trees. This study considers whether the size for first reproduction is locally adapted across a broad geographical range of the Mediterranean conifer species Pinus pinaster. In particular, the study investigates whether this monoecious species varies genetically among populations in terms of whether individuals start to reproduce through their male function, their female function or both sexual functions simultaneously. Whether differences among populations could be attributed to local adaptation across a climatic gradient is then considered. METHODS: Male and female reproduction and growth were measured during early stages of sexual maturity of a P. pinaster common garden comprising 23 populations sampled across the species range. Generalized linear mixed models were used to assess genetic variability of early reproductive life-history traits. Environmental correlations with reproductive life-history traits were tested after controlling for neutral genetic structure provided by 12 nuclear simple sequence repeat markers. KEY RESULTS: Trees tended to reproduce first through their male function, at a size (height) that varied little among source populations. The transition to female reproduction was slower, showed higher levels of variability and was negatively correlated with vegetative growth traits. Several female reproductive traits were correlated with a gradient of growth conditions, even after accounting for neutral genetic structure, with populations from more unfavourable sites tending to commence female reproduction at a lower individual size. 
CONCLUSIONS: The study represents the first report of genetic variability among populations for differences in the threshold size for first reproduction between male and female sexual functions in a tree species. The relatively uniform size at which individuals begin reproducing through their male function probably represents the fact that pollen dispersal is also relatively invariant among sites. However, the genetic variability in the timing of female reproduction probably reflects environment-dependent costs of cone production. The results also suggest that early sex allocation in this species might evolve under constraints that do not apply to other conifers.
Abstract:
The coverage and volume of geo-referenced datasets are extensive and constantly growing. The systematic capture of geo-referenced information generates large volumes of spatio-temporal data to be analyzed. Clustering and visualization play a key role in exploratory data analysis and in the extraction of knowledge embedded in these data. However, the special characteristics of these data pose new challenges for visualization and clustering: complex structures, a large number of samples, variables involved in a temporal context, high dimensionality and large variability in cluster shapes.
The central aim of my thesis is to propose new algorithms and methodologies for clustering and visualization, in order to assist knowledge extraction from spatio-temporal geo-referenced data, thus improving decision-making processes.
I present two original algorithms, one for clustering, the Fuzzy Growing Hierarchical Self-Organizing Networks (FGHSON), and one for exploratory visual data analysis, the Tree-structured Self-organizing Maps Component Planes. In addition, I present methodologies that, combined with the FGHSON and the Tree-structured SOM Component Planes, integrate space and time seamlessly and simultaneously in order to extract knowledge embedded in a temporal context.
The originality of the FGHSON lies in its capability to reflect the underlying structure of a dataset in a hierarchical fuzzy way. A hierarchical fuzzy representation of clusters is crucial when data include complex structures with large variability of cluster shapes, variances, densities and number of clusters. The most important characteristics of the FGHSON include: (1) It does not require the number of clusters to be set a priori. (2) The algorithm executes several self-organizing processes in parallel. Hence, when dealing with large datasets, the processes can be distributed, reducing the computational cost. (3) Only three parameters are necessary to set up the algorithm.
In the case of the Tree-structured SOM Component Planes, the novelty of the algorithm lies in its ability to create a structure that allows the visual exploratory data analysis of large high-dimensional datasets. This algorithm creates a hierarchical structure of Self-Organizing Map Component Planes, arranging similar variables' projections in the same branches of the tree. Hence, similarities in variables' behavior can be easily detected (e.g. local correlations, maximal and minimal values and outliers).
Both the FGHSON and the Tree-structured SOM Component Planes were applied to several agroecological problems, proving to be very efficient in the exploratory analysis and clustering of spatio-temporal datasets.
In this thesis I also tested three soft competitive learning algorithms. Two of them are well-known unsupervised soft competitive algorithms, namely the Self-Organizing Maps (SOMs) and the Growing Hierarchical Self-Organizing Maps (GHSOMs); the third is our original contribution, the FGHSON. Although the algorithms presented here have been used in several areas, to my knowledge no previous work has applied these techniques to spatio-temporal geospatial data and compared their performance, as is done in this thesis.
I propose original methodologies to explore spatio-temporal geo-referenced datasets through time. Our approach uses time windows to capture temporal similarities and variations by using the FGHSON clustering algorithm. The developed methodologies are used in two case studies. In the first, the objective was to find similar agroecozones through time, and in the second, it was to find similar environmental patterns shifted in time.
Several results presented in this thesis have led to new contributions to agroecological knowledge, for instance in sugar cane and blackberry production.
Finally, in the framework of this thesis we developed several software tools: (1) a Matlab toolbox that implements the FGHSON algorithm, and (2) a program called BIS (Bio-inspired Identification of Similar agroecozones), an interactive graphical user interface tool that integrates the FGHSON algorithm with Google Earth in order to show zones with similar agroecological characteristics.
Abstract:
Contact structure is believed to have a large impact on epidemic spreading, and consequently the use of networks to model such contact structure continues to gain interest in epidemiology. However, detailed knowledge of the exact contact structure underlying real epidemics is limited. Here we address the question of whether the structure of the contact network leaves a detectable genetic fingerprint in the pathogen population. To this end we compare phylogenies generated by disease outbreaks in simulated populations with different types of contact networks. We find that the shape of these phylogenies strongly depends on contact structure. In particular, measures of tree imbalance allow us to quantify to what extent the contact structure underlying an epidemic deviates from a null model contact network, and we illustrate this in the case of random mixing. Using a phylogeny from the Swiss HIV epidemic, we show that this epidemic has a significantly more unbalanced tree than would be expected from random mixing.
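As an illustration of what a tree imbalance measure looks like, the following Python sketch computes the Colless index, one commonly used statistic. The abstract does not say which measures the authors used, so the choice of index, the nested-tuple tree encoding and the example trees are all assumptions made for illustration.

```python
def colless_index(tree):
    """Colless imbalance index of a strictly binary tree.

    A tree is encoded as nested 2-tuples (left, right); anything that is not
    a tuple is treated as a leaf. The index sums, over all internal nodes,
    the absolute difference between the leaf counts of the two subtrees.
    """
    def walk(node):
        # Returns (number of leaves below node, Colless sum below node).
        if not isinstance(node, tuple):
            return 1, 0
        left, right = node
        n_left, c_left = walk(left)
        n_right, c_right = walk(right)
        return n_left + n_right, c_left + c_right + abs(n_left - n_right)

    return walk(tree)[1]


# Hypothetical four-tip trees: a fully balanced one and a "caterpillar".
balanced = (("a", "b"), ("c", "d"))
caterpillar = ((("a", "b"), "c"), "d")

print(colless_index(balanced))     # 0
print(colless_index(caterpillar))  # 3
```

A larger value means a more unbalanced tree. Comparing the value observed for a real phylogeny with the distribution obtained from trees simulated under a null model such as random mixing is the kind of test the abstract describes.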
Abstract:
Two-way alternating automata were introduced by Vardi in order to study the satisfiability problem for the modal μ-calculus extended with backwards modalities. In this paper, we present a very simple proof by way of Wadge games of the strictness of the hierarchy of Mostowski indices of two-way alternating automata over trees.
Abstract:
In this paper, we consider active sampling to label pixels grouped with hierarchical clustering. The objective of the method is to match the data relationships discovered by the clustering algorithm with the user's desired class semantics. The former are represented as a complete tree to be pruned, and the latter are iteratively provided by the user. The proposed active learning algorithm searches for the pruning of the tree that best matches the labels of the sampled points. By choosing the part of the tree to sample from according to the current pruning's uncertainty, sampling is focused on the most uncertain clusters. This way, large clusters for which the class membership is already fixed are no longer queried, and sampling is focused on the division of clusters showing mixed labels. The model is tested on a VHR image in a multiclass classification setting. The method clearly outperforms random sampling in a transductive setting, but cannot generalize to unseen data, since it aims at optimizing the classification of a given cluster structure.
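The sampling step can be sketched in Python as follows. This is a simplified illustration and not the authors' algorithm: scoring each cluster of the current pruning by its size times one minus the share of its majority label is an assumption, as are all function names and the example data.

```python
import random


def cluster_uncertainty(label_counts):
    """1 minus the majority-label fraction; clusters with no labels count as fully uncertain."""
    total = sum(label_counts.values())
    if total == 0:
        return 1.0
    return 1.0 - max(label_counts.values()) / total


def sampling_weights(clusters):
    """Probability of querying each cluster of the current pruning.

    clusters maps a cluster name to {"size": int, "labels": {label: count}}.
    The weight is proportional to size * uncertainty (assumed scoring rule).
    """
    raw = {
        name: c["size"] * cluster_uncertainty(c["labels"])
        for name, c in clusters.items()
    }
    norm = sum(raw.values())
    if norm == 0:
        # Every cluster looks pure: fall back to size-proportional sampling.
        raw = {name: float(c["size"]) for name, c in clusters.items()}
        norm = sum(raw.values())
    return {name: w / norm for name, w in raw.items()}


def pick_cluster(clusters, rng):
    """Draw the cluster from which the next pixel should be labelled."""
    weights = sampling_weights(clusters)
    names = list(weights)
    return rng.choices(names, weights=[weights[n] for n in names], k=1)[0]


# Hypothetical pruning with three clusters and the labels collected so far.
pruning = {
    "A": {"size": 100, "labels": {"water": 10}},             # pure, large
    "B": {"size": 50, "labels": {"urban": 3, "forest": 3}},  # mixed labels
    "C": {"size": 25, "labels": {}},                         # not yet sampled
}
weights = sampling_weights(pruning)
chosen = pick_cluster(pruning, random.Random(0))
print(weights, chosen)
```

Here the large but already pure cluster A receives zero weight, so queries go to the mixed cluster B and the unexplored cluster C, mirroring the behaviour described in the abstract.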
Abstract:
Determining the relative roles of vicariance and selection in restricting gene flow between populations is of central importance to the evolutionary process of population divergence and speciation. Here we use molecular and morphological data to contrast the effect of isolation (by mountains and geographical distance) with that of ecological factors (altitudinal gradients) in promoting differentiation in the wedge-billed woodcreeper, Glyphorynchus spirurus, a tropical forest bird, in Ecuador. Tarsus length and beak size increased relative to body size with altitude on both sides of the Andes, and were correlated with the amount of moss on tree trunks, suggesting the role of selection in driving adaptive divergence. In contrast, molecular data revealed a considerable degree of admixture along these altitudinal gradients, suggesting that adaptive divergence in morphological traits has occurred in the presence of gene flow. As suggested by mitochondrial DNA sequence data, the Andes act as a barrier to gene flow between ancient subspecific lineages. Genome-wide amplified fragment length polymorphism markers reflected more recent patterns of gene flow and revealed fine-scale patterns of population differentiation that were not detectable with mitochondrial DNA, including the differentiation of isolated coastal populations west of the Andes. Our results support the predominant role of geographical isolation in driving genetic differentiation in G. spirurus, yet suggest the role of selection in driving parallel morphological divergence along ecological gradients.