889 results for Random trees
Abstract:
Feature selection is one of the most important and frequently used techniques in data preprocessing. It can improve the efficiency and the effectiveness of data mining by reducing the dimensions of the feature space and removing irrelevant and redundant information. Feature selection can be viewed as a global optimization problem of finding a minimum set of M relevant features that describes the dataset as well as the original N attributes. In this paper, we apply the adaptive partitioned random search strategy in our feature selection algorithm. Under this search strategy, a partition structure and an evaluation function are proposed for the feature selection problem. This algorithm ensures the global optimal solution in theory and avoids complete randomness in search direction. The good properties of our algorithm are shown through theoretical analysis.
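To make the "minimum set of M relevant features" framing concrete, here is a minimal sketch of a plain random subset search. It is not the adaptive partitioned random search of the paper, whose partition structure and evaluation function are not given in the abstract. The consistency criterion, the function names, and the toy data are all assumptions made for illustration.

```python
import itertools
import random

def is_consistent(rows, labels, subset):
    """True if rows that agree on the features in `subset` always share a label."""
    seen = {}
    for row, label in zip(rows, labels):
        key = tuple(row[i] for i in sorted(subset))
        if seen.setdefault(key, label) != label:
            return False
    return True

def random_subset_search(rows, labels, n_iter=500, seed=0):
    """Plain random search for a small consistent feature subset (illustrative only)."""
    rng = random.Random(seed)
    n = len(rows[0])
    best = frozenset(range(n))  # fallback: the full feature set, assumed consistent
    for _ in range(n_iter):
        size = rng.randint(1, len(best))
        candidate = frozenset(rng.sample(range(n), size))
        # Accept only strictly smaller subsets that still describe the labels.
        if len(candidate) < len(best) and is_consistent(rows, labels, candidate):
            best = candidate
    return best

# Hypothetical toy data: 4 binary features, label depends only on features 0 and 2.
rows = list(itertools.product([0, 1], repeat=4))
labels = [r[0] ^ r[2] for r in rows]
selected = random_subset_search(rows, labels)
```

On this toy data the only two-feature subset that is consistent with the labels is {0, 2}, and no single feature suffices, so the search should settle on that pair.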
Abstract:
Formal Concept Analysis is an unsupervised machine learning technique that has successfully been applied to document organisation by considering documents as objects and keywords as attributes. The basic algorithms of Formal Concept Analysis then allow an intelligent information retrieval system to cluster documents according to keyword views. This paper investigates the scalability of this idea. In particular, we present the results of applying spatial data structures to large datasets in Formal Concept Analysis. Our experiments are motivated by the application of the Formal Concept Analysis idea of a virtual filesystem [11,17,15], in particular the libferris [1] Semantic File System. This paper presents customizations to an RD-Tree Generalized Index Search Tree based index structure to better support the application of Formal Concept Analysis to large data sources.
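As an illustration of documents as objects and keywords as attributes, the sketch below enumerates the formal concepts of a tiny context by brute force. The document names and keywords are hypothetical, and the naive enumeration over all keyword subsets is only workable for toy inputs; it does not reflect the index structure the paper describes.

```python
from itertools import combinations

# Hypothetical context: each document (object) maps to its keywords (attributes).
context = {
    "doc1": {"tree", "index"},
    "doc2": {"tree", "search"},
    "doc3": {"tree", "index", "search"},
}

def extent(attrs):
    """All documents that carry every keyword in `attrs`."""
    return frozenset(d for d, kws in context.items() if attrs <= kws)

def intent(docs):
    """All keywords shared by every document in `docs`."""
    sets = [context[d] for d in docs]
    if not sets:
        return frozenset(set().union(*context.values()))
    return frozenset(set.intersection(*sets))

def concepts():
    """Naive enumeration: close every keyword subset into an (extent, intent) pair."""
    attrs = sorted(set().union(*context.values()))
    found = set()
    for r in range(len(attrs) + 1):
        for combo in combinations(attrs, r):
            e = extent(set(combo))
            found.add((e, intent(e)))
    return found

all_concepts = concepts()
```

Each resulting pair groups the documents that share exactly a given keyword view, which is the clustering behaviour the abstract refers to.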
Abstract:
Deforestation in southeast Brazil has led to the extinction of Hymenaea courbaril var. stilbocarpa and ex situ conservation has been established. In this study, the levels of genetic diversity and the effective population size of H. courbaril in a germplasm bank were investigated using six nuclear microsatellite loci. A total of 79 and 91 alleles were found in 65 seed-trees and their 176 offspring, respectively. Offspring have a higher average number of alleles per locus (A = 15.2) than seed-trees (A = 13.2), but lower observed heterozygosity (offspring: H_o = 0.566; seed-trees: H_o = 0.607). The estimate of outcrossing rate shows that the study population is perfectly outcrossed (t_m = 0.978, P > 0.05). Significant deviations from random mating were detected through mating among relatives and correlated matings. The average variance in effective population size for each family was 2.63, with a total effective population size retained in the bank of 170.1. These results confirm that the preserved population of H. courbaril retains substantial genetic variability.
Abstract:
Most cellular solids are random materials, while practically all theoretical structure-property results are for periodic models. To be able to generate theoretical results for random models, the finite element method (FEM) was used to study the elastic properties of solids with a closed-cell cellular structure. We have computed the density (rho) and microstructure dependence of the Young's modulus (E) and Poisson's ratio (PR) for several different isotropic random models based on Voronoi tessellations and level-cut Gaussian random fields. The effect of partially open cells is also considered. The results, which are best described by a power law E ∝ rho^n (1 < n < 2), show the influence of randomness and isotropy on the properties of closed-cell cellular materials, and are found to be in good agreement with experimental data. (C) 2001 Acta Materialia Inc. Published by Elsevier Science Ltd. All rights reserved.
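A power law of this form is usually identified by a straight-line fit in log-log space, since log E = log C + n log rho. The sketch below shows that fit on synthetic data generated with n = 1.5 and prefactor C = 2.0. Those values and the function name are assumptions for illustration and are not results from the paper.

```python
import math

def fit_power_law(rho, modulus):
    """Least-squares fit of modulus = C * rho**n in log-log space; returns (n, C)."""
    xs = [math.log(r) for r in rho]
    ys = [math.log(e) for e in modulus]
    count = len(xs)
    mean_x = sum(xs) / count
    mean_y = sum(ys) / count
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    n = sxy / sxx                          # slope = exponent
    c = math.exp(mean_y - n * mean_x)      # intercept = log of prefactor
    return n, c

# Synthetic, noise-free data (not from the paper): E = 2.0 * rho**1.5
rho = [0.1, 0.2, 0.3, 0.4]
modulus = [2.0 * r ** 1.5 for r in rho]
n_fit, c_fit = fit_power_law(rho, modulus)
```

With real FEM or experimental data the points would scatter around the line, and the fitted exponent would carry an uncertainty that this sketch does not estimate.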
Abstract:
A mixture model incorporating long-term survivors has been adopted in the field of biostatistics where some individuals may never experience the failure event under study. The surviving fractions may be considered as cured. In most applications, the survival times are assumed to be independent. However, when the survival data are obtained from a multi-centre clinical trial, it is conceived that the environmental conditions and facilities shared within a clinic affect the proportion cured as well as the failure risk for the uncured individuals. This necessitates a long-term survivor mixture model with random effects. In this paper, the long-term survivor mixture model is extended for the analysis of multivariate failure time data using the generalized linear mixed model (GLMM) approach. The proposed model is applied to analyse a numerical data set from a multi-centre clinical trial of carcinoma as an illustration. Some simulation experiments are performed to assess the applicability of the model based on the average biases of the estimates formed. Copyright (C) 2001 John Wiley & Sons, Ltd.
Abstract:
It has been established that large numbers of certain trees can survive in the beds of rivers of northeastern Australia where a strongly seasonal distribution of precipitation causes extreme variations in flow on both a yearly and longer-term basis. In these rivers, minimal flow occurs throughout much of any year and for periods of up to several years, allowing the trees to become established and to adapt their form in order to facilitate their survival in environments that experience periodic inundation by fast-flowing, debris-laden water. Such trees (notably paperbark trees of the angiosperm genus Melaleuca) adopt a reclined to prostrate, downstream-trailing habit and a multiple-stemmed form with a modified crown and weeping foliage, develop thick, spongy bark, anchor their roots into firm to lithified substrates beneath the channel floor, regenerate roots, and grow in flow-parallel, linear groves. Individuals from within flow-parallel, linear groves are preserved in situ within the alluvial deposit of the river following burial and death. Four examples of in situ tree fossils within alluvial channel deposits in the Permian of eastern Australia demonstrate that specialised riverbed plant communities also existed at times in the geological past. These examples, from the Lower Permian Carmila Beds, Upper Permian Moranbah Coal Measures and Baralaba Coal Measures of central Queensland and the Upper Permian Newcastle Coal Measures of central New South Wales, show several of the characteristics of trees described from modern rivers in northeastern Australia, including preservation in closely-spaced groups. These properties, together with independent sedimentological evidence, suggest that the Permian trees were adapted to an environment affected by highly variable runoff, albeit in a more temperate climatic situation than the modern Australian examples.
It is proposed that occurrences of fossil trees preserved in situ within alluvial channel deposits may be diagnostic of environments controlled by seasonal and longer-term variability in fluvial runoff, and hence may have value in interpreting aspects of palaeoclimate from ancient alluvial successions. (C) 2001 Elsevier Science B.V. All rights reserved.
Abstract:
Surrogate methods for detecting lateral gene transfer are those that do not require inference of phylogenetic trees. Herein I apply four such methods to identify open reading frames (ORFs) in the genome of Escherichia coli K12 that may have arisen by lateral gene transfer. Only two of these methods detect the same ORFs more frequently than expected by chance, whereas several intersections contain many fewer ORFs than expected. Each of the four methods detects a different non-random set of ORFs. The methods may detect lateral ORFs of different relative ages; testing this hypothesis will require rigorous inference of trees. (C) 2001 Federation of European Microbiological Societies. Published by Elsevier Science B.V. All rights reserved.
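One common way to judge whether two methods flag the same ORFs "more frequently than expected by chance" is to compare the observed overlap with the overlap expected if the two sets were drawn independently from the genome. The sketch below assumes a hypergeometric null model, which the abstract does not confirm as the test actually used. The function names and the example counts are hypothetical.

```python
from math import comb

def expected_overlap(n_total, n_a, n_b):
    """Expected size of the intersection of two independent random subsets."""
    return n_a * n_b / n_total

def prob_overlap_at_least(n_total, n_a, n_b, k_obs):
    """Hypergeometric upper tail: P(overlap >= k_obs) under independence."""
    upper = min(n_a, n_b)
    favourable = sum(
        comb(n_a, k) * comb(n_total - n_a, n_b - k)
        for k in range(k_obs, upper + 1)
    )
    return favourable / comb(n_total, n_b)

# Hypothetical counts: 100 ORFs in total, method A flags 20, method B flags 30.
mean_overlap = expected_overlap(100, 20, 30)
p_enriched = prob_overlap_at_least(100, 20, 30, 12)
```

A small upper-tail probability suggests the two methods agree more than chance would predict. An overlap well below the expected value, as reported for several intersections, would be assessed with the lower tail instead.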
Abstract:
A finite-element method is used to study the elastic properties of random three-dimensional porous materials with highly interconnected pores. We show that Young's modulus, E, is practically independent of Poisson's ratio of the solid phase, nu(s), over the entire solid fraction range, and Poisson's ratio, nu, becomes independent of nu(s) as the percolation threshold is approached. We represent this behaviour of nu in a flow diagram. This interesting but approximate behaviour is very similar to the exactly known behaviour in two-dimensional porous materials. In addition, the behaviour of nu versus nu(s) appears to imply that information in the dilute porosity limit can affect behaviour in the percolation threshold limit. We summarize the finite-element results in terms of simple structure-property relations, instead of tables of data, to make it easier to apply the computational results. Without using accurate numerical computations, one is limited to various effective medium theories and rigorous approximations like bounds and expansions. The accuracy of these equations is unknown for general porous media. To verify a particular theory it is important to check that it predicts both isotropic elastic moduli, i.e. prediction of Young's modulus alone is necessary but not sufficient. The subtleties of Poisson's ratio behaviour actually provide a very effective method for showing differences between the theories and demonstrating their ranges of validity. We find that for moderate- to high-porosity materials, none of the analytical theories is accurate and, at present, numerical techniques must be relied upon.
Abstract:
Partitioned Bremer support (PBS) is a valuable means of assessing congruence in combined data sets, but some aspects require clarification. When more than one equally parsimonious tree is found during the constrained search for trees lacking the node of interest, averaging PBS for each data set across these trees can conceal conflict, and PBS should ideally be examined for each constrained tree. Similarly, when multiple most parsimonious trees (MPTs) are generated during analysis of the combined data, PBS is usually calculated on the consensus tree. However, extra information can be obtained if PBS is calculated on each of the MPTs or even suboptimal trees. (C) 2002 The Willi Hennig Society. Published by Elsevier Science (USA). All rights reserved.
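The point about averaging can be seen with a small arithmetic sketch. For a given node, the PBS of a data partition is its tree length on a constrained tree lacking the node minus its length on the most parsimonious tree. The partition names and step counts below are hypothetical and chosen so that the two constrained trees are equally parsimonious.

```python
def partitioned_bremer(lengths_mpt, lengths_constrained):
    """PBS per partition: length on the constrained tree minus length on the MPT."""
    return {p: lengths_constrained[p] - lengths_mpt[p] for p in lengths_mpt}

# Hypothetical step counts per data partition (not from any real analysis).
mpt = {"geneA": 100, "geneB": 80}             # total length 180
constrained_1 = {"geneA": 103, "geneB": 79}   # total length 182
constrained_2 = {"geneA": 99, "geneB": 83}    # total length 182

pbs_1 = partitioned_bremer(mpt, constrained_1)
pbs_2 = partitioned_bremer(mpt, constrained_2)

# Averaging PBS across the two equally parsimonious constrained trees.
pbs_mean = {p: (pbs_1[p] + pbs_2[p]) / 2 for p in mpt}
```

Here each constrained tree shows one partition with negative support (conflict), yet the averaged values are both positive, so the conflict disappears from the summary. In both cases the PBS values sum to the total Bremer support of 2.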