959 results for Clustering methods
Abstract:
The spatial patterns of the species of the sandy plain forests ("restinga" forests) have not yet been studied as an aid to understanding the diversity found in the different physiognomies along the Brazilian coast. In this paper, a framework of 10 x 10 m quadrats laid out in one hectare of tree-dominated forest in the sandy plains of the Picinguaba area of the Serra do Mar State Park (municipality of Ubatuba, state of São Paulo, Brazil) was used to assess the spatial distribution pattern of the ten most important species: Pera glabrata, Euterpe edulis, Eugenia brasiliensis, Alchornea triplinervia, Guatteria australis, Myrcia racemosa, Jacaranda semiserrata, Guarea macrophylla, Euplassa cantareirae and Nectandra oppositifolia. The spatial patterns were inferred by calculating the T-Square Index (C) and the Dispersal Distance Index (I) for each species. P. glabrata shows a random pattern and E. edulis an aggregated one; E. brasiliensis, A. triplinervia, G. australis, E. cantareirae and N. oppositifolia tend towards a pattern between aggregated and uniform; and M. racemosa, J. semiserrata and G. macrophylla fall between aggregated and random. Although the indices depend on the sample size and on adjustments to the technique, clustering methods show the relationship between the patterns and environmental factors. The results confirm how spatial patterns reflect associations between populations and the shape of the vegetation physiognomy.
PHYLOGENETIC STUDIES OF SOME SPECIES OF THE GENUS COFFEA .2. NUMERICAL-ANALYSIS OF ISOENZYMATIC DATA
Abstract:
Thirteen species of Coffea were studied for five enzyme systems, including alpha and beta esterase, alkaline phosphatase, acid phosphatase, malate dehydrogenase and acid dehydrogenase. Three similarity coefficients (Simple Matching, Jaccard and Ochiai) and three clustering methods (Single Linkage, Complete Linkage and the Unweighted Pair Group Method using Arithmetic Averages, UPGMA) were used to analyse the data. The phylogenetic relationships among the twelve diploid species, and between them and the tetraploid species C. arabica, showed that similarity among species of the same subsection is not always greater than among species of different subsections. In addition, although several similarity groups are shared among the classifications established by isoenzymatic polymorphism, morphological characteristics, chemical data, crossability and geographic distribution, no common trend emerges from the phylogenetic relationships indicated by these different evaluation procedures.
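The three similarity coefficients named in this abstract can be sketched for a pair of binary (band present/absent) isoenzyme profiles. This is a minimal illustration using the standard textbook definitions. The function name and the example profiles are hypothetical and are not taken from the study.

```python
from math import sqrt

def similarity_coefficients(x, y):
    """Return (simple matching, Jaccard, Ochiai) for two 0/1 band profiles.

    a = bands present in both, b = only in x, c = only in y, d = absent in both.
    """
    if len(x) != len(y):
        raise ValueError("profiles must have the same length")
    a = sum(1 for i, j in zip(x, y) if i == 1 and j == 1)
    b = sum(1 for i, j in zip(x, y) if i == 1 and j == 0)
    c = sum(1 for i, j in zip(x, y) if i == 0 and j == 1)
    d = sum(1 for i, j in zip(x, y) if i == 0 and j == 0)
    n = a + b + c + d
    simple_matching = (a + d) / n if n else 0.0           # counts joint absences
    jaccard = a / (a + b + c) if (a + b + c) else 0.0     # ignores joint absences
    ochiai = a / sqrt((a + b) * (a + c)) if (a + b) and (a + c) else 0.0
    return simple_matching, jaccard, ochiai

# Hypothetical band profiles for two species (1 = band present).
sm, jac, och = similarity_coefficients([1, 1, 0, 0, 1], [1, 0, 0, 1, 1])
```

Simple Matching treats shared absences as evidence of similarity while Jaccard and Ochiai do not, which is one reason the three coefficients can produce different dendrograms from the same data.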
Abstract:
This article presents a quantitative and objective approach to cat ganglion cell characterization and classification. Several biologically relevant features (diameter, eccentricity, fractal dimension, influence histogram, influence area, convex hull area, and convex hull diameter) are derived from geometrical transforms and then processed by three different clustering methods (Ward's hierarchical scheme, K-means and a genetic algorithm), whose results are then combined by a voting strategy. The experiments indicate the superiority of some features and also suggest some possible biological implications.
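The abstract does not describe how the votes are counted, so the following is only one possible sketch of combining several clusterings: two items are placed in the same consensus group when a strict majority of the input partitions put them together (a co-association vote). The function name and the example label lists are hypothetical, and this should not be read as the article's own procedure.

```python
from itertools import combinations

def majority_vote_consensus(partitions):
    """Merge items that share a cluster in more than half of the partitions."""
    n = len(partitions[0])
    parent = list(range(n))  # union-find forest over item indices

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for i, j in combinations(range(n), 2):
        votes = sum(1 for labels in partitions if labels[i] == labels[j])
        if 2 * votes > len(partitions):
            parent[find(i)] = find(j)

    # Relabel groups as 0, 1, 2, ... in order of first appearance.
    relabel = {}
    return [relabel.setdefault(find(i), len(relabel)) for i in range(n)]

# Three hypothetical partitions of four cells (label values are arbitrary).
consensus = majority_vote_consensus([[0, 0, 1, 1], [1, 1, 0, 0], [0, 0, 0, 1]])
```

Voting on pairs rather than on raw labels avoids having to match cluster labels across methods, since label values from different algorithms are not comparable.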
Abstract:
(10) Hygiea is the fourth largest asteroid of the main belt, by volume and mass, and it is the largest member of its family, which consists mostly of low-albedo, C-type asteroids typical of the outer main belt. Like many other large families, it is associated with a 'halo' of objects that extends far beyond the boundary of the core family, as detected by traditional hierarchical clustering methods (HCM) in proper element domains. Numerical simulations of the orbital evolution of family members may help in estimating the ages of the family and of its halo, and the original ejection velocity field. However, in order to minimize the errors associated with including too many interlopers, it is important to have good estimates of family membership that include available data on local asteroid taxonomy, geometrical albedo and local dynamics. For this purpose, we obtained synthetic proper elements and frequencies of asteroids in the Hygiea orbital region, with their errors. We revised the current knowledge on asteroid taxonomy, including Sloan Digital Sky Survey-Moving Object Catalog 4th release (SDSS-MOC 4) data, and geometric albedo data from the Wide-field Infrared Survey Explorer (WISE) and Near-Earth Object WISE (NEOWISE). We identified asteroid family members using HCM in the domain of proper elements (a, e, sin (i)) and in the domains of proper frequencies most appropriate to study diffusion in the local web of secular resonances, and eliminated possible interlopers based on taxonomic and geometrical albedo considerations. To identify the family halo, we devised a new hierarchical clustering method in an extended domain that includes proper elements, the principal components PC1 and PC2 obtained from SDSS photometric data and, for the first time, WISE and NEOWISE geometric albedo. Data on asteroid size distribution, light curves and rotations were also revised for the Hygiea family.
The Hygiea family is the largest group in its region, with two smaller families in the proper element domain and 18 families in various frequency domains identified in this work for the first time. Frequency groups tend to extend vertically in the (a, sin (i)) plane and cross not only the Hygiea family but also the nearby C-type families of Themis and Veritas, causing a mixture of objects, all of relatively low albedo, in the Hygiea family area. A few high-albedo asteroids, most likely associated with the Eos family, are also present in the region. Finally, the new multidomain hierarchical clustering method allowed us to obtain a good and robust estimate of the membership of the Hygiea family halo, well separated from the halos of other asteroid families in the region, and with a very limited (about 3 per cent) presence of likely interlopers. © 2013 The Author. Published by Oxford University Press on behalf of the Royal Astronomical Society.
Abstract:
Graduate Program in Agronomy (Genetics and Plant Breeding) - FCAV
Abstract:
Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)
Abstract:
The aim of this study was to develop an objective method to determine the incidence of pleiomorphisms and its influence on the distribution of sperm morphometric subpopulations in ejaculates of howling monkeys (Alouatta caraya) by using a combination of a computerized analysis system (ASMA) and principal component analysis (PCA). Ejaculates were collected by electroejaculation on a regular basis from five individuals maintained under identical captive environmental, nutritional, and management conditions. Each sperm head was measured for dimensional parameters (Area [A, µm²], Perimeter [P, µm], Length [L, µm], and Width [W, µm]) and shape-derived parameters (Ellipticity [L/W], Elongation [(L - W)/(L + W)], and Rugosity [4πA/P²]). PCA revealed two principal components explaining more than 96% of the variance. Clustering methods and discriminant analyses were performed, and seven separate subpopulations were identified. There were differences (P < 0.001) in the distribution of the seven subpopulations as well as in the incidence of abnormal pleiomorphisms (58.6%, 49.8%, 35.1%, 66.4%, and 55.1%; P < 0.05) among the five donors tested. Our results indicated that the differences among individuals in the incidence of pleiomorphisms and in sperm subpopulation structure were not related to the captivity conditions or the sperm collection method, since all individuals were studied under identical conditions. In conclusion, the combination of ASMA and PCA is a useful clinical diagnostic resource for detecting deficiencies in sperm morphology and sperm subpopulations in A. caraya ejaculates, and it could be used in ex situ conservation programs for threatened species of the genus Alouatta or even for other endangered neotropical primate species.
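The three shape-derived parameters listed in this abstract follow directly from the four dimensional measurements. The sketch below assumes that the rugosity expression printed in the source is 4πA/P² (the perimeter squared in the denominator). The function name and the test values are illustrative only.

```python
from math import pi

def shape_parameters(area, perimeter, length, width):
    """Compute shape descriptors of a sperm head from its dimensions.

    area in square micrometers; perimeter, length and width in micrometers.
    """
    return {
        "ellipticity": length / width,
        "elongation": (length - width) / (length + width),
        # Assumed reading of the source formula: 4*pi*A / P**2.
        "rugosity": 4 * pi * area / perimeter ** 2,
    }

# Sanity check on a circle of radius 1: L = W = 2, A = pi, P = 2*pi.
circle = shape_parameters(area=pi, perimeter=2 * pi, length=2.0, width=2.0)
```

For a perfect circle the ellipticity is 1, the elongation is 0 and the rugosity is 1; under this reading, the rugosity falls below 1 as the outline departs from a circle.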
Abstract:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Abstract:
Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)
Abstract:
Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)
Abstract:
Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)
Abstract:
Risers are flexible multilayered pipes formed by an inner flexible metal structure surrounded by polymer layers and spiral wound steel ligaments, also known as armor wires. Since these risers are used to link subsea pipelines to floating oil and gas production installations, and their failure could produce catastrophic consequences, some methods have been proposed to monitor the armor integrity. However, until now there is no practical method that allows the automatic non-destructive detection of individual armor wire rupture. In this work we show a method using magnetic Barkhausen noise that has shown high efficiency in the detection of armor wire rupture. The results are examined under the cyclic and static load conditions of the riser. This work also analyzes the theory behind the singular dependence of the magnetic Barkhausen noise on the applied tension in riser armor wires.
Abstract:
The objective of this work was to compare three methods for determining the number of groups in studies that apply hierarchical clustering methods, based on data obtained from the characterization of Capsicum accessions, in order to identify the one with the greatest discriminating power. The Mojena method, the Tocher method and the RMSSTD method were applied to determine the optimal number of groups formed in the final stage of the clustering procedure with the UPGMA method. Forty-nine accessions of the species Capsicum chinense from the Vegetable Germplasm Bank of the Universidade Federal de Viçosa were analysed for ten morphological traits in order to identify and group the most similar accessions, making it possible to select superior genotypes, that is, those with the commercial traits of interest. The results showed that the RMSSTD method indicated the existence of seven groups, demonstrating greater discriminating power than the Tocher optimization method and the Mojena method, which formed four and three groups, respectively.
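As an illustration of one of the three criteria compared here, Mojena's stopping rule can be sketched as follows: the dendrogram is cut at the first fusion level that exceeds the mean of all fusion levels plus k sample standard deviations. The value k = 1.25 is a commonly used default and is an assumption here, as are the function name and the example fusion levels. The abstract does not state the settings the authors used.

```python
from statistics import mean, stdev

def mojena_number_of_groups(fusion_levels, k=1.25):
    """Estimate the number of groups from hierarchical fusion levels.

    fusion_levels: the n - 1 merge heights of a dendrogram built on n items.
    k: multiplier of the standard deviation (assumed default, not from the source).
    """
    levels = sorted(fusion_levels)
    n_items = len(levels) + 1
    cutoff = mean(levels) + k * stdev(levels)
    for merges_done, level in enumerate(levels):
        if level > cutoff:
            # Stop before this merge: n_items - merges_done groups remain.
            return n_items - merges_done
    return 1  # no fusion level stands out, so all items form one group

# Hypothetical merge heights for six accessions clustered with UPGMA.
groups = mojena_number_of_groups([1.0, 1.2, 1.5, 2.0, 10.0])
```

Because the cutoff depends on the mean and spread of the merge heights, a single very large final fusion tends to yield few groups, which is consistent with the rule returning the smallest number of groups in the comparison above.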
Abstract:
The main aim of this Ph.D. dissertation is the study of clustering dependent data by means of copula functions, with particular emphasis on microarray data. Copula functions are a popular multivariate modeling tool in every field where multivariate dependence is of great interest, and their use in clustering has not yet been investigated. The first part of this work reviews the literature on clustering methods, copula functions and microarray experiments. The attention focuses on the K-means (Hartigan, 1975; Hartigan and Wong, 1979), the hierarchical (Everitt, 1974) and the model-based (Fraley and Raftery, 1998, 1999, 2000, 2007) clustering techniques because their performance is compared. Then, the probabilistic interpretation of Sklar's theorem (Sklar, 1959), estimation methods for copulas such as Inference for Margins (Joe and Xu, 1996), and the Archimedean and Elliptical copula families are presented. Finally, applications of clustering methods and copulas to genetic and microarray experiments are highlighted. The second part contains the original contribution. A simulation study is performed to evaluate the performance of the K-means and the hierarchical bottom-up clustering methods in identifying clusters according to the dependence structure of the data generating process. Different simulations are performed by varying different conditions (e.g., the kind of margins (distinct, overlapping and nested) and the value of the dependence parameter), and the results are evaluated by means of different measures of performance. In light of the simulation results and of the limits of the two investigated clustering methods, a new clustering algorithm based on copula functions ('CoClust' in brief) is proposed. The basic idea, the iterative procedure of the CoClust and the description of the written R functions with their output are given.
The CoClust algorithm is tested on simulated data (by varying the number of clusters, the copula models, the dependence parameter value and the degree of overlap of margins) and is compared with model-based clustering by using different measures of performance, such as the percentage of well-identified numbers of clusters and the non-rejection percentage of H0 on . It is shown that the CoClust algorithm overcomes all the observed limits of the other investigated clustering techniques and is able to identify clusters according to the dependence structure of the data independently of the degree of overlap of margins and the strength of the dependence. The CoClust uses a criterion based on the maximized log-likelihood function of the copula and can account for virtually any possible dependence relationship between observations. Several distinctive characteristics of the CoClust are shown, e.g. its capability of identifying the true number of clusters and the fact that it does not require a starting classification. Finally, the CoClust algorithm is applied to the real microarray data of Hedenfalk et al. (2001), both to the gene expressions observed in three different cancer samples and to the columns (tumor samples) of the whole data matrix.
Abstract:
There is constant pressure to improve the evaluation of animal genetic resources in order to prevent their erosion. Maintaining the integrity of livestock species as well as their genetic diversity is of paramount interest for long-term agricultural policies. One major use of DNA techniques in conservation is to reveal genetic diversity within and between populations. Forty-one microsatellites were analysed to assess genetic diversity in nine Swiss sheep breeds and to measure the loss of overall diversity if one breed were to become extinct. The expected heterozygosities varied from 0.65 to 0.74, and 10.8% of the total genetic diversity can be explained by the variation among breeds. Based on the proportion of shared alleles, each of the nine breeds was clearly defined in its own cluster in the neighbour-joining tree describing the relationships among the breeds. Bayesian clustering methods assign individuals to groups based on their genetic similarity and infer the number of populations. In STRUCTURE, this approach pooled the Valais Blacknose and the Valais Red. With the BAPS method, the two Valais sheep breeds could be separated. The approach of Caballero & Toro (2002) was used to calculate the loss or gain of genetic diversity when each of the breeds is removed from the set. The changes in diversity based on between-breed variation ranged from -12.2% (Valais Blacknose) to 0% (Swiss Black Brown Mountain and Mirror Sheep); based on within-breed diversity, the removal of a breed could also produce an increase in diversity (-0.6% to +0.6%). Allelic richness ranged from 4.9 (Valais Red) to 6.7 (Brown Headed Meat sheep and Red Engadine Sheep). Breed conservation decisions cannot be limited to genetic diversity alone. In Switzerland, conservation goals are embedded in the desire to carry the cultural legacy over to future generations.
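The expected heterozygosity reported in this abstract is commonly computed per locus as one minus the sum of squared allele frequencies. The sketch below shows that basic form for a single microsatellite locus. The abstract does not say which estimator the authors used (studies often average over loci and apply a sample-size correction), and the function name and allele sizes are hypothetical.

```python
from collections import Counter

def expected_heterozygosity(alleles):
    """Return 1 - sum(p_i^2) for the allele copies observed at one locus."""
    counts = Counter(alleles)
    total = sum(counts.values())
    return 1.0 - sum((count / total) ** 2 for count in counts.values())

# Hypothetical allele sizes (in base pairs) from two diploid animals at one locus.
he = expected_heterozygosity([120, 120, 124, 128])
```

A locus with a single allele gives 0, and the value approaches 1 as more alleles occur at similar frequencies, so values of 0.65 to 0.74 indicate several alleles of appreciable frequency per locus.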