16 resultados para Clustering methods

em Repositório Institucional UNESP - Universidade Estadual Paulista "Julio de Mesquita Filho"


Relevância:

60.00% 60.00%

Publicador:

Resumo:

Sessenta e nove acessos de Psidium, coletados em seis estados brasileiros, foram analisados para dois métodos não hierárquicos de agrupamento e por componentes principais (CP), visando orientar programas de melhoramento. Foram analisadas as variáveis ácido ascórbico, β-caroteno, licopeno, fenóis totais, flavonóides totais, atividade antioxidante, acidez titulável, sólidos solúveis, açúcares solúveis totais, teor de umidade, diâmetro lateral e transversal do fruto, peso da polpa e das sementes/fruto, número e produção de frutos/planta. Foram observados agrupamentos específicos para os acessos de araçazeiros no método de Tocher e do k-means e na dispersão tridimensional dos quatro CPs. Os acessos de araçazeiros foram separados dos de goiabeira. Não foi observado nenhum agrupamento específico por estado de coleta, indicando a inexistência de barreiras na propagação dos acessos de goiabeira. As análises sugerem a prospecção de maior número de amostras de germoplasma num menor número de regiões, bem como acessos divergentes com alto teor de compostos nutricionais.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

O objetivo deste trabalho foi comparar diferentes técnicas multivariadas na caracterização de 35 genótipos de gergelim mediante 769 marcadores RAPD. As distâncias genéticas foram obtidas pelo complemento aritmético do coeficiente de Jaccard e agrupadas pelos métodos hierárquicos do vizinho mais próximo, do vizinho mais distante, das médias aritméticas não ponderadas (UPGMA), do método de otimização de Tocher e análises de coordenadas principais. O agrupamento dos genótipos foi alterado em função dos diferentes métodos usados. Adotando-se a mesma distância genética (0,36) como valor de corte, diferenciaram-se quatro grupos no método do vizinho mais próximo, 13 para o vizinho mais distante, 11 no UPGMA e quatro no Tocher. Entre os métodos hierárquicos, o UPGMA apresentou o melhor ajuste das distâncias originais e estimadas (CCC = 0,89). As análises das coordenadas principais confirmaram a baixa diversidade existente entre os genótipos. A maior divergência ocorreu entre as cultivares Seridó 1 e Arawaca 4, e a menor, entre os genótipos VCR-101 e GP-3314. As três primeiras coordenadas principais contabilizaram 35,13% do total da variabilidade, e 18 autovalores foram necessários para explicar 81% da variação genética. Os métodos UPGMA, de otimização de Tocher, e as análises de coordenadas principais são complementares na formação dos grupos.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Flavonoid compounds were analyzed in ripe fruit pulp of ten species of Coffea, including two cultivars of C. arabica and two of C. canephora. Three coefficients of similarity: Simple-Matching, Jaccard and Ochiai and three different clustering methods, Single Linkage, Complete Linkage and Unweighted Pair Group, Using Arithmetic Averages (UPGMA), were used to analyze the data.Jaccard and Ochiai's coefficients of association showed a more coherent result, when compared with taxonomic and hybridization studies. Inclusion of Psilanthopsis kapakata in the genus Coffea, as C. kapakata, is justified by the similarity of this species with other studied species, and clusters clearly approximate the species C. arabica and C. eugenioides. The latter is one of the possible parents of the allotetraploid species C. arabica, C. congensis is the only species whose position remains ambiguous, probably due to the fact that the plants of this species that were introduced into the Campinas collections, were hybrids and not typical of C. congensis.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

The species of the sandy plains forests (forests of the ''restingas'') have not yet had their spatial patterns studied as aids to the understanding of the diversity found in the different physiognomies along the Brazilian coast. In this paper a 10 x 10 m quadrat framework laid in a hectare of a tree dominant forest in the sandy plains of the Picinguaba area of the Serra do Mar State Park (municipality of Ubatuba, state of São Paulo, Brazil) was used to assess the spatial pattern of distribution for the ten most important species : Pera glabrata, Euterpe edulis, Eugenia brasiliensis, Alchornea triplinervea, Guatteria australis, Myrcia racemosa, Jacaranda semiserrata, Guarea macrophylla, Euplassa cantareirae and Nectandra oppositifolia. The spatial patterns were inferred through the calculations of their T-Square Index (C) and Dispersal Distance Index (I). P. glabrata shows a random pattern, E. edulis aggregate, E. brasiliensis, A. triplinervia, G. australis, E. cantareirae and N. oppositifolia with a tendency between aggregate and uniform and, M. racemosa, J. semiserrata and G. macrophylla between aggregate and random. Although the indexes are dependent of the sample size and of the technique adjustments, the relationship of the pattern with the environmental factors is shown by clustering methods. The results give confirmation of how the spatial patterns bring associations between populations and shape of the vegetation physiognomy.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Thirteen species of Coffea were studied for five enzymes systems, including alpha and beta esterase, alkaline phosphatase, acid phosphatase, malate dehydrogenase and acid dehydrogenase. Three coefficients of similarity: Simple Matching, Jaccard and Ochiai and three different clustering methods: Single Linkage, Complete Linkage and Unweighted Pair Group, using Arithmetic Averages (UPGMA) were used to analyse the data.The phylogenetic relationships among the twelve diploid species and between them and the tetraploid species C. arabica showed that similarity among species of the same subsection is not always greater than among species of different subsections. In addition, although there are several similarity groups in common, established by isoenzymatic polymorphism, morphological characteristics, chemical data, crossability and geographic distribution, there is no common trend among the phylogenetic relationships as indicated by all these different evaluating procedures.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

This article presents a quantitative and objective approach to cat ganglion cell characterization and classification. The combination of several biologically relevant features such as diameter, eccentricity, fractal dimension, influence histogram, influence area, convex hull area, and convex hull diameter are derived from geometrical transforms and then processed by three different clustering methods (Ward's hierarchical scheme, K-means and genetic algorithm), whose results are then combined by a voting strategy. These experiments indicate the superiority of some features and also suggest some possible biological implications.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

(10) Hygiea is the fourth largest asteroid of the main belt, by volume and mass, and it is the largest member of its family, that is made mostly by low-albedo, C-type asteroids, typical of the outer main belt. Like many other large families, it is associated with a 'halo' of objects, that extends far beyond the boundary of the core family, as detected by traditional hierarchical clustering methods (HCM) in proper element domains. Numerical simulations of the orbital evolution of family members may help in estimating the family and halo family age, and the original ejection velocity field. But, in order to minimize the errors associated with including too many interlopers, it is important to have good estimates of family membership that include available data on local asteroid taxonomy, geometrical albedo and local dynamics. For this purpose, we obtained synthetic proper elements and frequencies of asteroids in the Hygiea orbital region, with their errors. We revised the current knowledge on asteroid taxonomy, including Sloan Digital Sky Survey-Moving Object Catalog 4th release (SDSS-MOC 4) data, and geometric albedo data from Wide-field Infrared Survey Explorer (WISE) and Near-Earth Object WISE (NEOWISE). We identified asteroid family members using HCM in the domain of proper elements (a, e, sin (i)) and in the domains of proper frequencies most appropriate to study diffusion in the local web of secular resonances, and eliminated possible interlopers based on taxonomic and geometrical albedo considerations. To identify the family halo, we devised a new hierarchical clustering method in an extended domain that includes proper elements, principal components PC1, PC2 obtained based on SDSS photometric data and, for the first time, WISE and NEOWISE geometric albedo. Data on asteroid size distribution, light curves and rotations were also revised for the Hygiea family. The Hygiea family is the largest group in its region, with two smaller families in proper element domain and 18 families in various frequencies domains identified in this work for the first time. Frequency groups tend to extend vertically in the (a, sin (i)) plane and cross not only the Hygiea family but also the near C-type families of Themis and Veritas, causing a mixture of objects all of relatively low albedo in the Hygiea family area. A few high-albedo asteroids, most likely associated with the Eos family, are also present in the region. Finally, the new multidomains hierarchical clustering method allowed us to obtain a good and robust estimate of the membership of the Hygiea family halo, quite separated from other asteroids families halo in the region, and with a very limited (about 3 per cent) presence of likely interlopers. © 2013 The Author Published by Oxford University Press on behalf of the Royal Astronomical Society.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Pós-graduação em Agronomia (Genética e Melhoramento de Plantas) - FCAV

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)

Relevância:

60.00% 60.00%

Publicador:

Resumo:

The aim of this study was to develop an objective method to determine the incidence of pleiomorphisms and its influence on the distribution of sperm morphometric subpopulations in ejaculates of howling monkeys (Alouatta caraya) by using a combination of computerized analysis system (ASMA) and principal component analysis (PCA) methods. Ejaculates were collected by electroejaculation methods on a regular basis from five individuals maintained under identical captive environmental, nutritional, and management conditions. Each sperm head was measured for dimensional parameters (Area [A, (square micrometers)], Perimeter [P, (micrometers)], Length [L, (micrometers)], and Width [W, (micrometers)]) and shape-derived parameters (Ellipticity [(L/W)], Elongation [(L - W)/(L + W)], and Rugosity [(4 pi A/P-2)]). PCA revealed two principal components explaining more than the 96 % of the variance. Clustering methods and discriminant analyzes were performed and seven separate subpopulations were identified. There were differences (P < 0.001) in the distribution of the seven subpopulations as well as in the incidence of abnormal pleiomorphisms (58.6 %, 49.8 %, 35.1 %, 66.4 %, and 55.1 %, P < 0.05) among the five donors tested. Our results indicated that differences among individuals related to the incidence of pleiomorphisms, and sperm subpopulational structure was not related to the captivity conditions or the sperm collection method, since all individuals were studied under identical conditions. In conclusion, the combination of ASMA and PCA is a useful clinical diagnostic resource for detecting deficiencies in sperm morphology and sperm subpopulations in A. caraya ejaculates that could be used in ex situ conservation programs of threatened species in Alouatta genus or even other endangered neotropical primate species.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Although association mining has been highlighted in the last years, the huge number of rules that are generated hamper its use. To overcome this problem, many post-processing approaches were suggested, such as clustering, which organizes the rules in groups that contain, somehow, similar knowledge. Nevertheless, clustering can aid the user only if good descriptors be associated with each group. This is a relevant issue, since the labels will provide to the user a view of the topics to be explored, helping to guide its search. This is interesting, for example, when the user doesn't have, a priori, an idea where to start. Thus, the analysis of different labeling methods for association rule clustering is important. Considering the exposed arguments, this paper analyzes some labeling methods through two measures that are proposed. One of them, Precision, measures how much the methods can find labels that represent as accurately as possible the rules contained in its group and Repetition Frequency determines how the labels are distributed along the clusters. As a result, it was possible to identify the methods and the domain organizations with the best performances that can be applied in clusters of association rules.