869 resultados para height partition clustering


Relevância:

20.00% 20.00%

Publicador:

Resumo:

Clustering analysis of data from DNA microarray hybridization studies is an essential task for identifying biologically relevant groups of genes. Attribute cluster algorithm (ACA) has provided an attractive way to group and select meaningful genes. However, ACA needs much prior knowledge about the genes to set the number of clusters. In practical applications, if the number of clusters is misspecified, the performance of the ACA will deteriorate rapidly. In fact, it is a very demanding to do that because of our little knowledge. We propose the Cooperative Competition Cluster Algorithm (CCCA) in this paper. In the algorithm, we assume that both cooperation and competition exist simultaneously between clusters in the process of clustering. By using this principle of Cooperative Competition, the number of clusters can be found in the process of clustering. Experimental results on a synthetic and gene expression data are demonstrated. The results show that CCCA can choose the number of clusters automatically and get excellent performance with respect to other competing methods.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This paper deals with Takagi-Sugeno (TS) fuzzy model identification of nonlinear systems using fuzzy clustering. In particular, an extended fuzzy Gustafson-Kessel (EGK) clustering algorithm, using robust competitive agglomeration (RCA), is developed for automatically constructing a TS fuzzy model from system input-output data. The EGK algorithm can automatically determine the 'optimal' number of clusters from the training data set. It is shown that the EGK approach is relatively insensitive to initialization and is less susceptible to local minima, a benefit derived from its agglomerate property. This issue is often overlooked in the current literature on nonlinear identification using conventional fuzzy clustering. Furthermore, the robust statistical concepts underlying the EGK algorithm help to alleviate the difficulty of cluster identification in the construction of a TS fuzzy model from noisy training data. A new hybrid identification strategy is then formulated, which combines the EGK algorithm with a locally weighted, least-squares method for the estimation of local sub-model parameters. The efficacy of this new approach is demonstrated through function approximation examples and also by application to the identification of an automatic voltage regulation (AVR) loop for a simulated 3 kVA laboratory micro-machine system.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Support vector machine (SVM) is a powerful technique for data classification. Despite of its good theoretic foundations and high classification accuracy, normal SVM is not suitable for classification of large data sets, because the training complexity of SVM is highly dependent on the size of data set. This paper presents a novel SVM classification approach for large data sets by using minimum enclosing ball clustering. After the training data are partitioned by the proposed clustering method, the centers of the clusters are used for the first time SVM classification. Then we use the clusters whose centers are support vectors or those clusters which have different classes to perform the second time SVM classification. In this stage most data are removed. Several experimental results show that the approach proposed in this paper has good classification accuracy compared with classic SVM while the training is significantly faster than several other SVM classifiers.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Concrete placed under water should be proportioned to flow readily into place with minimum materials separation. Unlike concrete cast for deep tremie seals, the use of concrete in repairs often necessitates some free fall of the mixture through water. Such placement conditions lead to greater risk of water erosion and segregation, and should be addressed in proportioning highly flowable underwater concrete. This paper evaluates the effect of free-fall height (FFH) of concrete through water on resulting in-place properties. Concrete was cast in blocks measuring 0.54 x 0.44 x 1 m with the initial FFH in water ranging between 0.25 and 0.60 m. In-place compressive and splitting tensile strengths, unit weight, and depth of washed-out and sedimentation materials were determined. In total, 24 highly flowable mixtures with slump flows greater than 500 mm were investigated. The evaluated mixtures were prepared with various hydraulic binders, including conventional Type 10 cement, a binary mixture with 10% of silica fume (SF), and a ternary binder incorporating 20% of fly ash (FA) and 6% of SF. The mixtures were proportioned with water-binder ratios (w/b) ranging between 0.41 and 0.47. Test results show that the increase of FFH of fresh concrete in water can greatly decrease the residual strength and significantly increase the thickness of washed out and sedimentation materials. The incorporation of 10% of SF, or 20% of FA and 6% of SF, and the reduction of the w/b from 0.47 to 0.41 can, however, lead to a significant increase in washout resistance and residual strength. A relationship between residual strength and the coupled factor of free-fall drop of concrete in water and washout resistance is established.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

In this paper we study the classification of spatiotemporal pattern of one-dimensional cellular automata (CA) whereas the classification comprises CA rules including their initial conditions. We propose an exploratory analysis method based on the normalized compression distance (NCD) of spatiotemporal patterns which is used as dissimilarity measure for a hierarchical clustering. Our approach is different with respect to the following points. First, the classification of spatiotemporal pattern is comparative because the NCD evaluates explicitly the difference of compressibility among two objects, e.g., strings corresponding to spatiotemporal patterns. This is in contrast to all other measures applied so far in a similar context because they are essentially univariate. Second, Kolmogorov complexity, which underlies the NCD, was used in the classification of CA with respect to their spatiotemporal pattern. Third, our method is semiautomatic allowing us to investigate hundreds or thousands of CA rules or initial conditions simultaneously to gain insights into their organizational structure. Our numerical results are not only plausible confirming previous classification attempts but also shed light on the intricate influence of random initial conditions on the classification results.