997 results for complete linkage clustering


Abstract:

Separation by biogeographic features followed by secondary contact can blur taxonomic boundaries and produce complex genetic signatures. We analyzed population structure and gene flow across the range of the long-tailed finch (Poephila acuticauda) in northern Australia (1) to test the hypothesis that the Ord Arid Intrusion acted as the causative barrier that led to divergence of P. acuticauda subspecies, (2) to determine whether genetic data support a gradual cline across the range or a sudden shift, both of which have been suggested based on morphological data, and (3) to estimate levels of contemporary gene flow within this species complex. We collected samples from 302 individuals from 10 localities. Analyses of 12 microsatellite loci and sequence data from 333 base pairs of the mitochondrial control region were used to estimate population structure and gene flow, using analysis of molecular variance (AMOVA), haplotype network analysis, frequency statistics, and clustering methods. Mitochondrial sequence data indicated the presence of three genetic groups (regions) across the range of P. acuticauda. Genetic diversity was highest in the east and lowest in the west. The Ord Arid Intrusion appears to have functioned as a biogeographic barrier in the past, according to the mtDNA evidence presented here and evidence from previous studies. The absence of isolation by distance between adjacent regions and the lack of population genetic structure of mtDNA within regions indicate that genetic changes across the range of P. acuticauda subspecies are characterized by discrete breaks between regions. While microsatellite data indicate a complete absence of genetic structure across this species' range, it appears unlikely that this results from high levels of gene flow. Mitochondrial data do not support the presence of contemporary gene flow across the range of this species.

Abstract:

This article experimentally investigates a novel application of a clustering technique recently introduced by the authors. The technique uses robust and stable consensus functions in information security, where it is often necessary to process large data sets and monitor outcomes in real time, as is required, for example, for intrusion detection. Here we concentrate on a particular application: profiling of phishing websites. First, we apply several independent clustering algorithms to a randomized sample of data to obtain independent initial clusterings. The silhouette index is used to determine the number of clusters. Second, rank correlation is used to select a subset of features for dimensionality reduction. We investigate the effectiveness of the Pearson Linear Correlation Coefficient, the Spearman Rank Correlation Coefficient and the Goodman-Kruskal Correlation Coefficient in this application. Third, we use a consensus function to combine the independent initial clusterings into one consensus clustering. Fourth, we train fast supervised classification algorithms on the resulting consensus clustering to enable them to process the whole large data set as well as new data. The precision and recall of the classifiers at the final stage of this scheme are critical for the effectiveness of the whole procedure. We investigated various combinations of several correlation coefficients, consensus functions, and a variety of supervised classification algorithms.
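The third and fourth steps of the scheme above can be illustrated with a minimal Python sketch. It is not the authors' method: the abstract does not name its consensus functions or classifiers, so the sketch assumes a simple co-association (pairwise voting) consensus and a nearest-centroid classifier. All function names, the 0.5 threshold, and the toy data are hypothetical.

```python
def consensus_labels(labelings, threshold=0.5):
    """Merge points that share a cluster in more than `threshold` of the
    base clusterings (co-association voting, assumed for illustration)."""
    n, m = len(labelings[0]), len(labelings)
    parent = list(range(n))  # union-find forest over the points

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for i in range(n):
        for j in range(i + 1, n):
            agree = sum(1 for labels in labelings if labels[i] == labels[j])
            if agree / m > threshold:
                parent[find(i)] = find(j)

    # Relabel the components 0, 1, 2, ... in order of first appearance.
    ids = {}
    return [ids.setdefault(find(i), len(ids)) for i in range(n)]


def fit_centroids(points, labels):
    """Train a nearest-centroid classifier on the consensus labels."""
    sums, counts = {}, {}
    for point, label in zip(points, labels):
        acc = sums.setdefault(label, [0.0] * len(point))
        for d, value in enumerate(point):
            acc[d] += value
        counts[label] = counts.get(label, 0) + 1
    return {label: [s / counts[label] for s in acc] for label, acc in sums.items()}


def predict(centroids, point):
    """Assign a new point to the cluster with the closest centroid."""
    return min(
        centroids,
        key=lambda label: sum((a - b) ** 2 for a, b in zip(centroids[label], point)),
    )


# Toy data: three base clusterings of six points (label names differ per run).
points = [(0.0, 0.1), (0.2, 0.0), (0.1, 0.3), (5.0, 5.1), (5.2, 4.9), (4.8, 5.0)]
labelings = [
    [0, 0, 0, 1, 1, 1],
    [1, 1, 1, 0, 0, 0],
    [0, 0, 1, 1, 2, 2],  # a noisier clustering
]
consensus = consensus_labels(labelings)
centroids = fit_centroids(points, consensus)
print(consensus)
print(predict(centroids, (4.9, 5.2)))
```

Training a cheap classifier on the consensus labels is what lets the scheme label the full data set and new records without rerunning the clustering stage.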

Abstract:

This paper presents an empirical analysis of how different colour models affect image segmentation with the fuzzy c-means (FCM) clustering algorithm. A qualitative evaluation method based on human perceptual judgement is used. Two sets of complex images, i.e., outdoor scenes and satellite imagery, are used for demonstration. These images are employed to examine the characteristics of image segmentation using FCM with eight different colour models. The results obtained from the experimental study are compared and analysed. It is found that the CIELAB colour model yields the best outcomes in colour image segmentation with FCM.
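For readers unfamiliar with FCM, the following is a minimal Python sketch of the standard algorithm applied to colour triples. It is not the paper's implementation: the pixel values, the initial centres, the fuzzifier m = 2, and the iteration count are all assumptions made for illustration, and the conversion between colour models is omitted.

```python
import math


def fuzzy_c_means(points, centers, m=2.0, iterations=50):
    """Standard FCM: alternate membership and centre updates.

    Returns the final centres and the memberships computed in the
    last membership update (one row per point, one column per centre).
    """
    centers = [list(c) for c in centers]
    dims = len(points[0])
    memberships = []
    for _ in range(iterations):
        # Membership update: u_ik is proportional to d_ik ** (-2 / (m - 1)).
        memberships = []
        for p in points:
            dists = [math.dist(p, c) for c in centers]
            if any(d == 0.0 for d in dists):
                # A point sitting exactly on a centre belongs fully to it.
                row = [1.0 if d == 0.0 else 0.0 for d in dists]
            else:
                row = [d ** (-2.0 / (m - 1.0)) for d in dists]
            total = sum(row)
            memberships.append([v / total for v in row])
        # Centre update: mean of the points weighted by u_ik ** m.
        for k in range(len(centers)):
            weights = [row[k] ** m for row in memberships]
            total = sum(weights)
            centers[k] = [
                sum(w * p[d] for w, p in zip(weights, points)) / total
                for d in range(dims)
            ]
    return centers, memberships


# Hypothetical pixels: three reddish and three bluish colour triples.
pixels = [(250, 10, 10), (240, 20, 15), (245, 5, 20),
          (10, 10, 250), (20, 15, 240), (5, 20, 245)]
centers, memberships = fuzzy_c_means(pixels, [(255, 0, 0), (0, 0, 255)])
# Hard segmentation: assign each pixel to its highest-membership cluster.
segments = [row.index(max(row)) for row in memberships]
print(segments)
```

Because FCM works on distances in whatever space the pixels are expressed in, changing the colour model changes those distances, which is why the choice of model can alter the segmentation.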

Abstract:

Both instance-level knowledge and attribute-level knowledge can improve clustering quality, but how to use the two together effectively remains an essential problem. This paper proposes a wrapper framework for semi-supervised clustering that aims to gracefully integrate both kinds of prior knowledge in the clustering process: instance-level knowledge in the form of pairwise constraints and attribute-level knowledge in the form of attribute order preferences. The wrapped algorithm is then designed as a semi-supervised clustering process that transforms this clustering problem into an optimization problem. The experimental results demonstrate the effectiveness and potential of the proposed method.
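To make the idea of casting constrained clustering as an optimization problem concrete, here is a small Python sketch of one possible objective. The abstract does not give the paper's actual formulation, so the function, the penalty weight, and the data are hypothetical. The sketch covers only pairwise constraints (must-link and cannot-link) and leaves out attribute order preferences.

```python
def constrained_cost(points, labels, must_link, cannot_link, penalty=1.0):
    """Within-cluster squared distance plus a penalty per violated constraint.

    This objective is an assumption made for illustration, not the paper's.
    """
    # Group the points by their assigned cluster.
    clusters = {}
    for point, label in zip(points, labels):
        clusters.setdefault(label, []).append(point)

    # Sum of squared distances from each point to its cluster centroid.
    within = 0.0
    for members in clusters.values():
        dims = len(members[0])
        centroid = [sum(p[d] for p in members) / len(members) for d in range(dims)]
        within += sum(
            sum((p[d] - centroid[d]) ** 2 for d in range(dims)) for p in members
        )

    # A must-link pair is violated when split; a cannot-link pair when merged.
    violations = sum(1 for i, j in must_link if labels[i] != labels[j])
    violations += sum(1 for i, j in cannot_link if labels[i] == labels[j])
    return within + penalty * violations


points = [(0.0,), (1.0,), (10.0,), (11.0,)]
must_link = [(0, 1)]    # points 0 and 1 should share a cluster
cannot_link = [(1, 2)]  # points 1 and 2 should be separated
print(constrained_cost(points, [0, 0, 1, 1], must_link, cannot_link))
print(constrained_cost(points, [0, 1, 1, 1], must_link, cannot_link))
```

A search procedure would then look for the labelling that minimizes this cost. The second labelling above scores worse because it is less compact and violates both constraints.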

Abstract:

Ensifer (Sinorhizobium) medicae is an effective nitrogen fixing microsymbiont of a diverse range of annual Medicago (medic) species. Strain WSM419 is an aerobic, motile, non-spore forming, Gram-negative rod isolated from an M. murex root nodule collected in Sardinia, Italy in 1981. WSM419 was manufactured commercially in Australia as an inoculant for annual medics from 1985 to 1993 due to its nitrogen fixation, saprophytic competence and acid tolerance properties. Here we describe the basic features of this organism, together with the complete genome sequence and annotation. This is the first report of a complete genome sequence for a microsymbiont of the group of annual medic species adapted to acid soils. We reveal that its genome size is 6,817,576 bp encoding 6,518 protein-coding genes and 81 RNA-only encoding genes. The genome contains a chromosome of size 3,781,904 bp and 3 plasmids of size 1,570,951, 1,245,408 and 219,313 bp. The smallest plasmid is a feature unique to this medic microsymbiont.

Abstract:

Rhizobium leguminosarum bv. trifolii is the effective nitrogen fixing microsymbiont of a diverse range of annual and perennial Trifolium (clover) species. Strain WSM2304 is an aerobic, motile, non-spore forming, Gram-negative rod isolated from Trifolium polymorphum in Uruguay in 1998. This microsymbiont predominated in the perennial grasslands of Glencoe Research Station, in Uruguay, to competitively nodulate its host and fix atmospheric nitrogen. Here we describe the basic features of WSM2304, together with the complete genome sequence and annotation. This is the first completed genome sequence for a nitrogen fixing microsymbiont of a clover species from the American centre of origin. We reveal that its genome size is 6,872,702 bp encoding 6,643 protein-coding genes and 62 RNA-only encoding genes. This multipartite genome was found to contain 5 distinct replicons: a chromosome of size 4,537,948 bp and four circular plasmids of size 1,266,105, 501,946, 308,747 and 257,956 bp.
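As a quick arithmetic check, the replicon sizes can be summed and compared with the stated genome size. The short Python sketch below reads the listed figures as one chromosome plus four plasmids; the variable names are illustrative only.

```python
# Replicon sizes for WSM2304 in base pairs, as listed in the abstract.
chromosome = 4_537_948
plasmids = [1_266_105, 501_946, 308_747, 257_956]

total = chromosome + sum(plasmids)
replicons = 1 + len(plasmids)

print(total)                 # 6872702
print(total == 6_872_702)    # True: matches the stated genome size
print(replicons)             # 5
```

The sum equals the reported 6,872,702 bp, consistent with five replicons in total.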

Abstract:

Rhizobium leguminosarum bv. trifolii is a soil-inhabiting bacterium that has the capacity to be an effective nitrogen fixing microsymbiont of a diverse range of annual Trifolium (clover) species. Strain WSM1325 is an aerobic, motile, non-spore forming, Gram-negative rod isolated from root nodules collected in 1993 from the Greek island of Serifos. WSM1325 is manufactured commercially in Australia as an inoculant for a broad range of annual clovers of Mediterranean origin due to its superior attributes of saprophytic competence, nitrogen fixation and acid tolerance. Here we describe the basic features of this organism, together with the complete genome sequence and annotation. This is the first completed genome sequence for a microsymbiont of annual clovers. We reveal that its genome size is 7,418,122 bp encoding 7,232 protein-coding genes and 61 RNA-only encoding genes. This multipartite genome contains 6 distinct replicons: a chromosome of size 4,767,043 bp and 5 plasmids of size 828,924, 660,973, 516,088, 350,312 and 294,782 bp.

Abstract:

This paper describes a new image segmentation approach that integrates color and texture features using the fuzzy c-means clustering algorithm. To demonstrate the applicability of the proposed approach to satellite image retrieval, an interactive region-based image query system is designed and developed. A database comprising 400 multispectral satellite images is used to evaluate the performance of the system. The results are analyzed and discussed, and a performance comparison with other methods is included. The outcomes show that the proposed approach improves both the quality of the segmentation results and the retrieval performance.