15 resultados para Partitions

em Biblioteca Digital da Produção Intelectual da Universidade de São Paulo (BDPI/USP)


Relevância:

20.00% 20.00%

Publicador:

Resumo:

Clustering is a difficult task: there is no single cluster definition and the data can have more than one underlying structure. Pareto-based multi-objective genetic algorithms (e.g., MOCK Multi-Objective Clustering with automatic K-determination and MOCLE-Multi-Objective Clustering Ensemble) were proposed to tackle these problems. However, the output of such algorithms can often contains a high number of partitions, becoming difficult for an expert to manually analyze all of them. In order to deal with this problem, we present two selection strategies, which are based on the corrected Rand, to choose a subset of solutions. To test them, they are applied to the set of solutions produced by MOCK and MOCLE in the context of several datasets. The study was also extended to select a reduced set of partitions from the initial population of MOCLE. These analysis show that both versions of selection strategy proposed are very effective. They can significantly reduce the number of solutions and, at the same time, keep the quality and the diversity of the partitions in the original set of solutions. (C) 2010 Elsevier B.V. All rights reserved.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

There is a family of well-known external clustering validity indexes to measure the degree of compatibility or similarity between two hard partitions of a given data set, including partitions with different numbers of categories. A unified, fully equivalent set-theoretic formulation for an important class of such indexes was derived and extended to the fuzzy domain in a previous work by the author [Campello, R.J.G.B., 2007. A fuzzy extension of the Rand index and other related indexes for clustering and classification assessment. Pattern Recognition Lett., 28, 833-841]. However, the proposed fuzzy set-theoretic formulation is not valid as a general approach for comparing two fuzzy partitions of data. Instead, it is an approach for comparing a fuzzy partition against a hard referential partition of the data into mutually disjoint categories. In this paper, generalized external indexes for comparing two data partitions with overlapping categories are introduced. These indexes can be used as general measures for comparing two partitions of the same data set into overlapping categories. An important issue that is seldom touched in the literature is also addressed in the paper, namely, how to compare two partitions of different subsamples of data. A number of pedagogical examples and three simulation experiments are presented and analyzed in details. A review of recent related work compiled from the literature is also provided. (c) 2010 Elsevier B.V. All rights reserved.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The aim of this study was to evaluate the effect of Tecoma stans L. Juss. ex Kunth seeds mass on initial emergence, growth and, seedling development under different light conditions. The seeds were separated in four mass classes and sowed in four replicates of 24 seeds for each class, under full sun and canopy shade. Under sun environment was observed a greater percentage of emergence. Heavy seeds presented the greater percentage of emergence under both environments, but a greater rate was observed under canopy shade. One month after the start of experiments, the seedlings at the shade environment presented 100% of mortality. The growth and development seedlings under full sun were noticed for five months. In this period, only in the first three months was possible to observe the effects of Tecoma stans seeds mass on capacity of seedlings to acquire dry mass. The seedlings biomass partitions were similar among the tested mass class. The seedlings of smaller mass tended to a high specific leaf area in relation to the seedlings from large seeds, mainly in the first three months, resulting in a great acquisition of dry mass by these seedlings. In the fourth month, the specific leaf area did not present any tendency. Because the biggest seeds to give rise seedlings with best initial development than smallest seeds can be considered as species reproductive strategy. To produce seeds of different sizes also can be considered as way of species to spread in many microhabitats.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

In this work an iterative strategy is developed to tackle the problem of coupling dimensionally-heterogeneous models in the context of fluid mechanics. The procedure proposed here makes use of a reinterpretation of the original problem as a nonlinear interface problem for which classical nonlinear solvers can be applied. Strong coupling of the partitions is achieved while dealing with different codes for each partition, each code in black-box mode. The main application for which this procedure is envisaged arises when modeling hydraulic networks in which complex and simple subsystems are treated using detailed and simplified models, correspondingly. The potentialities and the performance of the strategy are assessed through several examples involving transient flows and complex network configurations.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The partitioning of Green Fluorescent Protein (GFP) in poly(ethylene glycol)/Na-poly(acrylate) aqueous two-phase systems (PEG/NaPA-ATPS) has been investigated. The aqueous two-phase systems are formed by mixing the polymers with a salt and a protein solution. The protein partitioning in the two-phase system was investigated at 25 degrees C. The concentration of the GFP was measured by fluorimetry. It was found that the partitioning of GFP depends on the salt type, pH and concentration of PEG. The data indicates that GFP partitions more strongly to the PEG phase in presence of Na2SO4 relative to NaCl. Furthermore, the GFP partitions more to the PEG phase at higher pH. The partition to the PEG phase is strongly favoured in systems with larger tie-line lengths (i.e. systems with higher polymer concentrations). The molecular weight of PEG is important since the partition coefficient (K) of GFP gradually decreases with increasing PEG size, from K ca. 300-400 for PEG 400 to K equal to 1.19 for PEG 8000. A separation process was developed where GFP was separated from a homogenate in two extraction steps: the GFP is first partitioned to the PEG phase in a PEG 3000/NaPA 8000 system containing 3 wt% Na2SO4, where the K value of GFP was 8. The GFP is then re-extracted to a salt phase formed by mixing the previous top-phase with a Na2SO4 solution. The K-value of GFP in this back-extraction was 0.22. The total recovery based on the start material was 74%. (c) 2008 Elsevier B.V. All rights reserved.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The partition of hemoglobin, lysozyme and glucose-6-phospate dehydrogenase (G6PDH) in a novel inexpensive aqueous two-phase system (ATPS) composed by poly(ethylene glycol) (PEG) and sodium polyacrylate (NaPA) has been studied. The effect of NaCl and Na2SO4, pH and PEG molecular size on the partitioning has been studied. At high pH (above 9), hemoglobin partitions strongly to the PEG-phase. Although some precipitation of hemoglobin occurs, high recovery values are obtained particularly for lysozyme and G6PDH. The partitioning forces are dominated by the hydrophobic and electrochemical (salt) effects, since the positively charged lysozyme and negatively charged G6PDH partitions to the non-charged PEG and the strongly negatively charged polyacrylate enriched phase, respectively. (c) 2007 Elsevier B.V. All rights reserved.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

An investigation of clavulanic acid behavior in an aqueous two-phase micellar system employing the surfactants n-decyltetraethylene oxide (C(10)E(4)) and dodecyldimethylamine oxide (DDAO) was carried out. According to the results, clavulanic acid partitions evenly between the two phases of DDAO micellar system, mixed DDAO C(10)E(4) micellar system, as well as C10E4 micellar system. Therefore, electrostatic interactions between positively charged DDAO-containing micelles and negatively charged drug were not strong enough to influence the partitioning. Nevertheless, clavulanic acid extraction from Streptomyces clavuligerus fermentation broth in C(10)E(4) micellar system employing a previous protein denaturation step provided recovery of 52% clavulanic acid with removal of 70% of the contaminant proteins, which is already promising as a purification strategy. (C) 2011 International Union of Biochemistry and Molecular Biology, Inc. Volume 58, Number 2, March/April 2011, Pages 103-108. E-mail: corangel@usp.br

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Nisin is a natural additive for conservation of food, and can also be used as a therapeutic agent. Nisin inhibits the outgrowth of spores, the growth of a variety of Gram-positive and Grain-negative bacteria. In this paper we present a potentially scalable and cost-effective way to purify commercial and biosynthesized in bioreactor nisin, including simultaneously removal of impurities and contaminants, increasing nisin activity. Aqueous two-phase micellar systems (ATPMS) are considered promising for bioseparation and purification purposes. Triton X-114 was chosen as the as phase-forming surfactant because it is relatively mild to proteins and it also forms two coexisting phases within a convenient temperature range. Nisin activity was determined by the agar diffusion assay utilizing Lactobacillus sake as a sensitive indicator microorganism. Results indicated that nisin partitions preferentially to the micelle rich-phase, despite the surfactant concentration tested, and its antimicrobial activity increases. The successful implementation of this peptide partitioning, from a suspension containing other compounds, represents an important step towards developing a separation method for nisin, and more generally, for other biomolecules of interest. (C) 2007 Elsevier Inc. All rights reserved.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Phylogenetic analyses of chloroplast DNA sequences, morphology, and combined data have provided consistent support for many of the major branches within the angiosperm, clade Dipsacales. Here we use sequences from three mitochondrial loci to test the existing broad scale phylogeny and in an attempt to resolve several relationships that have remained uncertain. Parsimony, maximum likelihood, and Bayesian analyses of a combined mitochondrial data set recover trees broadly consistent with previous studies, although resolution and support are lower than in the largest chloroplast analyses. Combining chloroplast and mitochondrial data results in a generally well-resolved and very strongly supported topology but the previously recognized problem areas remain. To investigate why these relationships have been difficult to resolve we conducted a series of experiments using different data partitions and heterogeneous substitution models. Usually more complex modeling schemes are favored regardless of the partitions recognized but model choice had little effect on topology or support values. In contrast there are consistent but weakly supported differences in the topologies recovered from coding and non-coding matrices. These conflicts directly correspond to relationships that were poorly resolved in analyses of the full combined chloroplast-mitochondrial data set. We suggest incongruent signal has contributed to our inability to confidently resolve these problem areas. (c) 2007 Elsevier Inc. All rights reserved.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Broad-scale phylogenetic analyses of the angiosperms and of the Asteridae have failed to confidently resolve relationships among the major lineages of the campanulid Asteridae (i.e., the euasterid II of APG II, 2003). To address this problem we assembled presently available sequences for a core set of 50 taxa, representing the diversity of the four largest lineages (Apiales, Aquifoliales, Asterales, Dipsacales) as well as the smaller ""unplaced"" groups (e.g., Bruniaceae, Paracryphiaceae, Columelliaceae). We constructed four data matrices for phylogenetic analysis: a chloroplast coding matrix (atpB, matK, ndhF, rbcL), a chloroplast non-coding matrix (rps16 intron, trnT-F region, trnV-atpE IGS), a combined chloroplast dataset (all seven chloroplast regions), and a combined genome matrix (seven chloroplast regions plus 18S and 26S rDNA). Bayesian analyses of these datasets using mixed substitution models produced often well-resolved and supported trees. Consistent with more weakly supported results from previous studies, our analyses support the monophyly of the four major clades and the relationships among them. Most importantly, Asterales are inferred to be sister to a clade containing Apiales and Dipsacales. Paracryphiaceae is consistently placed sister to the Dipsacales. However, the exact relationships of Bruniaceae, Columelliaceae, and an Escallonia clade depended upon the dataset. Areas of poor resolution in combined analyses may be partly explained by conflict between the coding and non-coding data partitions. We discuss the implications of these results for our understanding of campanulid phylogeny and evolution, paying special attention to how our findings bear on character evolution and biogeography in Dipsacales.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

In this paper, we present an algorithm for cluster analysis that integrates aspects from cluster ensemble and multi-objective clustering. The algorithm is based on a Pareto-based multi-objective genetic algorithm, with a special crossover operator, which uses clustering validation measures as objective functions. The algorithm proposed can deal with data sets presenting different types of clusters, without the need of expertise in cluster analysis. its result is a concise set of partitions representing alternative trade-offs among the objective functions. We compare the results obtained with our algorithm, in the context of gene expression data sets, to those achieved with multi-objective Clustering with automatic K-determination (MOCK). the algorithm most closely related to ours. (C) 2009 Elsevier B.V. All rights reserved.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Clustering quality or validation indices allow the evaluation of the quality of clustering in order to support the selection of a specific partition or clustering structure in its natural unsupervised environment, where the real solution is unknown or not available. In this paper, we investigate the use of quality indices mostly based on the concepts of clusters` compactness and separation, for the evaluation of clustering results (partitions in particular). This work intends to offer a general perspective regarding the appropriate use of quality indices for the purpose of clustering evaluation. After presenting some commonly used indices, as well as indices recently proposed in the literature, key issues regarding the practical use of quality indices are addressed. A general methodological approach is presented which considers the identification of appropriate indices thresholds. This general approach is compared with the simple use of quality indices for evaluating a clustering solution.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Habitually, capuchin monkeys access encased hard foods by using their canines and premolars and/or by pounding the food on hard surfaces. Instead, the wild bearded capuchins (Cebus libidinosus) of Boa Vista (Brazil) routinely crack palm fruits with tools. We measured size, weight, structure, and peak-force-at-failure of the four palm fruit species most frequently processed with tools by wild capuchin monkeys living in Boa Vista. Moreover, for each nut species we identify whether peak-force-at-failure was consistently associated with greater weight/volume, endocarp, thickness, and structural complexity. The goals of this study were (a) to investigate whether these palm fruits are difficult, or impossible, to access other than with tools and (b) to collect data on the physical properties of palm fruits that are comparable to those available for the nuts cracked open with tools by wild chimpanzees. Results showed that the four nut species differ in terms of peak-force-at-failure and that peak-force-at-failure is positively associated with greater weight (and consequently volume) and apparently with structural complexity (i.e. more kernels and thus more partitions); finally for three out of four nut species shell thickness is also positively associated with greater volume. The finding that the nuts exploited by capuchins with tools have very high resistance values support the idea that tool use is indeed mandatory to crack them open. Finally, the peak-force-at-failure of the piassava nuts is similar to that reported for the very tough panda nuts cracked open by wild chimpanzees; this highlights the ecological importance of tool use for exploiting high resistance foods in this capuchin species.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

We consider conditions which allow the embedding of linear hypergraphs of fixed size. In particular, we prove that any k-uniform hypergraph H of positive uniform density contains all linear k-uniform hypergraphs of a given size. More precisely, we show that for all integers l >= k >= 2 and every d > 0 there exists Q > 0 for which the following holds: if His a sufficiently large k-uniform hypergraph with the property that the density of H induced on every vertex subset of size on is at least d, then H contains every linear k-uniform hypergraph F with l vertices. The main ingredient in the proof of this result is a counting lemma for linear hypergraphs, which establishes that the straightforward extension of graph epsilon-regularity to hypergraphs suffices for counting linear hypergraphs. We also consider some related problems. (C) 2009 Elsevier Inc. All rights reserved.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Given a branched covering of degree d between closed surfaces, it determines a collection of partitions of d, the branch data. In this work we show that any branch data are realized by an indecomposable primitive branched covering on a connected closed surface N with chi(N) <= 0. This shows that decomposable and indecomposable realizations may coexist. Moreover, we characterize the branch data of a decomposable primitive branched covering. Bibliography: 20 titles.