12 resultados para Similarity measure

em Biblioteca Digital da Produção Intelectual da Universidade de São Paulo (BDPI/USP)


Relevância:

20.00% 20.00%

Publicador:

Resumo:

Analysis of floristic similarity relationships between plant communities can detect patterns of species occurrence and also explain conditioning factors. Searching for such patterns, floristic similarity relationships among Atlantic Forest sites situated at Ibiuna Plateau, Sao Paulo state, Brazil, were analyzed by multivariate techniques. Twenty one forest fragments and six sites within a continuous Forest Reserve were included in the analyses. Floristic composition and structure of the tree community (minimum dbh 5 cm) were assessed using the point centered quarter method. Two methods were used for multivariate analysis: Detrended Correspondence Analysis (DCA) and Two-Way Indicator Species Analysis (TWINSPAN). Similarity relationships among the study areas were based on the successional stage of the community and also on spatial proximity. The more similar the successional stage of the communities, the higher the floristic similarity between them, especially if the communities are geographically close. A floristic gradient from north to south was observed, suggesting a transition between biomes, since northern indicator species are mostly heliophytes, occurring also in cerrado vegetation and seasonal semideciduous forest, while southern indicator species are mostly typical ombrophilous and climax species from typical dense evergreen Atlantic Forest.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

There is a family of well-known external clustering validity indexes to measure the degree of compatibility or similarity between two hard partitions of a given data set, including partitions with different numbers of categories. A unified, fully equivalent set-theoretic formulation for an important class of such indexes was derived and extended to the fuzzy domain in a previous work by the author [Campello, R.J.G.B., 2007. A fuzzy extension of the Rand index and other related indexes for clustering and classification assessment. Pattern Recognition Lett., 28, 833-841]. However, the proposed fuzzy set-theoretic formulation is not valid as a general approach for comparing two fuzzy partitions of data. Instead, it is an approach for comparing a fuzzy partition against a hard referential partition of the data into mutually disjoint categories. In this paper, generalized external indexes for comparing two data partitions with overlapping categories are introduced. These indexes can be used as general measures for comparing two partitions of the same data set into overlapping categories. An important issue that is seldom touched in the literature is also addressed in the paper, namely, how to compare two partitions of different subsamples of data. A number of pedagogical examples and three simulation experiments are presented and analyzed in details. A review of recent related work compiled from the literature is also provided. (c) 2010 Elsevier B.V. All rights reserved.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

An important feature of a database management systems (DBMS) is its client/server architecture, where managing shared memory among the clients and the server is always an tough issue. However, similarity queries are specially sensitive to this kind of architecture, since the answer sizes vary widely. Usually, the answers of similarity query are fully processed to be sent in full to the user, who often is interested in just parts of the answer, e.g. just few elements closer or farther to the query reference. Compelling the DBMS to retrieve the full answer, further ignoring its majority is at least a waste of server processing power. Paging the answer is a technique that splits the answer onto several pages, following client requests. Despite the success of paging on traditional queries, little work has been done to support it in similarity queries. In this work, we present a technique that not only provides paging in similarity range or k-nearest neighbor queries, but also supports them in two variations: the forward similarity query and the backward similarity query. They return elements either increasingly farther of increasingly closer to the query reference. The reported experiments show that, depending on the proportion of the interesting part over the full answer, both techniques allow answering queries much faster than it is obtained in the non-paged way. (C) 2010 Elsevier Inc. All rights reserved.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Modern database applications are increasingly employing database management systems (DBMS) to store multimedia and other complex data. To adequately support the queries required to retrieve these kinds of data, the DBMS need to answer similarity queries. However, the standard structured query language (SQL) does not provide effective support for such queries. This paper proposes an extension to SQL that seamlessly integrates syntactical constructions to express similarity predicates to the existing SQL syntax and describes the implementation of a similarity retrieval engine that allows posing similarity queries using the language extension in a relational DBM. The engine allows the evaluation of every aspect of the proposed extension, including the data definition language and data manipulation language statements, and employs metric access methods to accelerate the queries. Copyright (c) 2008 John Wiley & Sons, Ltd.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We propose a method to compute the entanglement degree E of bipartite systems having dimension 2 x 2 and demonstrate that the partial transposition of density matrix, the Peres criterion, arise as a consequence Of Our method. Differently from other existing measures of entanglement, the one presented here makes possible the derivation of a criterion to verify if an arbitrary bipartite entanglement will suffers sudden death (SD) based only on the initial-state parameters. Our method also makes possible to characterize the SD as a dynamical quantum phase transition, with order parameter epsilon. having a universal critical exponent -1/2. (C) 2009 Elsevier Inc. All rights reserved.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

In this work we construct the stationary measure of the N species totally asymmetric simple exclusion process in a matrix product formulation. We make the connection between the matrix product formulation and the queueing theory picture of Ferrari and Martin. In particular, in the standard representation, the matrices act on the space of queue lengths. For N > 2 the matrices in fact become tensor products of elements of quadratic algebras. This enables us to give a purely algebraic proof of the stationary measure which we present for N=3.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Each square complex matrix is unitarily similar to an upper triangular matrix with diagonal entries in any prescribed order. Let A = [a(ij)] and B = [b(ij)] be upper triangular n x n matrices that are not similar to direct sums of square matrices of smaller sizes, or are in general position and have the same main diagonal. We prove that A and B are unitarily similar if and only if parallel to h(A(k))parallel to = parallel to h(B(k))parallel to for all h is an element of C vertical bar x vertical bar and k = 1, ..., n, where A(k) := [a(ij)](i.j=1)(k) and B(k) := [b(ij)](i.j=1)(k) are the leading principal k x k submatrices of A and B, and parallel to . parallel to is the Frobenius norm. (C) 2011 Elsevier Inc. All rights reserved.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

A square matrix is nonderogatory if its Jordan blocks have distinct eigenvalues. We give canonical forms for (1) nonderogatory complex matrices up to unitary similarity, and (2) pairs of complex matrices up to similarity, in which one matrix has distinct eigenvalues. The types of these canonical forms are given by undirected and, respectively, directed graphs with no undirected cycles. (C) 2011 Elsevier Inc. All rights reserved.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Lead (Pb) poisoning is preventable but continues to be a public health problem in several countries. Measuring Pb in the surface dental enamel (SDE) using microbiopsies is a rapid, safe, and painless procedure. There are different protocols to perform these microbiopsies, but the reliability of dental enamel lead levels (DELL) determination is dependent upon biopsy depth (BD). It is established that DELL decrease from the outermost superficial layer to the inner layer of dental enamel. The aim of this study was to determine DELL obtained by two different microbiopsy techniques on SDE termed protocol I and protocol II. Two consecutive enamel layers were removed from the same subject group (n = 138) for both protocols. Protocol I consisted of a biopsied site with a diameter of 4 mm after the application of 10 l HCl for 35 s. Protocol II involved a biopsied site of 1.6 mm diameter after application of 5 l HCl for 20 s. The results demonstrated that there were no significant differences for BD and DELL between homologous teeth using protocol I. However, there was a significant difference between DELL in the first and second layers using both protocols. Further, the BD in protocol II overestimated DELL values. In conclusion, SDE analyzed by microbiopsy is a reliable biomarker in protocol I, but the chemical method to calculate BD in protocol II appeared to be inadequate for measurement of DELL. Thus, DELL could not be compared among studies that used different methodologies for SDE microbiopsies.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Hologram quantitative structure-activity relationships (HQSAR) were applied to a data set of 41 cruzain inhibitors. The best HQSAR model (Q(2) = 0.77; R-2 = 0.90) employing Surflex-Sim, as training and test sets generator, was obtained using atoms, bonds, and connections as fragment distinctions and 4-7 as fragment size. This model was then used to predict the potencies of 12 test set compounds, giving satisfactory predictive R-2 value of 0,88. The contribution maps obtained from the best HQSAR model are in agreement with the biological activities of the study compounds. The Trypanosoma cruzi cruzain shares high similarity with the mammalian homolog cathepsin L. The selectivity toward cruzam was checked by a database of 123 compounds, which corresponds to the 41 cruzain inhibitors used in the HQSAR model development plus 82 cathepsin L inhibitors. We screened these compounds by ROCS (Rapid Overlay of Chemical Structures), a Gaussian-shape volume overlap filter that can rapidly identify shapes that match the query molecule. Remarkably, ROCS was able to rank the first 37 hits as being only cruzain inhibitors. In addition, the area under the curve (AUC) obtained with ROCS was 0.96, indicating that the method was very efficient to distinguishing between cruzain and cathepsin L inhibitors. (c) 2007 Elsevier Ltd. All rights reserved.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Conjugated linoleic acids (CLAs) are a group of linoleic acid isomers that are naturally found in food products originating from ruminants (meat and dairy). These acids have received special attention in recent years due to their potential human health benefits. Research efforts have been proposed to increase the CLA content in beef to improve public health. However, because there are more than 30 million beef cattle used each year by the American food industry, it will be necessary to ensure their content in a large number of samples. Therefore, it is important to have an inexpensive and rapid analytical method to measure CLA content in food products. Because gas chromatography (GC), a current popular method for measuring CLAs, is slow, this paper describes a nuclear magnetic resonance spectroscopy ((1)H NMR) method that is potentially >10 times faster than the GC method. Analyses show a correlation coefficient of 0.97, indicating the capacity of NMR to quantify the CLA content in beef samples. Furthermore, the method proposed herein is simple and does not require sophisticated sample preparation.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Cytochrome P450 (CYP450) is a class of enzymes where the substrate identification is particularly important to know. It would help medicinal chemists to design drugs with lower side effects due to drug-drug interactions and to extensive genetic polymorphism. Herein, we discuss the application of the 2D and 3D-similarity searches in identifying reference Structures with higher capacity to retrieve Substrates of three important CYP enzymes (CYP2C9, CYP2D6, and CYP3A4). On the basis of the complementarities of multiple reference structures selected by different similarity search methods, we proposed the fusion of their individual Tanimoto scores into a consensus Tanimoto score (T(consensus)). Using this new score, true positive rates of 63% (CYP2C9) and 81% (CYP2D6) were achieved with false positive rates of 4% for the CYP2C9-CYP2D6 data Set. Extended similarity searches were carried out oil a validation data set, and the results showed that by using the T(consensus) score, not only the area of a ROC graph increased, but also more substrates were recovered at the beginning of a ranked list.