973 results for similarity


Relevance:

20.00%

Abstract:

Web databases are now pervasive. Such a database can be accessed only via its query interface (usually an HTML query form). Extracting Web query interfaces, that is, creating a formal representation of a query form by extracting the set of query conditions in it, is a critical step in data integration across multiple Web databases. This paper presents a novel approach to extracting Web query interfaces. In this approach, a generic set of query condition rules is created to define query conditions that are semantically equivalent to SQL search conditions. Query condition rules represent the semantic roles that labels and form elements play in query conditions, and how they are hierarchically grouped into constructs of query conditions. To group labels and form elements in a query form, we explore both their structural proximity in the hierarchy of structures in the query form, which is captured by a tree of nested tags in the HTML code of the form, and their semantic similarity, which is captured by the various short texts used in labels, form elements and their properties. We have implemented the proposed approach, and our experimental results show that it is highly effective.
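The structural-proximity idea in this abstract can be illustrated with a minimal sketch. It is not the authors' rule set and it ignores the semantic-similarity component entirely; it only assumes that each form field should be paired with the label closest to it in the tree of nested tags. The class `FormTreeParser`, the helpers `tree_distance` and `pair_fields_with_labels`, and the sample form are all hypothetical names introduced for illustration, and the sketch expects well-formed HTML with explicit closing tags.

```python
from html.parser import HTMLParser

FIELD_TAGS = {"input", "select", "textarea"}
VOID_TAGS = {"input", "br", "img", "hr", "meta", "link"}  # tags with no closing tag


class FormTreeParser(HTMLParser):
    """Record the tree path (child indices from the root) of labels and fields."""

    def __init__(self):
        super().__init__()
        self.path = []           # child indices of the currently open ancestors
        self.child_count = [0]   # children seen so far at each open depth
        self.labels = []         # (path, label text)
        self.fields = []         # (path, field name)
        self._open_label = None  # [path, text] of the label being read, if any

    def handle_starttag(self, tag, attrs):
        index = self.child_count[-1]
        self.child_count[-1] += 1
        node_path = tuple(self.path) + (index,)
        if tag in FIELD_TAGS:
            self.fields.append((node_path, dict(attrs).get("name", "")))
        if tag == "label":
            self._open_label = [node_path, ""]
        if tag not in VOID_TAGS:
            self.path.append(index)
            self.child_count.append(0)

    def handle_endtag(self, tag):
        if tag in VOID_TAGS:
            return
        if tag == "label" and self._open_label is not None:
            self.labels.append((self._open_label[0], self._open_label[1].strip()))
            self._open_label = None
        if self.path:
            self.path.pop()
            self.child_count.pop()

    def handle_data(self, data):
        if self._open_label is not None:
            self._open_label[1] += data


def tree_distance(a, b):
    """Number of edges between two nodes, given their paths from the root."""
    common = 0
    for x, y in zip(a, b):
        if x != y:
            break
        common += 1
    return len(a) + len(b) - 2 * common


def pair_fields_with_labels(html):
    """Map each field name to the text of its structurally nearest label."""
    parser = FormTreeParser()
    parser.feed(html)
    pairs = {}
    for field_path, name in parser.fields:
        if not parser.labels:
            break
        nearest = min(parser.labels, key=lambda lab: tree_distance(field_path, lab[0]))
        pairs[name] = nearest[1]
    return pairs


# Hypothetical query form used only for illustration.
SAMPLE_FORM = """
<form>
  <div><label>Title</label><input name="title"></div>
  <div><label>Author</label><input name="author"></div>
</form>
"""

print(pair_fields_with_labels(SAMPLE_FORM))
```

In the sample form each input shares a `div` with exactly one label, so the tree distance to that label (2 edges) is smaller than the distance to the label in the sibling `div` (4 edges). Real query forms are far less regular, which is presumably why the abstract combines structural proximity with semantic similarity.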

Relevance:

20.00%

Abstract:

Chronic myelomonocytic leukemia is similar to, but distinct from, both myeloproliferative neoplasms and myelodysplastic syndromes, and shows either myeloproliferative or myelodysplastic features. We ask whether this distinction may have a molecular basis. We established the gene expression profiles of 39 samples of chronic myelomonocytic leukemia (including 12 CD34-positive) and 32 CD34-positive samples of myelodysplastic syndromes by using Affymetrix microarrays, and studied the status of 18 genes by Sanger sequencing and array-comparative genomic hybridization in 53 samples. Analysis of 12 mRNAs from chronic myelomonocytic leukemia established a gene expression signature of 122 probe sets differentially expressed between proliferative and dysplastic cases of chronic myelomonocytic leukemia. Compared with proliferative cases, dysplastic cases over-expressed genes involved in red blood cell biology. When applied to 32 myelodysplastic syndromes, this gene expression signature was able to discriminate refractory anemias with ring sideroblasts from refractory anemias with excess of blasts. By comparing mRNAs from these two forms of myelodysplastic syndromes, we derived a second gene expression signature. This signature separated the myelodysplastic and myeloproliferative forms of chronic myelomonocytic leukemia. These results were validated using two independent gene expression data sets. We found that myelodysplastic chronic myelomonocytic leukemias are characterized by mutations in transcription/epigenetic regulators (ASXL1, RUNX1, TET2) and splicing genes (SRSF2) and by the absence of mutations in signaling genes. Myelodysplastic chronic myelomonocytic leukemias and refractory anemias with ring sideroblasts share a common expression program, suggesting they are part of a continuum, which is not fully explained by their similar but not identical mutation spectra. © 2013 Ferrata Storti Foundation.

Relevance:

20.00%

Abstract:

Advocates of semi-structured interview techniques have often argued that rapport may be built, and power inequalities between interviewer and respondent counteracted, by strategic self-disclosure on the part of the interviewer. Strategies that use self-disclosure to construct similarity between interviewer and respondent rely on the presumption that the respondent will in fact interpret the interviewer's behaviour in this way. In this article we examine the role of interviewer self-disclosure using data drawn from three projects involving interviews with young people. We consider how an interviewer's attempts to ‘do similarity’ may be interpreted variously as displays of similarity or, ironically, as indicators of difference by the participant, and map the implications that this may have for subsequent interview dialogue. A particular object of concern relates to the ways in which self-disclosing acts may function in the negotiation of category entitlement within interview interactions.

Relevance:

20.00%

Abstract:

We employ the impulse approximation to describe positronium-atom scattering. Our analysis and calculations of Ps-Kr and Ps-Ar collisions provide a theoretical explanation of the similarity between the cross sections for positronium scattering and electron scattering for a range of atomic and molecular targets observed by S. J. Brawley et al. [Science 330, 789 (2010)].

Relevance:

20.00%

Abstract:

This book provides a comprehensive tutorial on similarity operators. The authors systematically survey the set of similarity operators, primarily focusing on their semantics, while also touching upon mechanisms for processing them effectively.

The book starts by providing introductory material on similarity search systems, highlighting the central role of similarity operators in such systems. This is followed by a systematic, categorized overview of the variety of similarity operators that have been proposed in the literature over the last two decades, including advanced operators such as RkNN, Reverse k-Ranks, Skyline k-Groups and K-N-Match. Since indexing is a core technology in the practical implementation of similarity operators, various indexing mechanisms are summarized. Finally, current research challenges are outlined, so as to enable interested readers to identify potential directions for future investigations.

In summary, this book offers a comprehensive overview of the field of similarity search operators, allowing readers to understand the area of similarity operators as it stands today, and in addition providing them with the background needed to understand recent novel approaches.

Relevance:

20.00%

Abstract:

An RkNN query returns all objects whose k nearest neighbors contain the query object. In this paper, we consider RkNN query processing in the case where the distances between attribute values are not necessarily metric. Dissimilarities between objects could then be a monotonic aggregate of dissimilarities between their values, with such aggregation functions being specified at query time. We outline real-world cases that motivate RkNN processing in such scenarios. We consider the AL-Tree index and its applicability in RkNN query processing. We develop an approach that exploits the group-level reasoning enabled by the AL-Tree in RkNN processing. We evaluate our approach against a naive approach that performs sequential scans on contiguous data and an improved block-based approach that we provide. We use real-world datasets and synthetic data with varying characteristics for our experiments. This extensive empirical evaluation shows that our approach is better than existing methods in terms of computational and disk access costs, leading to significantly better response times.