Biblioteca Digital

4 resultados para Similarity measure

em Bulgarian Digital Mathematics Library at IMI-BAS

Using Covariance as a Similarity Measure for Document Language Identification in Hard Contexts

Relevância:

100.00% 100.00%

Publicador:

Resumo:

2000 Mathematics Subject Classification: C2P99.

Veja mais

The Use of Situation Representation when Searching for Solutions in Computer Aided Design Systems

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Projects solutions reuse methodology is offered for software development. The main idea consists in connection of the system objective with the situation using the entities which describe the condition of the system in the process of the objective statement. Every situation is associated with one or several design solutions, which can be used at the development. Based on this connection the situation representing language has been created, it lets to express a problem situation using a natural language describe. The similarity measure has been built to compare situations, it is based on the similarity coefficients with adding the absent part weight.

Veja mais

Combination of Global and Local Attributional Similarities for Synonym Detection

Relevância:

60.00% 60.00%

Publicador:

Resumo:

2000 Mathematics Subject Classification: 68T50.

Veja mais

Topic Segmentation: How Much Can We Do by Counting Words and Sequences of Words

Relevância:

60.00% 60.00%

Publicador:

Resumo:

In this paper, we present an innovative topic segmentation system based on a new informative similarity measure that takes into account word co-occurrence in order to avoid the accessibility to existing linguistic resources such as electronic dictionaries or lexico-semantic databases such as thesauri or ontology. Topic segmentation is the task of breaking documents into topically coherent multi-paragraph subparts. Topic segmentation has extensively been used in information retrieval and text summarization. In particular, our architecture proposes a language-independent topic segmentation system that solves three main problems evidenced by previous research: systems based uniquely on lexical repetition that show reliability problems, systems based on lexical cohesion using existing linguistic resources that are usually available only for dominating languages and as a consequence do not apply to less favored languages and finally systems that need previously existing harvesting training data. For that purpose, we only use statistics on words and sequences of words based on a set of texts. This solution provides a flexible solution that may narrow the gap between dominating languages and less favored languages thus allowing equivalent access to information.

Veja mais

4 resultados para Similarity measure

em Bulgarian Digital Mathematics Library at IMI-BAS

Filtro por publicador

Using Covariance as a Similarity Measure for Document Language Identification in Hard Contexts

The Use of Situation Representation when Searching for Solutions in Computer Aided Design Systems

Combination of Global and Local Attributional Similarities for Synonym Detection

Topic Segmentation: How Much Can We Do by Counting Words and Sequences of Words