Biblioteca Digital

In any data mining applications, automated text and text and image retrieval of information is needed. This becomes essential with the growth of the Internet and digital libraries. Our approach is based on the latent semantic indexing (LSI) and the corresponding term-by-document matrix suggested by Berry and his co-authors. Instead of using deterministic methods to find the required number of first "k" singular triplets, we propose a stochastic approach. First, we use Monte Carlo method to sample and to build much smaller size term-by-document matrix (e.g. we build k x k matrix) from where we then find the first "k" triplets using standard deterministic methods. Second, we investigate how we can reduce the problem to finding the "k"-largest eigenvalues using parallel Monte Carlo methods. We apply these methods to the initial matrix and also to the reduced one. The algorithms are running on a cluster of workstations under MPI and results of the experiments arising in textual retrieval of Web documents as well as comparison of the stochastic methods proposed are presented. (C) 2003 IMACS. Published by Elsevier Science B.V. All rights reserved.

Veja mais

Cross-lingual information retrieval as a side effect of tagging

Relevância:

100.00% 100.00%

Publicador:

Veja mais

The effect of folksonomy in information retrieval: a case study in Arabic documents

Relevância:

100.00% 100.00%

Publicador:

Veja mais

Soft-link hypertext for information retrieval

Relevância:

100.00% 100.00%

Publicador:

Veja mais

Markov Random Fields and Maximum Entropy Modeling for Music Information Retrieval

Relevância:

100.00% 100.00%

Publicador:

Veja mais

Type-safe versioned object query language

Relevância:

100.00% 100.00%

Publicador:

Veja mais

Indexing and information retrieval

Relevância:

100.00% 100.00%

Publicador:

Veja mais

Temporal Information Retrieval

Relevância:

100.00% 100.00%

Publicador:

Veja mais

Generalized Graph Matching for Data Mining and Information Retrieval

Relevância:

100.00% 100.00%

Publicador:

Veja mais

Prometheus Framework for Fuzzy Information Retrieval in Semantic Spaces

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This paper introduces a novel vision for further enhanced Internet of Things services. Based on a variety of data (such as location data, ontology-backed search queries, in- and outdoor conditions) the Prometheus framework is intended to support users with helpful recommendations and information preceding a search for context-aware data. Adapted from artificial intelligence concepts, Prometheus proposes user-readjusted answers on umpteen conditions. A number of potential Prometheus framework applications are illustrated. Added value and possible future studies are discussed in the conclusion.

Veja mais

924 resultados para XML, Information, Retrieval, Query, Language

Filtro por publicador