Biblioteca Digital

2 resultados para Interviews as topic

em Indian Institute of Science - Bangalore - Índia

Filtro por publicador

Efficient classification using phrases generated by topic models

Relevância:

20.00% 20.00%

Publicador:

Resumo:

There are many popular models available for classification of documents like Naïve Bayes Classifier, k-Nearest Neighbors and Support Vector Machine. In all these cases, the representation is based on the “Bag of words” model. This model doesn't capture the actual semantic meaning of a word in a particular document. Semantics are better captured by proximity of words and their occurrence in the document. We propose a new “Bag of Phrases” model to capture this discriminative power of phrases for text classification. We present a novel algorithm to extract phrases from the corpus using the well known topic model, Latent Dirichlet Allocation(LDA), and to integrate them in vector space model for classification. Experiments show a better performance of classifiers with the new Bag of Phrases model against related representation models.

Veja mais

Preface to Special Topic: Emerging Techniques in Fluorescence Microscopy and Imaging

Relevância:

20.00% 20.00%

Publicador:

Veja mais