9 resultados para latent semantic analysis

em Universidad de Alicante


Relevância:

100.00% 100.00%

Publicador:

Resumo:

The present is marked by the availability of large volumes of heterogeneous data, whose management is extremely complex. While the treatment of factual data has been widely studied, the processing of subjective information still poses important challenges. This is especially true in tasks that combine Opinion Analysis with other challenges, such as the ones related to Question Answering. In this paper, we describe the different approaches we employed in the NTCIR 8 MOAT monolingual English (opinionatedness, relevance, answerness and polarity) and cross-lingual English-Chinese tasks, implemented in our OpAL system. The results obtained when using different settings of the system, as well as the error analysis performed after the competition, offered us some clear insights on the best combination of techniques, that balance between precision and recall. Contrary to our initial intuitions, we have also seen that the inclusion of specialized Natural Language Processing tools dealing with Temporality or Anaphora Resolution lowers the system performance, while the use of topic detection techniques using faceted search with Wikipedia and Latent Semantic Analysis leads to satisfactory system performance, both for the monolingual setting, as well as in a multilingual one.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

El trabajo desarrolla una propuesta metodológica para abordar los principales problemas que plantea el análisis de la mortalidad a partir de las expresiones diagnósticas que se recogen en las partidas de defunción de los registros parroquiales y civiles. La cuestión diacrónica o de recorrido cronológico de las expresiones es abordada desde las técnicas del análisis semántico documental y el estudio de sus tipologías demográfico-sanitarias. La agrupación de las diversas causas de muerte se resuelve con la utilización simultánea de la Segunda Nomenclatura de la Primera Clasificación de Causas de Muerte propuesta por Jacques Bertillon en 1899, y una modificación de la clasificación propuesta por Thomas McKeown en su conocida monografía sobre "El crecimiento moderno de la población".

Relevância:

80.00% 80.00%

Publicador:

Resumo:

In this paper we address two issues. The first one analyzes whether the performance of a text summarization method depends on the topic of a document. The second one is concerned with how certain linguistic properties of a text may affect the performance of a number of automatic text summarization methods. For this we consider semantic analysis methods, such as textual entailment and anaphora resolution, and we study how they are related to proper noun, pronoun and noun ratios calculated over original documents that are grouped into related topics. Given the obtained results, we can conclude that although our first hypothesis is not supported, since it has been found no evident relationship between the topic of a document and the performance of the methods employed, adapting summarization systems to the linguistic properties of input documents benefits the process of summarization.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Objetivos: Describir necesidades y experiencias de madres con hijos menores de un año, identificar los factores que dificultan la transición a la maternidad y orientar en el contenido de un programa de promoción de la salud a desarrollar en sesiones grupales de apoyo a la maternidad. Diseño: Estudio cualitativo con enfoque fenomenológico. Emplazamiento: Ocho centros de Atención Primaria de la provincia de Barcelona, entre julio de 2011 y julio de 2012. Participantes: Un total de 21 madres que participan en dinámicas grupales de apoyo a la maternidad. Método: Selección opinática de las participantes en las entrevistas semiestructuradas. Las transcripciones se analizaron en su estructura (análisis de contenido latente) y contenido (análisis de contenido manifiesto), obteniéndose diferentes categorías. Resultados: Las participantes en el estudio definen el constructo de la maternidad en torno a 3 categorías: los cambios en el estilo de vida, los sentimientos y las percepciones. Identifican como momentos más estresantes: «el nuevo rol», «los cambios en la relación de pareja», «sentimientos encontrados», «experiencias del embarazo y parto», «la idealización», «la falta de apoyo», «llantos», «cólicos», «interpretar las señales del niño», «baño», «descanso», «opiniones contradictorias», «aprendizaje» y «adquisición de nuevas habilidades». Destacan como temas principales para las dinámicas grupales: alimentación, desarrollo, relación afectiva, confianza materna, participación de los padres, papel de la familia, aspectos emocionales, descanso, masaje, baño, prevención de accidentes, cólicos, primeros auxilios, puericultura, recursos y vacunas. Conclusión: Las dinámicas grupales deben contextualizarse de acuerdo a las necesidades percibidas por las madres y permitir la participación de otras figuras familiares.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

In this paper we present the enrichment of the Integration of Semantic Resources based in WordNet (ISR-WN Enriched). This new proposal improves the previous one where several semantic resources such as SUMO, WordNet Domains and WordNet Affects were related, adding other semantic resources such as Semantic Classes and SentiWordNet. Firstly, the paper describes the architecture of this proposal explaining the particularities of each integrated resource. After that, we analyze some problems related to the mappings of different versions and how we solve them. Moreover, we show the advantages that this kind of tool can provide to different applications of Natural Language Processing. Related to that question, we can demonstrate that the integration of semantic resources allows acquiring a multidimensional vision in the analysis of natural language.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This paper addresses the problem of the automatic recognition and classification of temporal expressions and events in human language. Efficacy in these tasks is crucial if the broader task of temporal information processing is to be successfully performed. We analyze whether the application of semantic knowledge to these tasks improves the performance of current approaches. We therefore present and evaluate a data-driven approach as part of a system: TIPSem. Our approach uses lexical semantics and semantic roles as additional information to extend classical approaches which are principally based on morphosyntax. The results obtained for English show that semantic knowledge aids in temporal expression and event recognition, achieving an error reduction of 59% and 21%, while in classification the contribution is limited. From the analysis of the results it may be concluded that the application of semantic knowledge leads to more general models and aids in the recognition of temporal entities that are ambiguous at shallower language analysis levels. We also discovered that lexical semantics and semantic roles have complementary advantages, and that it is useful to combine them. Finally, we carried out the same analysis for Spanish. The results obtained show comparable advantages. This supports the hypothesis that applying the proposed semantic knowledge may be useful for different languages.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

One of the main challenges to be addressed in text summarization concerns the detection of redundant information. This paper presents a detailed analysis of three methods for achieving such goal. The proposed methods rely on different levels of language analysis: lexical, syntactic and semantic. Moreover, they are also analyzed for detecting relevance in texts. The results show that semantic-based methods are able to detect up to 90% of redundancy, compared to only the 19% of lexical-based ones. This is also reflected in the quality of the generated summaries, obtaining better summaries when employing syntactic- or semantic-based approaches to remove redundancy.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

In this work we present a semantic framework suitable of being used as support tool for recommender systems. Our purpose is to use the semantic information provided by a set of integrated resources to enrich texts by conducting different NLP tasks: WSD, domain classification, semantic similarities and sentiment analysis. After obtaining the textual semantic enrichment we would be able to recommend similar content or even to rate texts according to different dimensions. First of all, we describe the main characteristics of the semantic integrated resources with an exhaustive evaluation. Next, we demonstrate the usefulness of our resource in different NLP tasks and campaigns. Moreover, we present a combination of different NLP approaches that provide enough knowledge for being used as support tool for recommender systems. Finally, we illustrate a case of study with information related to movies and TV series to demonstrate that our framework works properly.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The geographical proximity and socioeconomic dependence on the United States brought about a deep rooted anglicization of the Cuban Spanish lexis and social strata, especially throughout the Neocolonial period (1902–1959). This study is based on the revision of a renowned newspaper of that time, Diario de la Marina, and the corresponding elaboration of a corpus of English-induced loanwords. Diario de la Marina particularly targeted upper social class, and only crónicas sociales (society pages’ columns) and print advertising were revised because of their fully descriptive texts, which encoded the ruling class ideology and consumerism. The findings show that there existed a high number of lexical and cultural anglicisms in the sociolect in question, and that the sociolinguistic anglicization was openly embraced by the upper socioeconomic stratum, entailing a differentiating sign of sophistication and social stratification. Likewise, a number of the anglicisms collected, particularly those related with social events, are unused in contemporary Cuban Spanish, which suggests a major semantic shifting in this sociolect after 1959.