2 resultados para HTML (Language for Labelling Documents)

em Aston University Research Archive


Relevância:

30.00% 30.00%

Publicador:

Resumo:

Latent topics derived by topic models such as Latent Dirichlet Allocation (LDA) are the result of hidden thematic structures which provide further insights into the data. The automatic labelling of such topics derived from social media poses however new challenges since topics may characterise novel events happening in the real world. Existing automatic topic labelling approaches which depend on external knowledge sources become less applicable here since relevant articles/concepts of the extracted topics may not exist in external sources. In this paper we propose to address the problem of automatic labelling of latent topics learned from Twitter as a summarisation problem. We introduce a framework which apply summarisation algorithms to generate topic labels. These algorithms are independent of external sources and only rely on the identification of dominant terms in documents related to the latent topic. We compare the efficiency of existing state of the art summarisation algorithms. Our results suggest that summarisation algorithms generate better topic labels which capture event-related context compared to the top-n terms returned by LDA. © 2014 Association for Computational Linguistics.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

In this paper, I examine how language policy acts as a means of both empowering the Welsh language and theminority language worker and as a means of exerting power over them. For this purpose, the study focuses on a particular site: private sector businesses in Wales. Therein, I trace two major discursive processes: first, the Welsh Government’s national language policy documents that promote corporate bilingualism and bilingual employees as value-added resources; second, the practice and discourse of company managers who sustain or appropriate such promotional discourses for creating and promoting their own organisational values. By drawing on concepts from governmentality, critical language policy and discourse studies, I show that promoting bilingualism in business is characterised by local and global governmentalities. These not only bring about critical shifts in valuing language as symbolic entities attached to ethnonational concerns or as promotional objects that bring material gain. Language governmentalities also appear to shape new forms of ‘languaging’ the minority language worker as selfgoverning, and yet, governed subjects who are ultimately made responsible for ‘owning’ Welsh.