Sentiment classification for early detection of health alerts in the chemical textile domain


Autoria(s): Fernández Martínez, Javier; Prieto, Carolina; Lloret, Elena; Gómez, José M.; Martínez-Barco, Patricio; Palomar, Manuel
Contribuinte(s)

Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos

Procesamiento del Lenguaje y Sistemas de Información (GPLSI)

Data(s)

14/01/2014

14/01/2014

01/12/2013

Resumo

In the chemical textile domain experts have to analyse chemical components and substances that might be harmful for their usage in clothing and textiles. Part of this analysis is performed searching opinions and reports people have expressed concerning these products in the Social Web. However, this type of information on the Internet is not as frequent for this domain as for others, so its detection and classification is difficult and time-consuming. Consequently, problems associated to the use of chemical substances in textiles may not be detected early enough, and could lead to health problems, such as allergies or burns. In this paper, we propose a framework able to detect, retrieve, and classify subjective sentences related to the chemical textile domain, that could be integrated into a wider health surveillance system. We also describe the creation of several datasets with opinions from this domain, the experiments performed using machine learning techniques and different lexical resources such as WordNet, and the evaluation focusing on the sentiment classification, and complaint detection (i.e., negativity). Despite the challenges involved in this domain, our approach obtains promising results with an F-score of 65% for polarity classification and 82% for complaint detection.

Financial support given by the Department of Software and Computer Systems at the University of Alicante, the Spanish Ministry of Economy and Competitivity (Spanish Government) by the project grants TEXT- MESS 2.0 (TIN2009-13391-C04-01), LEGOLANG (TIN2012-31224), and the Valencian Government (grant no. PROMETEO/2009/119).

Identificador

Computational Linguistics in the Netherlands Journal. 2013, 3: 135-147

2211-4009

http://hdl.handle.net/10045/34948

Idioma(s)

eng

Publicador

Computational Linguistics in the Netherlands

Relação

http://www.clinjournal.org/

Direitos

info:eu-repo/semantics/openAccess

Palavras-Chave #Chemical textile domain #Sentiment classification #Complaint detection #Health surveillance #Lenguajes y Sistemas Informáticos
Tipo

info:eu-repo/semantics/article