DAEDALUS at PAN 2014: Guessing tweet author's gender and age


Autoria(s): Villena Román, Julio; González Cristóbal, José Carlos
Data(s)

2014

Resumo

This paper describes our participation at PAN 2014 author profiling task. Our idea was to define, develop and evaluate a simple machine learning classifier able to guess the gender and the age of a given user based on his/her texts, which could become part of the solution portfolio of the company. We were interested in finding not the best possible classifier that achieves the highest accuracy, but to find the optimum balance between performance and throughput using the most simple strategy and less dependent of external systems. Results show that our software using Naive Bayes Multinomial with a term vector model representation of the text is ranked quite well among the rest of participants in terms of accuracy.

Formato

application/pdf

Identificador

http://oa.upm.es/35363/

Idioma(s)

eng

Publicador

E.T.S.I. Telecomunicación (UPM)

Relação

http://oa.upm.es/35363/1/INVE_MEM_2014_192837.pdf

info:eu-repo/semantics/altIdentifier/doi/null

Direitos

http://creativecommons.org/licenses/by-nc-nd/3.0/es/

info:eu-repo/semantics/openAccess

Fonte

5th Conference and Labs of the Evaluation Forum (CLEF 2014) Information Access Evaluation meets Multilinguality, Multimodality, and Interaction | 5th Conference and Labs of the Evaluation Forum (CLEF 2014) Information Access Evaluation meets Multilinguality, Multimodality, and Interaction | 15/09/2014 - 18/09/2014 | Sheffield, UK

Palavras-Chave #Filología #Informática #Telecomunicaciones
Tipo

info:eu-repo/semantics/conferenceObject

Ponencia en Congreso o Jornada

PeerReviewed