Making sense of microposts (#MSM2013) concept extraction challenge


Author(s): Cano Basave, Amparo Elizabeth; Varga, Andrea; Rowe, Matthew; Stankovic, Milan; Dadzie, Aba-Sah
Contributor(s)

Cano, Amparo E.

Rowe, Matthew

Stankovic, Milan

Dadzie, Aba-Sah

Date(s)

2013

Abstract

Microposts are small fragments of social media content published using a lightweight paradigm (e.g., Tweets, Facebook likes, foursquare check-ins). They have been used in a variety of applications (e.g., sentiment analysis, opinion mining, trend analysis) that glean useful information from them, often using third-party concept extraction tools. Uptake of such tools has been very large in the last few years, along with the creation and adoption of new methods for concept extraction. However, the evaluation of such efforts has been largely confined to document corpora (e.g., news articles), which calls into question the suitability of concept extraction tools and methods for Micropost data. This report describes the Making Sense of Microposts Workshop (#MSM2013) Concept Extraction Challenge, hosted in conjunction with the 2013 World Wide Web conference (WWW'13). The Challenge dataset comprised a manually annotated training corpus of Microposts and an unlabelled test corpus. Participants were set the task of engineering a concept extraction system for a defined set of concepts. Out of a total of 22 complete submissions, 13 were accepted for presentation at the workshop; the submissions covered methods ranging from sequence mining algorithms for attribute extraction to part-of-speech tagging for Micropost cleaning and rule-based and discriminative models for token classification. In this report we describe the evaluation process and explain the performance of different approaches in different contexts.

Format

application/pdf

Identifier

http://eprints.aston.ac.uk/27063/1/Making_sense_of_microposts_MSM2013_concept_extraction_challenge.pdf

Cano Basave, Amparo Elizabeth; Varga, Andrea; Rowe, Matthew; Stankovic, Milan and Dadzie, Aba-Sah (2013). Making sense of microposts (#MSM2013) concept extraction challenge. In: #MSM2013: concept extraction challenge at Making Sense of Microposts 2013. Cano, Amparo E.; Rowe, Matthew; Stankovic, Milan and Dadzie, Aba-Sah (eds). CEUR workshop proceedings. CEUR-WS.org.

Publisher

CEUR-WS.org

Relation

http://eprints.aston.ac.uk/27063/

Type

Book Section

NonPeerReviewed