A survey of tagging techniques for music, speech and environmental sound


Autoria(s): Duan, Shufei; Zhang, Jinglan; Roe, Paul; Towsey, Michael W.
Data(s)

2012

Resumo

Sound tagging has been studied for years. Among all sound types, music, speech, and environmental sound are three hottest research areas. This survey aims to provide an overview about the state-of-the-art development in these areas.We discuss about the meaning of tagging in different sound areas at the beginning of the journey. Some examples of sound tagging applications are introduced in order to illustrate the significance of this research. Typical tagging techniques include manual, automatic, and semi-automatic approaches.After reviewing work in music, speech and environmental sound tagging, we compare them and state the research progress to date. Research gaps are identified for each research area and the common features and discriminations between three areas are discovered as well. Published datasets, tools used by researchers, and evaluation measures frequently applied in the analysis are listed. In the end, we summarise the worldwide distribution of countries dedicated to sound tagging research for years.

Formato

application/pdf

Identificador

http://eprints.qut.edu.au/56097/

Publicador

Springer Netherlands

Relação

http://eprints.qut.edu.au/56097/1/56097.pdf

DOI:10.1007/s10462-012-9362-y

Duan, Shufei, Zhang, Jinglan, Roe, Paul, & Towsey, Michael W. (2012) A survey of tagging techniques for music, speech and environmental sound. Artificial Intelligence Review, pp. 1-25.

Direitos

Springer Science+Business Media Dordrecht

The final publication is available at www.springerlink.com

Fonte

School of Electrical Engineering & Computer Science; Science & Engineering Faculty

Palavras-Chave #080107 Natural Language Processing #080109 Pattern Recognition and Data Mining #080199 Artificial Intelligence and Image Processing not elsewhere classified #080602 Computer-Human Interaction #Sound tagging #Music tagging #Speech recognition #Environmental sound tagging #Manual tagging #Automatic tagging #Semi-automatic tagging
Tipo

Journal Article