A Cross-modal Approach for Karaoke Artifacts Correction


Author(s): Yan, WeiQi
Date(s)

01/09/2008

Abstract

Karaoke singing is a popular form of entertainment in several parts of the world. Since this genre of performance attracts amateurs, the singing often has artifacts related to scale, tempo, and synchrony. We have developed an approach that corrects these artifacts using information from cross-modal multimedia streams. We first perform adaptive sampling on the user's rendition and then use the original singer's rendition, together with the video caption highlighting information, to correct the pitch, tempo, and loudness. A method of analogies is employed to perform this correction. The basic idea is to manipulate the user's rendition so that it is as similar as possible to the original singing. A pre-processing step that removes noise due to feedback and huffing also helps improve the quality of the user's audio. The results described in the paper show the effectiveness of this multimedia approach.

Identifier

http://pure.qub.ac.uk/portal/en/publications/a-crossmodal-approach-for-karaoke-artifacts-correction(17560676-de95-432c-8c9a-06db37957f62).html

http://dx.doi.org/10.1007/s11042-007-0174-z

http://www.scopus.com/inward/record.url?scp=47949092014&partnerID=8YFLogxK

Language(s)

eng

Rights

info:eu-repo/semantics/restrictedAccess

Source

Yan, W 2008, 'A Cross-modal Approach for Karaoke Artifacts Correction', Multimedia Tools and Applications, vol. 39, no. 3, pp. 413-439. DOI: 10.1007/s11042-007-0174-z

Keywords

#/dk/atira/pure/subjectarea/asjc/1700/1703 #Computational Theory and Mathematics
#/dk/atira/pure/subjectarea/asjc/1700/1704 #Computer Graphics and Computer-Aided Design
#/dk/atira/pure/subjectarea/asjc/1700/1710 #Information Systems
#/dk/atira/pure/subjectarea/asjc/1700/1712 #Software
#/dk/atira/pure/subjectarea/asjc/2200/2208 #Electrical and Electronic Engineering
#/dk/atira/pure/subjectarea/asjc/2600/2614 #Theoretical Computer Science

Type

article