928 resultados para Audio indexing


Relevância:

10.00% 10.00%

Publicador:

Resumo:

There is growing evidence that nonlinear time series analysis techniques can be used to successfully characterize, classify, or process signals derived from realworld dynamics even though these are not necessarily deterministic and stationary. In the present study we proceed in this direction by addressing an important problem our modern society is facing, the automatic classification of digital information. In particular, we address the automatic identification of cover songs, i.e. alternative renditions of a previously recorded musical piece. For this purpose we here propose a recurrence quantification analysis measure that allows tracking potentially curved and disrupted traces in cross recurrence plots. We apply this measure to cross recurrence plots constructed from the state space representation of musical descriptor time series extracted from the raw audio signal. We show that our method identifies cover songs with a higher accuracy as compared to previously published techniques. Beyond the particular application proposed here, we discuss how our approach can be useful for the characterization of a variety of signals from different scientific disciplines. We study coupled Rössler dynamics with stochastically modulated mean frequencies as one concrete example to illustrate this point.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Intuitively, music has both predictable and unpredictable components. In this work we assess this qualitative statement in a quantitative way using common time series models fitted to state-of-the-art music descriptors. These descriptors cover different musical facets and are extracted from a large collection of real audio recordings comprising a variety of musical genres. Our findings show that music descriptor time series exhibit a certain predictability not only for short time intervals, but also for mid-term and relatively long intervals. This fact is observed independently of the descriptor, musical facet and time series model we consider. Moreover, we show that our findings are not only of theoretical relevance but can also have practical impact. To this end we demonstrate that music predictability at relatively long time intervals can be exploited in a real-world application, namely the automatic identification of cover songs (i.e. different renditions or versions of the same musical piece). Importantly, this prediction strategy yields a parameter-free approach for cover song identification that is substantially faster, allows for reduced computational storage and still maintains highly competitive accuracies when compared to state-of-the-art systems.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

We present a new technique for audio signal comparison based on tonal subsequence alignment and its application to detect cover versions (i.e., different performances of the same underlying musical piece). Cover song identification is a task whose popularity has increased in the Music Information Retrieval (MIR) community along in the past, as it provides a direct and objective way to evaluate music similarity algorithms.This article first presents a series of experiments carried outwith two state-of-the-art methods for cover song identification.We have studied several components of these (such as chroma resolution and similarity, transposition, beat tracking or Dynamic Time Warping constraints), in order to discover which characteristics would be desirable for a competitive cover song identifier. After analyzing many cross-validated results, the importance of these characteristics is discussed, and the best-performing ones are finally applied to the newly proposed method. Multipleevaluations of this one confirm a large increase in identificationaccuracy when comparing it with alternative state-of-the-artapproaches.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

A new multimodal biometric database designed and acquired within the framework of the European BioSecure Network of Excellence is presented. It is comprised of more than 600 individuals acquired simultaneously in three scenarios: 1) over the Internet, 2) in an office environment with desktop PC, and 3) in indoor/outdoor environments with mobile portable hardware. The three scenarios include a common part of audio/video data. Also, signature and fingerprint data have been acquired both with desktop PC and mobile portable hardware. Additionally, hand and iris data were acquired in the second scenario using desktop PC. Acquisition has been conducted by 11 European institutions. Additional features of the BioSecure Multimodal Database (BMDB) are: two acquisitionsessions, several sensors in certain modalities, balanced gender and age distributions, multimodal realistic scenarios with simple and quick tasks per modality, cross-European diversity, availability of demographic data, and compatibility with other multimodal databases. The novel acquisition conditions of the BMDB allow us to perform new challenging research and evaluation of eithermonomodal or multimodal biometric systems, as in the recent BioSecure Multimodal Evaluation campaign. A description of this campaign including baseline results of individual modalities from the new database is also given. The database is expected to beavailable for research purposes through the BioSecure Association during 2008.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

In this paper we propose a new approach for tonic identification in Indian art music and present a proposal for acomplete iterative system for the same. Our method splits the task of tonic pitch identification into two stages. In the first stage, which is applicable to both vocal and instrumental music, we perform a multi-pitch analysis of the audio signal to identify the tonic pitch-class. Multi-pitch analysisallows us to take advantage of the drone sound, which constantlyreinforces the tonic. In the second stage we estimate the octave in which the tonic of the singer lies and is thusneeded only for the vocal performances. We analyse the predominant melody sung by the lead performer in order to establish the tonic octave. Both stages are individually evaluated on a sizable music collection and are shown toobtain a good accuracy. We also discuss the types of errors made by the method.Further, we present a proposal for a system that aims to incrementally utilize all the available data, both audio and metadata in order to identify the tonic pitch. It produces a tonic estimate and a confidence value, and is iterative in nature. At each iteration, more data is fed into the systemuntil the confidence value for the identified tonic is above a defined threshold. Rather than obtain high overall accuracy for our complete database, ultimately our goal is to develop a system which obtains very high accuracy on a subset of the database with maximum confidence.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

A Carnatic music concert is made up of a sequence of pieces, where each piece corresponds to a particular genre and ra¯aga (melody). Unlike a western music concert, the artist may be applauded intra-performance inter-performance. Most Carnatic music that is archived today correspond to a single audio recordings of entire concerts.The purpose of this paper is to segment single audio recordings into a sequence of pieces using thecharacteristic features of applause and music. Spectral flux, spectral entropy change quite significantly from music to applause and vice-versa. The characteristics of these features for a subset of concerts was studied. A threshold based approach was used to segment the pieces into music fragments and applauses. Preliminary resultson recordings 19 concerts from matched microphones show that the EER is about 17% for a resolution of 0.25 seconds. Further, a parameter called CUSUM is estimatedfor the applause regions. The CUSUM values determine the strength of the applause. The CUSUM is used to characterise the highlights of a concert.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

User generated content shared in online communities is often described using collaborative tagging systems where users assign labels to content resources. As a result, a folksonomy emerges that relates a number of tags with the resources they label and the users that have used them. In this paper we analyze the folksonomy of Freesound, an online audio clip sharing site which contains more than two million users and 150,000 user-contributed sound samplescovering a wide variety of sounds. By following methodologies taken from similar studies, we compute some metrics that characterize the folksonomy both at the globallevel and at the tag level. In this manner, we are able to betterunderstand the behavior of the folksonomy as a whole, and also obtain some indicators that can be used as metadata for describing tags themselves. We expect that such a methodology for characterizing folksonomies can be useful to support processes such as tag recommendation or automatic annotation of online resources.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The current research in Music Information Retrieval (MIR) is showing the potential that the Information Technologies can have in music related applications. Amajor research challenge in that direction is how to automaticallydescribe/annotate audio recordings and how to use the resulting descriptions to discover and appreciate music in new ways. But music is a complex phenomenonand the description of an audio recording has to deal with this complexity. For example, each musicculture has specificities and emphasizes different musicaland communication aspects, thus the musical recordings of each culture should be described differently. At the same time these cultural specificities give us the opportunity to pay attention to musical concepts andfacets that, despite being present in most world musics, are not easily noticed by listeners. In this paper we present some of the work done in the CompMusic project, including ideas and specific examples on how to take advantage of the cultural specificities of differentmusical repertoires. We will use examples from the art music traditions of India, Turkey and China.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

A pesar de que cada vez son más las investigaciones vinculadas al análisis de los videojuegos, pocas son las orientadas a determinar la articulación de su dimensión persuasiva. Partiendo principalmente de los trabajos de Ian Bogost y Gonzalo Frasca sobre la persuasión, se resiguen las carencias metodológicas de los modelos planteados y se proponen una serie de hipótesis orientadas a la búsqueda de una metodología que posibilite dar respuesta a la siguiente pregunta: ¿Cómo y dónde se articula la dimensión persuasiva de los videojuegos?En este sentido se trata de estudiar la dimensión persuasiva en los videojuegos de manera integral. La supuesta capacidad de las reglas de juego, propiedad intrínseca de los juegos y videojuegos, de determinar tanto la estructura narrativa como la parte audio/visual del texto (videojuego), así como su condición esencial de ser las portadoras de la carga persuasiva, permite apuntar como objetivo fundamental el diseño de un protocolo integral de la persuasión en los videojuegos

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Tämä insinöörityö tehtiin Kuulonhuoltoliitto ry:n Esteetön kuuntelu -projektille. Työ käsittelee kuulolaitteen käyttäjän apuvälineenä käytettäviä induktiosilmukkajärjestelmiä. Induktiivisen äänensiirtojärjestelmän avulla huonokuuloinen saa kuunteluympäristöstään paremman signaali-kohinasuhteen ilman tilan kaikua ja taustahälyä. Tämä helpottaa kuuntelua esimerkiksi puhetilaisuuksissa. Työn tarkoituksena oli päivittää aiempia tutkimuksia kokoamalla uusin tietous induktiosilmukoiden tekniikasta. Kirjallisuustutkimuksen lisäksi kartoitettiin nykyinen silmukkavahvistinkanta haastattelemalla maahantuojia ja jälleenmyyjiä. Silmukkavahvistimien hintakartoituksen tarkoituksena oli rakentaa perusteet sopivan tehoisen ja hintaisen vahvistimen hankintaa varten suunniteltaessa induktiivisella äänensiirtojärjestelmällä varustettavia tiloja. Työ tarkastelee induktiivisen äänensiirtojärjestelmän ensisijaisen kohderyhmän, kuulokojeita käyttävien huonokuuloisten keskeisiä tarpeita ja heidän käyttämäänsä apuvälinetekniikkaa. Työ käy läpi audiotekniikan perusteita, sähkömagneettisen induktioilmiön, induktiivisen äänensiirtojärjestelmän rakenteen, sen ylikuulumisongelmat ja häiriötekijät sekä IEC:n 60118-4 edition 2.0 standardin CDV-luonnoksen mukaiset periaatteelliset asennusja mittausohjeet. IEC:n tuleva standardi tekee työstä ajankohtaisen. Työn tuloksena saatua tietoa voidaan käyttää Kuulonhuoltoliitto ry:n opas- ja tiedotusmateriaalin uudistamisessa, pohjana hyvän kuunteluympäristön malliratkaisujen ja esteettömyyskriteerien soveltamisessa. Työstä on hyötyä erityisesti Kuulonhuoltoliitto ry:n vapaaehtoiskartoittajille kuten myös kenelle tahansa aiheesta syvemmin kiinnostuneelle.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The thesis studies the launch campaign of Big Brother Finland, especially from the viewpoint of on-air promotion. Interest to the subject arose when participating in the campaign as an on-air promotion planner together with Subtv's marketing director, on-air promotion editor and the channel's advertising agency. The launch of the campaign was a challenge due to the format, since not a lot of information can be revealed before the start of the program. When the planning started, all the material consisted of two logos. The first season of the Finnish version of Big Brother begun on Subtv August 2005. The goal of the program was to become a topic of discussion on TV on the fall 2005 and to raise the profile of the channel. The goal of the launch was to get good ratings for the first episode. The launch campaign was also supposed to open up the format to the viewers and to arouse interest in the show. Secrecy and the size of the program were set to be the marketing tones of the launch. Although partly different messages were told via on-air promotion and external media, the campaign was congruent in visual design. In the study, interviews of Subtv's staff, campaign plans and notes were used as research material. From the aspect of affecting images and emotions, the finished campaign promos and other on-air elements were analyzed. In on-air promotion, all choices in audio and visual design affect the outcome and therefore the images that the viewer constructs. The two promo series were made to affect emotions and to awaken curiosity. Other on-air elements were merely used to present program information. The campaign and the series were accepted with enthusiasm. The launch of the second season was even more massive than the first. Participation in the launch campaign of Big Brother Finland was an essential experience in the development of professional identity. When one has taken part in the creation of a massive campaign from scarce materials, tools are given to future assignments in the field of on-air promotion.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Tämän insinöörityön tavoitteena oli tehdä MATLAB-sovellus, jolla voidaan laskea ja tulostaa yleisradiotoiminnassa käytettävien dipolipaneeliantennien säteilykuvioita. Toinen tavoite oli saattaa tämän työn valvojan Antti Koivumäen vuoden 1980 diplomityön laskenta ja grafiikka modernimmalle tasolle. Aluksi työssä tarkasteltiin yleisiä asioita VHF- ja UHF-taajuuksien lähetystavoista ja lähetinantennityypeistä. Alussa käytiin läpi myös antennijoukon ja -ryhmän käsite. Seuraavaksi tutustuttiin tarkemmin lähetinantennien rakenteeseen ja ominaisuuksiin. Antennien ominaisuuksista tarkasteltiin säteilykuviota, suuntaavuutta, vahvistusta, hyötysuhdetta ja polarisaatiota. Näistä säteilykuvio oli tarkimman tutkinnan kohteena. Säteilykuvion laskeminen esitettiin kaavojen ja havainnoillistavien kuvien avulla. Lopuksi perehdyttiin antennin säteilykuvion laskemiseen tietokoneella, etenkin MATLABilla. Osuudessa tarkasteltiin graafisen käyttöliittymän tekemistä, laskennan suorittamista ja horisontaalisen säteilykuvion esittämistä graafisesti. Työn tuloksena saatiin tehdyksi MATLAB-sovellus, joka laskee elementtiantennin horisontaalisen säteilykuvion annettujen parametrien avulla ja tulostaa sen.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

ABSTRACT This thesis is composed of two main parts. The first addressed the question of whether the auditory and somatosensory systems, like their visual counterpart, comprise parallel functional pathways for processing identity and spatial attributes (so-called `what' and `where' pathways, respectively). The second part examined the independence of control processes mediating task switching across 'what' and `where' pathways in the auditory and visual modalities. Concerning the first part, electrical neuroimaging of event-related potentials identified the spatio-temporal mechanisms subserving auditory (see Appendix, Study n°1) and vibrotactile (see Appendix, Study n°2) processing during two types of blocks of trials. `What' blocks varied stimuli in their frequency independently of their location.. `Where' blocks varied the same stimuli in their location independently of their frequency. Concerning the second part (see Appendix, Study n°3), a psychophysical task-switching paradigm was used to investigate the hypothesis that the efficacy of control processes depends on the extent of overlap between the neural circuitry mediating the different tasks at hand, such that more effective task preparation (and by extension smaller switch costs) is achieved when the anatomical/functional overlap of this circuitry is small. Performance costs associated with switching tasks and/or switching sensory modalities were measured. Tasks required the analysis of either the identity or spatial location of environmental objects (`what' and `where' tasks, respectively) that were presented either visually or acoustically on any given trial. Pretrial cues informed participants of the upcoming task, but not of the sensory modality. - In the audio-visual domain, the results showed that switch costs between tasks were significantly smaller when the sensory modality of the task switched versus when it repeated. In addition, switch costs between the senses were correlated only when the sensory modality of the task repeated across trials and not when it switched. The collective evidence not only supports the independence of control processes mediating task switching and modality switching, but also the hypothesis that switch costs reflect competitive interterence between neural circuits that in turn can be diminished when these neural circuits are distinct. - In the auditory and somatosensory domains, the findings show that a segregation of location vs. recognition information is observed across sensory systems and that these happen around 100ms for both sensory modalities. - Also, our results show that functionally specialized pathways for audition and somatosensation involve largely overlapping brain regions, i.e. posterior superior and middle temporal cortices and inferior parietal areas. Both these properties (synchrony of differential processing and overlapping brain regions) probably optimize the relationships across sensory modalities. - Therefore, these results may be indicative of a computationally advantageous organization for processing spatial anal identity information.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Objective: Presenting a Virtual Environment (VE) based on the Protocol of Treatment of Hypertension and Diabetes Mellitus type 2, used in Primary Care for evaluation of dietary habits in nursing consultations. Method: An experimental study applied by two nurses and a nurse manager, in a sample of 30 deaf patients aged between 30 and 60 years. The environment was built in Visual Basic NET and offered eight screens about feeding containing food pictures, videos in Libras (Brazilian sign language) and audio. The analysis of the VE was done through questionnaires applied to patients and professionals by the Poisson statistical test. Results: The VE shows the possible diagnostics in red, yellow, green and blue colors, depending on the degree of patients’ need. Conclusion: The environment obtained excellent acceptance by patients and nurses, allowing great interaction between them, even without an interpreter. The time in consultation was reduced to 15 minutes, with the preservation of patient privacy.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Objective: To investigate the relationship between the work environment and leadership in nursing. Method: An integrative review of literature which was based on data from LILACS, PubMed, CINAHL and the SciELO portal for journals covering the period from January to April 2013. The inclusion criteria were: the indexing of research covering leadership exercised by nurses over a team and whether the research was available in English, Spanish or Portuguese. Results: The sample consisted of 12 articles that met the criteria. Conclusion: The results showed that leadership had an impact on the work environment. However, no studies were found that showed the influence of the working environment on leadership in nursing.