7 resultados para Radio and music.
em Archivo Digital para la Docencia y la Investigación - Repositorio Institucional de la Universidad del País Vasco
Resumo:
Query-by-Example Spoken Term Detection (QbE STD) aims at retrieving data from a speech data repository given an acoustic query containing the term of interest as input. Nowadays, it has been receiving much interest due to the high volume of information stored in audio or audiovisual format. QbE STD differs from automatic speech recognition (ASR) and keyword spotting (KWS)/spoken term detection (STD) since ASR is interested in all the terms/words that appear in the speech signal and KWS/STD relies on a textual transcription of the search term to retrieve the speech data. This paper presents the systems submitted to the ALBAYZIN 2012 QbE STD evaluation held as a part of ALBAYZIN 2012 evaluation campaign within the context of the IberSPEECH 2012 Conference(a). The evaluation consists of retrieving the speech files that contain the input queries, indicating their start and end timestamps within the appropriate speech file. Evaluation is conducted on a Spanish spontaneous speech database containing a set of talks from MAVIR workshops(b), which amount at about 7 h of speech in total. We present the database metric systems submitted along with all results and some discussion. Four different research groups took part in the evaluation. Evaluation results show the difficulty of this task and the limited performance indicates there is still a lot of room for improvement. The best result is achieved by a dynamic time warping-based search over Gaussian posteriorgrams/posterior phoneme probabilities. This paper also compares the systems aiming at establishing the best technique dealing with that difficult task and looking for defining promising directions for this relatively novel task.
Resumo:
Feature-based vocoders, e.g., STRAIGHT, offer a way to manipulate the perceived characteristics of the speech signal in speech transformation and synthesis. For the harmonic model, which provide excellent perceived quality, features for the amplitude parameters already exist (e.g., Line Spectral Frequencies (LSF), Mel-Frequency Cepstral Coefficients (MFCC)). However, because of the wrapping of the phase parameters, phase features are more difficult to design. To randomize the phase of the harmonic model during synthesis, a voicing feature is commonly used, which distinguishes voiced and unvoiced segments. However, voice production allows smooth transitions between voiced/unvoiced states which makes voicing segmentation sometimes tricky to estimate. In this article, two-phase features are suggested to represent the phase of the harmonic model in a uniform way, without voicing decision. The synthesis quality of the resulting vocoder has been evaluated, using subjective listening tests, in the context of resynthesis, pitch scaling, and Hidden Markov Model (HMM)-based synthesis. The experiments show that the suggested signal model is comparable to STRAIGHT or even better in some scenarios. They also reveal some limitations of the harmonic framework itself in the case of high fundamental frequencies.
Resumo:
En el trabajo que nos ocupa analizaremos la relación entre diversos ámbitos culturales y la publicidad internacional audiovisual. En la publicidad audiovisual, como la televisiva, las imágenes cargadas de valor simbólico son determinantes para el posicionamiento de una marca y para llegar a la mente del consumidor. A lo largo del estudio relacionaremos el simbolismo de estas imágenes con la mitología y la música para acercarnos a emociones universales que faciliten la relación entre la marca y el consumidor, o bien, entre el spot publicitario y el espectador. En el ámbito internacional, donde se encuentran personas con identidades culturales muy diferentes, y desde una perspectiva comunicacional sería de gran utilidad conocer aspectos comunes en la mente de todo ser humano que favoreciesen la estandarización y sus consecuentes sinergias.
Resumo:
[ES] Shoot&Soul Festival es un festival de cine y música para Bilbao. El proyecto pretende convertir la ciudad en un punto de encuentro multicultural con el uso del inglés como lengua principal; a la vez que promover la cultura y costumbres locales como la gastronomía, el folclore, etc.
Resumo:
This paper proposes a new method for local key and chord estimation from audio signals. This method relies primarily on principles from music theory, and does not require any training on a corpus of labelled audio files. A harmonic content of the musical piece is first extracted by computing a set of chroma vectors. A set of chord/key pairs is selected for every frame by correlation with fixed chord and key templates. An acyclic harmonic graph is constructed with these pairs as vertices, using a musical distance to weigh its edges. Finally, the sequences of chords and keys are obtained by finding the best path in the graph using dynamic programming. The proposed method allows a mutual chord and key estimation. It is evaluated on a corpus composed of Beatles songs for both the local key estimation and chord recognition tasks, as well as a larger corpus composed of songs taken from the Billboard dataset.
Resumo:
179 p.
Resumo:
In many micro- and nano-scale technological applications high sensitivity displacement sensors are needed, especially in ultraprecision metrology and manufacturing. In this work a new way of sensing displacement based on radio frequency resonant cavities is presented and experimentally demonstrated using a first laboratory prototype. The principle of operation of the new transducer is summarized and tested. Furthermore, an electronic interface that can be used together with the displacement transducer is designed and proved. It has been experimentally demonstrated that very high and linear sensitivity characteristic curves, in the range of some kHz/nm; are easily obtainable using this kind of transducer when it is combined with a laboratory network analyzer. In order to replace a network analyzer and provide a more affordable, self-contained, compact solution, an electronic interface has been designed, preserving as much as possible the excellent performance of the transducer, and turning it into a true standalone positioning sensor. The results obtained using the transducer together with a first prototype of the electronic interface built with cheap discrete elements show that positioning accuracies in the micrometer range are obtainable using this cost-effective solution. Better accuracies would also be attainable but using more involved and costly electronics interfaces.