8 resultados para voice activity detection
em Consorci de Serveis Universitaris de Catalunya (CSUC), Spain
Resumo:
In this paper we explore the use of non-linear transformations in order to improve the performance of an entropy based voice activity detector (VAD). The idea of using a non-linear transformation comes from some previous work done in speech linear prediction (LPC) field based in source separation techniques, where the score function was added into the classical equations in order to take into account the real distribution of the signal. We explore the possibility of estimating the entropy of frames after calculating its score function, instead of using original frames. We observe that if signal is clean, estimated entropy is essentially the same; but if signal is noisy transformed frames (with score function) are able to give different entropy if the frame is voiced against unvoiced ones. Experimental results show that this fact permits to detect voice activity under high noise, where simple entropy method fails.
Resumo:
This paper deals with non-linear transformations for improving the performance of an entropy-based voice activity detector (VAD). The idea to use a non-linear transformation has already been applied in the field of speech linear prediction, or linear predictive coding (LPC), based on source separation techniques, where a score function is added to classical equations in order to take into account the true distribution of the signal. We explore the possibility of estimating the entropy of frames after calculating its score function, instead of using original frames. We observe that if the signal is clean, the estimated entropy is essentially the same; if the signal is noisy, however, the frames transformed using the score function may give entropy that is different in voiced frames as compared to nonvoiced ones. Experimental evidence is given to show that this fact enables voice activity detection under high noise, where the simple entropy method fails.
Resumo:
tThis paper deals with the potential and limitations of using voice and speech processing to detect Obstruc-tive Sleep Apnea (OSA). An extensive body of voice features has been extracted from patients whopresent various degrees of OSA as well as healthy controls. We analyse the utility of a reduced set offeatures for detecting OSA. We apply various feature selection and reduction schemes (statistical rank-ing, Genetic Algorithms, PCA, LDA) and compare various classifiers (Bayesian Classifiers, kNN, SupportVector Machines, neural networks, Adaboost). S-fold crossvalidation performed on 248 subjects showsthat in the extreme cases (that is, 127 controls and 121 patients with severe OSA) voice alone is able todiscriminate quite well between the presence and absence of OSA. However, this is not the case withmild OSA and healthy snoring patients where voice seems to play a secondary role. We found that thebest classification schemes are achieved using a Genetic Algorithm for feature selection/reduction.
Resumo:
The mismatch negativity is an electrophysiological marker of auditory change detection in the event-related brain potential and has been proposed to reflect an automatic comparison process between an incoming stimulus and the representation of prior items in a sequence. There is evidence for two main functional subcomponents comprising the MMN, generated by temporal and frontal brain areas, respectively. Using data obtained in an MMN paradigm, we performed time-frequency analysis to reveal the changes in oscillatory neural activity in the theta band. The results suggest that the frontal component of the MMN is brought about by an increase in theta power for the deviant trials and, possibly, by an additional contribution of theta phase alignment. By contrast, the temporal component of the MMN, best seen in recordings from mastoid electrodes, is generated by phase resetting of theta rhythm with no concomitant power modulation. Thus, frontal and temporal MMN components do not only differ with regard to their functional significance but also appear to be generated by distinct neurophysiological mechanisms.
Resumo:
This work presents the functional characterisation of a protein phosphatase 2A (PP2A) catalytic subunit obtained by genetic engineering and its conjugation to magnetic particles (MPs) via metal coordination chemistry for the subsequent development of assays for diarrheic lipophilic marine toxins. Colorimetric assays with free enzyme have allowed the determination of the best enzyme activity stabiliser, which is glycerol at 10%. They have also demonstrated that the recombinant enzyme can be as sensitive towards okadaic acid (OA) (LOD=2.3μg/L) and dinophysistoxin-1 (DTX-1) (LOD=15.2μg/L) as a commercial PP2A and, moreover, it has a higher operational stability, which makes possible to perform the protein phosphatase inhibition assay (PPIA) with a lower enzyme amount. Once conjugated to MPs, the PP2A catalytic subunit still retains its enzyme activity and it can also be inhibited by OA (LOD=30.1μg/L).
Resumo:
The theoretical context of this study is related with the observational methodology in the context of group games and sports studies, specifically Handball. Thus, this study intends to analyze the performance of the pivot player in the World Cup 2007 - Germany, European 2008 - Norway 2008 and China OG 2008 in a qualitative dimension. Our purpose was to get as much information as possible about the whole activity of the pivot player, by identifying sequential patterns of behaviour or conduct of the player/game, by using the sequential analysis. The observation instrument used to meet the main purpose of this work consists of a combination of format fields (FF) and systems of categories (SC). The codifications undertaken occurred in several handball games. Using this instrument we have shown that it provides support for the purposes for which it was developed, allowing more research into the offensive process of handball. Besides this, it makes possible the analysis of aspects of the game through perspective and contextual sequences, which we consider to be more accurate, to fit the "reality" of a game such as handball.
Resumo:
Using event-related brain potentials, the time course of error detection and correction was studied in healthy human subjects. A feedforward model of error correction was used to predict the timing properties of the error and corrective movements. Analysis of the multichannel recordings focused on (1) the error-related negativity (ERN) seen immediately after errors in response- and stimulus-locked averages and (2) on the lateralized readiness potential (LRP) reflecting motor preparation. Comparison of the onset and time course of the ERN and LRP components showed that the signs of corrective activity preceded the ERN. Thus, error correction was implemented before or at least in parallel with the appearance of the ERN component. Also, the amplitude of the ERN component was increased for errors, followed by fast corrective movements. The results are compatible with recent views considering the ERN component as the output of an evaluative system engaged in monitoring motor conflict.
Resumo:
This paper addresses some of the challenges inherent in finding and showing a gendered voice in translation. The starting point is my own experience as a feminist translator of both feminist and non-feminist texts. Textual practices like translating necessarily interact with current theoretical debates. In turn, theoretical writing on feminism enriches and informs one’s translating activity. This interplay between theoretical models and textual practices was particularly made evident to me as I rendered Essentially speaking, by Diana Fuss, into Catalan. In this article I intend to transcend anecdotes of translating individual texts and consider how translating equals rewriting oneself; it involves rethinking writing practices. I will specifically address the rethinking of (1) one’s identity when translating ‘like’ a feminist, (2) performativity in gender and in translation, and (3) agency and (In)visibility.