961 resultados para Robust speech recognition
Resumo:
Peer-reviewed
Resumo:
This paper describes an audio watermarking scheme based on lossy compression. The main idea is taken from an image watermarking approach where the JPEG compression algorithm is used to determine where and how the mark should be placed. Similarly, in the audio scheme suggested in this paper, an MPEG 1 Layer 3 algorithm is chosen for compression to determine the position of the mark bits and, thus, the psychoacoustic masking of the MPEG 1 Layer 3compression is implicitly used. This methodology provides with a high robustness degree against compression attacks. The suggested scheme is also shown to succeed against most of the StirMark benchmark attacks for audio.
Resumo:
In a system where tens of thousands of words are made up of a limited number of phonemes, many words are bound to sound alike. This similarity of the words in the lexicon as characterized by phonological neighbourhood density (PhND) has been shown to affect speed and accuracy of word comprehension and production. Whereas there is a consensus about the interfering nature of neighbourhood effects in comprehension, the language production literature offers a more contradictory picture with mainly facilitatory but also interfering effects reported on word production. Here we report both of these two types of effects in the same study. Multiple regression mixed models analyses were conducted on PhND effects on errors produced in a naming task by a group of 21 participants with aphasia. These participants produced more formal errors (interfering effect) for words in dense phonological neighbourhoods, but produced fewer nonwords and semantic errors (a facilitatory effect) with increasing density. In order to investigate the nature of these opposite effects of PhND, we further analysed a subset of formal errors and nonword errors by distinguishing errors differing on a single phoneme from the target (corresponding to the definition of phonological neighbours) from those differing on two or more phonemes. This analysis confirmed that only formal errors that were phonological neighbours of the target increased in dense neighbourhoods, while all other errors decreased. Based on additional observations favouring a lexical origin of these formal errors (they exceeded the probability of producing a real-word error by chance, were of a higher frequency, and preserved the grammatical category of the targets), we suggest that the interfering effect of PhND is due to competition between lexical neighbours and target words in dense neighbourhoods.
Resumo:
Typically developing (TD) preschoolers and age-matched preschoolers with specific language impairment (SLI) received event-related potentials (ERPs) to four monosyllabic speech sounds prior to treatment and, in the SLI group, after 6 months of grammatical treatment. Before treatment, the TD group processed speech sounds faster than the SLI group. The SLI group increased the speed of their speech processing after treatment. Posttreatment speed of speech processing predicted later impairment in comprehending phrase elaboration in the SLI group. During the treatment phase, change in speed of speech processing predicted growth rate of grammar in the SLI group.
Resumo:
This paper presents a Bayesian approach to the design of transmit prefiltering matrices in closed-loop schemes robust to channel estimation errors. The algorithms are derived for a multiple-input multiple-output (MIMO) orthogonal frequency division multiplexing (OFDM) system. Two different optimizationcriteria are analyzed: the minimization of the mean square error and the minimization of the bit error rate. In both cases, the transmitter design is based on the singular value decomposition (SVD) of the conditional mean of the channel response, given the channel estimate. The performance of the proposed algorithms is analyzed,and their relationship with existing algorithms is indicated. As withother previously proposed solutions, the minimum bit error rate algorithmconverges to the open-loop transmission scheme for very poor CSI estimates.
Resumo:
The problem of robust beamformer design for mobile communicationsapplications in the presence of moving co-channel sources isaddressed. A generalization of the optimum beamformer based on a statisticalmodel accounting for source movement is proposed. The new methodis easily implemented and is shown to offer dramatic improvements overconventional optimum beamforming for moving sources under a varietyof operating conditions.
Resumo:
A variety of cellular proteins has the ability to recognize DNA lesions induced by the anti-cancer drug cisplatin, with diverse consequences on their repair and on the therapeutic effectiveness of this drug. We report a novel gene involved in the cell response to cisplatin in vertebrates. The RDM1 gene (for RAD52 Motif 1) was identified while searching databases for sequences showing similarities to RAD52, a protein involved in homologous recombination and DNA double-strand break repair. Ablation of RDM1 in the chicken B cell line DT40 led to a more than 3-fold increase in sensitivity to cisplatin. However, RDM1-/- cells were not hypersensitive to DNA damages caused by ionizing radiation, UV irradiation, or the alkylating agent methylmethane sulfonate. The RDM1 protein displays a nucleic acid binding domain of the RNA recognition motif (RRM) type. By using gel-shift assays and electron microscopy, we show that purified, recombinant chicken RDM1 protein interacts with single-stranded DNA as well as double-stranded DNA, on which it assembles filament-like structures. Notably, RDM1 recognizes DNA distortions induced by cisplatin-DNA adducts in vitro. Finally, human RDM1 transcripts are abundant in the testis, suggesting a possible role during spermatogenesis.
Resumo:
Plants activate direct and indirect defenses in response to insect egg deposition. In Arabidopsis thaliana, oviposition by the butterfly Pieris brassicae triggers cellular and molecular changes that are similar to the changes caused by biotrophic pathogens. Even though this innate immune response did not affect egg survival in Arabidopsis, we could show that different insect eggs elicit specific gene expression changes. Additionally, egg- induced necrosis could be observed in a variety of plants from different families ranging from dicotyledonous plants to monocots, suggesting that insect-egg detection by plants is a widespread mechanism and that different insect species contain elicitors of immune responses. Extracts from caterpillars and eggs contain elicitors that co-purified over several extraction steps. Chemical fractionation of caterpillar extracts lead to the characterisation of an active compound that was determined to be a triglyceride by NMR analysis. The exact structure of the side chains as well as the elicitor's presence in insect eggs have yet to be confirmed.We also found that the plant defense signal salicylic acid (SA) accumulates at the site of oviposition. This is unexpected, as the SA pathway controls the defense against fungal and bacterial pathogens whereas it negatively interacts with the jasmonic acid (JA) pathway, which is crucial for the defense against herbivores. Application of P. brassicae or Spodoptera littoralis egg extract onto leaves reduced the induction of insect-responsive genes after challenge with caterpillars, suggesting that egg-derived elicitors suppress plant defense. Consequently, larval growth of the generalist herbivore S. littoralis, but not of the specialist P. brassicae, was significantly higher on plants treated with egg extract than on control plants. In contrast, suppression of gene induction and enhanced S. littoralis performance were not found in the SA-deficient mutant sid2-l, indicating that SA mediates this phenomenon. These data reveal an intriguing facet of the crosstalk between SA- and JA-signalling pathways and suggest that insects have evolved a way to suppress the induction of defense genes by laying eggs that release elicitors. Additionally, we demonstrated that mutants of known crosstalk regulators, including nprl-1, tga2356, ein2-l and wrky70-l, are not affected in egg-induced suppression of herbivore defenses. JA treatment was not able to alleviate this SA/JA negative crosstalk, suggesting that this suppression operates through a novel mechanism downstream of JA biosynthesis.
Resumo:
Multisensory processes facilitate perception of currently-presented stimuli and can likewise enhance later object recognition. Memories for objects originally encountered in a multisensory context can be more robust than those for objects encountered in an exclusively visual or auditory context [1], upturning the assumption that memory performance is best when encoding and recognition contexts remain constant [2]. Here, we used event-related potentials (ERPs) to provide the first evidence for direct links between multisensory brain activity at one point in time and subsequent object discrimination abilities. Across two experiments we found that individuals showing a benefit and those impaired during later object discrimination could be predicted by their brain responses to multisensory stimuli upon their initial encounter. These effects were observed despite the multisensory information being meaningless, task-irrelevant, and presented only once. We provide critical insights into the advantages associated with multisensory interactions; they are not limited to the processing of current stimuli, but likewise encompass the ability to determine the benefit of one's memories for object recognition in later, unisensory contexts.
Resumo:
The ability to distinguish nestmates from foreign individuals is central to the functioning of insect societies. In ants, workers from multiple-queen colonies are often less aggressive than workers from single-queen ones. In line with this observation, it has been hypothesized that workers from multiple-queen colonies have less precise recognition abilities than workers from single-queen ones because their colonies contain genetically more diverse individuals, which results in a broader template of recognition cues. Here, we assessed the impact of social structure ( queen number) variation on nestmate recognition and aggression in a large population of the socially polymorphic ant Formica selysi. We staged unilateral aggression tests on the nest surface. Workers from single-and multiple-queen colonies had good nestmate recognition ability and did not differ significantly in their level of aggression towards foreign, immobilized workers ( cue-bearers). In particular, workers from multiple-queen colonies efficiently recognized non-nestmates despite the higher genetic diversity in their colony. Cue-bearers from single- and multiple-queen colonies elicited similar reactions. However, the level of aggression was higher between than within social forms, suggesting that workers detect a signal that is specific to the colony social structure. Finally, the level of aggression was not correlated with the genetic distance between colonies. Overall, we found no evidence for the hypothesis that the presence of multiple breeders in the same colony decreases recognition abilities and found no simple relationship between genetic diversity and aggression level. (c) 2007 The Association for the Study of Animal Behaviou
Resumo:
We propose robust estimators of the generalized log-gamma distribution and, more generally, of location-shape-scale families of distributions. A (weighted) Q tau estimator minimizes a tau scale of the differences between empirical and theoretical quantiles. It is n(1/2) consistent; unfortunately, it is not asymptotically normal and, therefore, inconvenient for inference. However, it is a convenient starting point for a one-step weighted likelihood estimator, where the weights are based on a disparity measure between the model density and a kernel density estimate. The one-step weighted likelihood estimator is asymptotically normal and fully efficient under the model. It is also highly robust under outlier contamination. Supplementary materials are available online.
Resumo:
Perceiving the world visually is a basic act for humans, but for computers it is still an unsolved problem. The variability present innatural environments is an obstacle for effective computer vision. The goal of invariant object recognition is to recognise objects in a digital image despite variations in, for example, pose, lighting or occlusion. In this study, invariant object recognition is considered from the viewpoint of feature extraction. Thedifferences between local and global features are studied with emphasis on Hough transform and Gabor filtering based feature extraction. The methods are examined with respect to four capabilities: generality, invariance, stability, and efficiency. Invariant features are presented using both Hough transform and Gabor filtering. A modified Hough transform technique is also presented where the distortion tolerance is increased by incorporating local information. In addition, methods for decreasing the computational costs of the Hough transform employing parallel processing and local information are introduced.
Resumo:
Subjects with autism often show language difficulties, but it is unclear how they relate to neurophysiological anomalies of cortical speech processing. We used combined EEG and fMRI in 13 subjects with autism and 13 control participants and show that in autism, gamma and theta cortical activity do not engage synergistically in response to speech. Theta activity in left auditory cortex fails to track speech modulations, and to down-regulate gamma oscillations in the group with autism. This deficit predicts the severity of both verbal impairment and autism symptoms in the affected sample. Finally, we found that oscillation-based connectivity between auditory and other language cortices is altered in autism. These results suggest that the verbal disorder in autism could be associated with an altered balance of slow and fast auditory oscillations, and that this anomaly could compromise the mapping between sensory input and higher-level cognitive representations.