110 resultados para speech acts
Resumo:
This paper considers the separation and recognition of overlapped speech sentences assuming single-channel observation. A system based on a combination of several different techniques is proposed. The system uses a missing-feature approach for improving crosstalk/noise robustness, a Wiener filter for speech enhancement, hidden Markov models for speech reconstruction, and speaker-dependent/-independent modeling for speaker and speech recognition. We develop the system on the Speech Separation Challenge database, involving a task of separating and recognizing two mixing sentences without assuming advanced knowledge about the identity of the speakers nor about the signal-to-noise ratio. The paper is an extended version of a previous conference paper submitted for the challenge.
Resumo:
Transcription factor RUNX3 is inactivated in a number of malignancies, including breast cancer, and is suggested to function as a tumor suppressor. How RUNX3 functions as a tumor suppressor in breast cancer remains undefined. Here, we show that about 20% of female Runx3(+/-) mice spontaneously developed ductal carcinoma at an average age of 14.5 months. Additionally, RUNX3 inhibits the estrogen-dependent proliferation and transformation potential of ERa-positive MCF-7 breast cancer cells in liquid culture and in soft agar and suppresses the tumorigenicity of MCF-7 cells in severe combined immunodeficiency mice. Furthermore, RUNX3 inhibits ERa-dependent transactivation by reducing the stability of ERa. Consistent with its ability to regulate the levels of ERa, expression of RUNX3 inversely correlates with the expression of ERa in breast cancer cell lines, human breast cancer tissues and Runx3(+/-) mouse mammary tumors. By destabilizing ERa, RUNX3 acts as a novel tumor suppressor in breast cancer.
Resumo:
Three experiments measured the effects of age on informational masking of speech by competing speech. The experiments were designed to minimize the energetic contributions of the competing speech so that informational masking could be measured with no large corrections for energetic masking. Experiment 1 used a "speech-in-speech-in-noise" design, in which the competing speech was presented in noise at a signal-to-noise ratio (SNR) of -4 dB. This ensured that the noise primarily contributed the energetic masking but the competing speech contributed the informational masking. Equal amounts of informational masking (3 dB) were observed for young and elderly listeners, although less was found for hearing-impaired listeners. Experiment 2 tested a range of SNRs in this design and showed that informational masking increased with SNR up to about an SNR of -4 dB, but decreased thereafter. Experiment 3 further reduced the energetic contribution of the competing speech by filtering it into different frequency bands from the target speech. The elderly listeners again showed approximately the same amount of informational masking (4-5 dB), although some elderly listeners had particular difficulty understanding these stimuli in any condition. On the whole, these results suggest that young and elderly listeners were equally susceptible to informational masking. © 2009 Acoustical Society of America.
Resumo:
Many of the items in the “Speech, Spatial, and Qualities of Hearing” scale questionnaire [S. Gatehouse and W. Noble, Int. J. Audiol.43, 85–99 (2004)] are concerned with speech understanding in a variety of backgrounds, both speech and nonspeech. To study if this self-report data reflected informational masking, previously collected data on 414 people were analyzed. The lowest scores (greatest difficulties) were found for the two items in which there were two speech targets, with successively higher scores for competing speech (six items), energetic masking (one item), and no masking (three items). The results suggest significant masking by competing speech in everyday listening situations.
Resumo:
In this paper, I critically assess John Rawls' repeated claim that the duty of civility is only a moral duty and should not be enforced by law. In the first part of the paper, I examine and reject the view that Rawls' position may be due to the practical difficulties that the legal enforcement of the duty of civility might entail. I thus claim that Rawls' position must be driven by deeper normative reasons grounded in a conception of free speech. In the second part of the paper, I therefore examine various arguments for free speech and critically assess whether they are consistent with Rawls' political liberalism. I first focus on the arguments from truth and self-fulfilment. Both arguments, I argue, rely on comprehensive doctrines and therefore cannot provide a freestanding political justification for free speech. Freedom of speech, I claim, can be justified instead on the basis of Rawls' political conception of the person and of the two moral powers. However, Rawls' wide view of public reason already allows scope for the kind of free speech necessary for the exercise of the two moral powers and therefore cannot explain Rawls' opposition to the legal enforcement of the duty of civility. Such opposition, I claim, can only be explained on the basis of a defence of unconstrained freedom of speech grounded in the ideas of democracy and political legitimacy. Yet, I conclude, while public reason and the duty of civility are essential to political liberalism, unconstrained freedom of speech is not. Rawls and political liberals could therefore renounce unconstrained freedom of speech, and endorse the legal enforcement of the duty of civility, while remaining faithful to political liberalism.
Resumo:
A huge variety of proteins are able to form fibrillar structures(1), especially at high protein concentrations. Hence, it is surprising that spider silk proteins can be stored in a soluble form at high concentrations and transformed into extremely stable fibres on demand(2,3). Silk proteins are reminiscent of amphiphilic block copolymers containing stretches of polyalanine and glycine-rich polar elements forming a repetitive core flanked by highly conserved non-repetitive amino-terminal(4,5) and carboxy-terminal(6) domains. The N-terminal domain comprises a secretion signal, but further functions remain unassigned. The C-terminal domain was implicated in the control of solubility and fibre formation(7) initiated by changes in ionic composition(8,9) and mechanical stimuli known to align the repetitive sequence elements and promote beta-sheet formation(10-14). However, despite recent structural data(15), little is known about this remarkable behaviour in molecular detail. Here we present the solution structure of the C-terminal domain of a spider dragline silk protein and provide evidence that the structural state of this domain is essential for controlled switching between the storage and assembly forms of silk proteins. In addition, the C-terminal domain also has a role in the alignment of secondary structural features formed by the repetitive elements in the backbone of spider silk proteins, which is known to be important for the mechanical properties of the fibre.
Resumo:
This book provides a systematic introduction in the German gender equality acts for public services, and also a section per section commentary for each individual act. It analyses the legal base, limits and scope of the so called women's quota, gender mainstreaming in public employment and public policy, provisions to allow conciliation of paid work and work in families and the position of women's equality officers. It compares and analyses 16 state acts and the federal equality act. The introductory chapter, written by Dagmar Schiek, also provides an analysis of the EU level and constitutional frame for this legislation. The combination of a systematic introduction and a section by section commentary ensures that this valuable handbook can be used by trained lawyers as well as by social scientists, taking into account the fact that many equality officers are not trained lawyers.
Resumo:
The comparator account holds that processes of motor prediction contribute to the sense of agency by attenuating incoming sensory information and that disruptions to this process contribute to misattributions of agency in schizophrenia. Over the last 25 years this simple and powerful model has gained widespread support not only as it relates to bodily actions but also as an account of misattributions of agency for inner speech, potentially explaining the etiology of auditory verbal hallucination (AVH). In this paper we provide a detailed analysis of the traditional comparator account for inner speech, pointing out serious problems with the specification of inner speech on which it is based and highlighting inconsistencies in the interpretation of the electrophysiological evidence commonly cited in its favor. In light of these analyses we propose a new comparator account of misattributed inner speech. The new account follows leading models of motor imagery in proposing that inner speech is not attenuated by motor prediction, but rather derived directly from it. We describe how failures of motor prediction would therefore directly affect the phenomenology of inner speech and trigger a mismatch in the comparison between motor prediction and motor intention, contributing to abnormal feelings of agency. We argue that the new account fits with the emerging phenomenological evidence that AVHs are both distinct from ordinary inner speech and heterogeneous. Finally, we explore the possibility that the new comparator account may extend to explain disruptions across a range of imagistic modalities, and outline avenues for future research.
Combining multi-band and frequency-filtering techniques for speech recognition in noisy environments
Resumo:
While current speech recognisers give acceptable performance in carefully controlled environments, their performance degrades rapidly when they are applied in more realistic situations. Generally, the environmental noise may be classified into two classes: the wide-band noise and narrow band noise. While the multi-band model has been shown to be capable of dealing with speech corrupted by narrow-band noise, it is ineffective for wide-band noise. In this paper, we suggest a combination of the frequency-filtering technique with the probabilistic union model in the multi-band approach. The new system has been tested on the TIDIGITS database, corrupted by white noise, noise collected from a railway station, and narrow-band noise, respectively. The results have shown that this approach is capable of dealing with noise of narrow-band or wide-band characteristics, assuming no knowledge about the noisy environment.
Resumo:
This paper presents a novel method of audio-visual fusion for person identification where both the speech and facial modalities may be corrupted, and there is a lack of prior knowledge about the corruption. Furthermore, we assume there is a limited amount of training data for each modality (e.g., a short training speech segment and a single training facial image for each person). A new representation and a modified cosine similarity are introduced for combining and comparing bimodal features with limited training data as well as vastly differing data rates and feature sizes. Optimal feature selection and multicondition training are used to reduce the mismatch between training and testing, thereby making the system robust to unknown bimodal corruption. Experiments have been carried out on a bimodal data set created from the SPIDRE and AR databases with variable noise corruption of speech and occlusion in the face images. The new method has demonstrated improved recognition accuracy.