950 resultados para Text to speech


Relevância:

30.00% 30.00%

Publicador:

Resumo:

Sound source localization (SSL) is an essential task in many applications involving speech capture and enhancement. As such, speaker localization with microphone arrays has received significant research attention. Nevertheless, existing SSL algorithms for small arrays still have two significant limitations: lack of range resolution, and accuracy degradation with increasing reverberation. The latter is natural and expected, given that strong reflections can have amplitudes similar to that of the direct signal, but different directions of arrival. Therefore, correctly modeling the room and compensating for the reflections should reduce the degradation due to reverberation. In this paper, we show a stronger result. If modeled correctly, early reflections can be used to provide more information about the source location than would have been available in an anechoic scenario. The modeling not only compensates for the reverberation, but also significantly increases resolution for range and elevation. Thus, we show that under certain conditions and limitations, reverberation can be used to improve SSL performance. Prior attempts to compensate for reverberation tried to model the room impulse response (RIR). However, RIRs change quickly with speaker position, and are nearly impossible to track accurately. Instead, we build a 3-D model of the room, which we use to predict early reflections, which are then incorporated into the SSL estimation. Simulation results with real and synthetic data show that even a simplistic room model is sufficient to produce significant improvements in range and elevation estimation, tasks which would be very difficult when relying only on direct path signal components.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

One of the goals of the ARC funded Eresearch project called Sharing access and analytical tools for ethnographic digital media using high speed networks, or simply EthnoER is to take outputs of normal linguistic analytical processes and present them online in a system we have called the EthnoER online presentation and annotation system, or EOPAS.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Nine individuals with complex language deficits following left-hemisphere cortical lesions and a matched control group (n 5 9) performed speeded lexical decisions on the third word of auditory word triplets containing a lexical ambiguity. The critical conditions were concordant (e.g., coin–bank–money), discordant (e.g., river–bank–money), neutral (e.g., day–bank– money), and unrelated (e.g., river–day–money). Triplets were presented with an interstimulus interval (ISI) of 100 and 1250 ms. Overall, the left-hemisphere-damaged subjects appeared able to exhaustively access meanings for lexical ambiguities rapidly, but were unable to reduce the level of activation for contextually inappropriate meanings at both short and long ISIs, unlike control subjects. These findings are consistent with a disruption of the proposed role of the left hemisphere in selecting and suppressing meanings via contextual integration and a sparing of the right-hemisphere mechanisms responsible for maintaining alternative meanings.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This report describes recent updates to the custom-built data-acquisition hardware operated by the Center for Hypersonics. In 2006, an ISA-to-USB bridging card was developed as part of Luke Hillyard's final-year thesis. This card allows the hardware to be connected to any recent personal computers via a (USB or RS232) serial port and it provides a number of simple text-based commands for control of the hardware. A graphical user interface program was also updated to help the experimenter manage the data acquisition functions. Sampled data is stored in text files that have been compressed with the gzip for mat. To simplify the later archiving or transport of the data, all files specific to a shot are stored in a single directory. This includes a text file for the run description, the signal configuration file and the individual sampled-data files, one for each signal that was recorded.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Spectral peak resolution was investigated in normal hearing (NH), hearing impaired (HI), and cochlear implant (CI) listeners. The task involved discriminating between two rippled noise stimuli in which the frequency positions of the log-spaced peaks and valleys were interchanged. The ripple spacing was varied adaptively from 0.13 to 11.31 ripples/octave, and the minimum ripple spacing at which a reversal in peak and trough positions could be detected was determined as the spectral peak resolution threshold for each listener. Spectral peak resolution was best, on average, in NH listeners, poorest in CI listeners, and intermediate for HI listeners. There was a significant relationship between spectral peak resolution and both vowel and consonant recognition in quiet across the three listener groups. The results indicate that the degree of spectral peak resolution required for accurate vowel and consonant recognition in quiet backgrounds is around 4 ripples/octave, and that spectral peak resolution poorer than around 1–2 ripples/octave may result in highly degraded speech recognition. These results suggest that efforts to improve spectral peak resolution for HI and CI users may lead to improved speech recognition

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The purpose of the present study was to examine the benefits of providing audible speech to listeners with sensorineural hearing loss when the speech is presented in a background noise. Previous studies have shown that when listeners have a severe hearing loss in the higher frequencies, providing audible speech (in a quiet background) to these higher frequencies usually results in no improvement in speech recognition. In the present experiments, speech was presented in a background of multitalker babble to listeners with various severities of hearing loss. The signal was low-pass filtered at numerous cutoff frequencies and speech recognition was measured as additional high-frequency speech information was provided to the hearing-impaired listeners. It was found in all cases, regardless of hearing loss or frequency range, that providing audible speech resulted in an increase in recognition score. The change in recognition as the cutoff frequency was increased, along with the amount of audible speech information in each condition (articulation index), was used to calculate the "efficiency" of providing audible speech. Efficiencies were positive for all degrees of hearing loss. However, the gains in recognition were small, and the maximum score obtained by an listener was low, due to the noise background. An analysis of error patterns showed that due to the limited speech audibility in a noise background, even severely impaired listeners used additional speech audibility in the high frequencies to improve their perception of the "easier" features of speech including voicing

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The purpose of this paper is to provide a cross-linguistic survey of the variation of coding strategies that are available for the grammatical distinction between direct and indirect speech representation with a particular focus on the expression of indirect reported speech. Cross-linguistic data from a sample of 42 languages will be provided to illustrate the range of available grammatical coding strategies.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Parkinson's disease (PD) is a neurodegenerative movement disorder primarily due to basal ganglia dysfunction. While much research has been conducted on Parkinsonian deficits in the traditional arena of musculoskeletal limb movement, research in other functional motor tasks is lacking. The present study examined articulation in PD with increasingly complex sequences of articulatory movement. Of interest was whether dysfunction would affect articulation in the same manner as in limb-movement impairment. In particular, since very Similar (homogeneous) articulatory sequences (the tongue twister effect) are more difficult for healthy individuals to achieve than dissimilar (heterogeneous) gestures, while the reverse may apply for skeletal movements in PD, we asked which factor would dominate when PD patients articulated various grades of artificial tongue twisters: the influence of disease or a possible difference between the two motor systems. Execution was especially impaired when articulation involved a sequence of motor program heterogeneous in terms of place of articulation. The results are suggestive of a hypokinesic tendency in complex sequential articulatory movement as in limb movement. It appears that PD patients do show abnormalities in articulatory movement which are similar to those of the musculoskeletal system. The present study suggests that an underlying disease effect modulates movement impairment across different functional motor systems. (C) 1998 Academic Press.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This work attempts to discuss, in the light of the French Analysis of the Discourse, how the concept of memory and heterogeneity in language actions can contribute to a reflection on information and documentation studies. Starting from cuttings of Clarice Lispector - the hour of the star exhibition pamphlet, accomplished in the second semester of 2007 by the Portuguese Language Museum (Luz train station, Sao Paulo), we interpreted the several voices that surround and sustain the subject and the sense.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The Jabirian Corpus refers to the K. Thahirat Al-`Iskandar, ""The Book of the Treasure of Alexander"" (hereafter BTA), as one of several forgeries suggesting that alchemical secrets were hidden in inscriptions in various places. The book was neglected until 1926, when Julius Ruska discussed it in his work on the Emerald Tablet, placing the BTA within the literature related to the development of Arabic alchemy. His preliminary study became an essential reference and encouraged many scholars to work on the BTA in the following decades. Some years ago, we completed the first translation of the BTA into a Western language. The work was based on the acephalous Escorial manuscript, which we identified as a fourteenth-century copy of the BTA. This manuscript is peculiar, as part of it is encoded. After finishing our translation, we started to establish the text of the BTA. At present, the text is in process of fixation-to be followed by textual criticism-and has been the main focus of a thorough study of ours on medieval hermeticism and alchemy. A sample of the work currently in progress is presented in this paper: an analysis of the variations between different manuscripts along with a study and English translation of its alchemical chapter.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Dysphagia is a symptom associated with an array of anatomical and functional changes which must be assessed by a multidisciplinary team to guarantee optimal evaluation and treatment, preventing potential complications. Aim: The aim of the present study is to present the combined protocol of clinical and swallowing videoendoscopy carried by ENT doctors and speech therapists in the Dysphagia Group of the ENT Department - University Hospital. Materials and Methods: Retrospective study concerning the use of a protocol made up of patient interview and clinical examination, followed by an objective evaluation with swallowing videoendoscopy. The exam was performed in 1,332 patients from May 2001 to December 2008. There were 726 (54.50%) males and 606 (45.50%) females, between 22 days and 99 years old. Results: We found: 427 (32.08%) cases of normal swallowing, 273 (20.48%) mild dysphagia, 224 (16.81%) moderate dysphagia, 373 (27.99%) severe dysphagia and 35 (2.64%) inconclusive exams. Conclusion: The combined protocol (Otolaryngology and Speech Therapy), is a good way to approach the dysphagic patient, helping to achieve early and safe deglutition diagnosis as far as disorder severity and treatment are concerned.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The characterisation of oral-motor movements and speech of patients with tetanus were investigated to determine the existence of possible signs that are characteristic of this pathology. Thirteen patients clinically diagnosed with tetanus (10 with severe tetanus and three with very severe tetanus) and admitted to an intensive care unit underwent clinical evaluation of oral-motor movements and speech. Statistical analysis indicated significant between-group differences for speech motor functions, suggesting that individuals with very severe tetanus present rigidity as a characteristic interfering in articulatory precision (P = 0 035) and movement rate (P = 0 038). For lip closure, tongue movement, palatal elevation, gag reflex and voice quality, no between-group differences were identified for the specific abnormal characteristics. The observed abnormal results indicate that muscle strength and functional status of the oral-motor system presented by most of the participants of the study did not ensure the necessary integrity for satisfactory performance. The characterisation of the oral myofunctional aspects of patients with tetanus provides medical teams, patients and families with a wider and better description of the clinical situation, giving support to the diagnosis, prognostics and treatment.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Objective: To assess, in patients undergoing glossectomy, the influence of the palatal augmentation prosthesis on the speech intelligibility and acoustic spectrographic characteristics of the formants of oral vowels in Brazilian Portuguese, specifically the first 3 formants (F1 [/a,e,u/], F2 [/o,o,u/], and F3 [/a,o/]). Design: Speech evaluation with and without a palatal augmentation prosthesis using blinded randomized listener judgments. Setting: Tertiary referral center. Patients: Thirty-six patients (33 men and 3 women) aged 30 to 80 (mean [SD], 53.9 [10.5]) years underwent glossectomy (14, total glossectomy; 12, total glossectomy and partial mandibulectomy; 6, hemiglossectomy; and 4, subtotal glossectomy) with use of the augmentation prosthesis for at least 3 months before inclusion in the study. Main Outcome Measures: Spontaneous speech intel-ligibility (assessed by expert listeners using a 4-category scale) and spectrographic formants assessment. Results: We found a statistically significant improvement of spontaneous speech intelligibility and the average number of correctly identified syllables with the use of the prosthesis (P < .05). Statistically significant differences occurred for the F1 values of the vowels /a,e,u/; for F2 values, there was a significant difference of the vowels /o,o,u/; and for F3 values, there was a significant difference of the vowels la,61 (P < .001). Conclusions: The palatal augmentation prosthesis improved the intelligibility of spontaneous speech and syllables for patients who underwent glossectomy. It also increased the F2 and F3 values for all vowels and the F I values for the vowels /o,o,u/. This effect brought the values of many vowel formants closer to normal.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Profound hearing loss is a disability that affects personality and when it involves teenagers before language acquisition, these bio-psychosocial conflicts can be exacerbated, requiring careful evaluation and choice of them for cochlear implant. Aim: To evaluate speech perception by adolescents with profound hearing loss, users of cochlear Implants. Study Design: Prospective. Materials and Methods: Twenty-five individuals with severe or profound pre-lingual hearing loss who underwent cochlear implantation during adolescence, between 10 to 17 years and 11 months, who went through speech perception tests before the implant and 2 years after device activation. For comparison and analysis we used the results from tests of four choice, recognition of vowels and recognition of sentences in a closed setting and the open environment. Results: The average percentage of correct answers in the four choice test before the implant was 46.9% and after 24 months of device use, this value went up to 86.1% in the vowels recognition test, the average difference was 45.13% to 83.13% and the sentences recognition test together in closed and open settings was 19.3% to 60.6% and 1.08% to 20.47% respectively. Conclusion: All patients, although with mixed results, achieved statistical improvement in all speech tests that were employed.