950 resultados para Text to speech


Relevância:

90.00% 90.00%

Publicador:

Resumo:

The precise timing of the emergence of language in human prehistory cannot be resolved. But the available evidence is sufficient to constrain it to some degree. This is a review and synthesis of the available evidence, leading to the conclusion that the time when speech in some form became important for our ancestors can be constrained to be not less than 400,000 years ago, thus excluding several popular theories involving a late transition to speech.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Hymenoptera exhibit an incredible diversity of phenotypes, the result of similar to 240 million years of evolution and the primary subject of more than 250 years of research. Here we describe the history, development, and utility of the Hymenoptera Anatomy Ontology (HAO) and its associated applications. These resources are designed to facilitate accessible and extensible research on hymenopteran phenotypes. Outreach with the hymenopterist community is of utmost importance to the HAO project, and this paper is a direct response to questions that arose from project workshops. In a concerted attempt to surmount barriers of understanding, especially regarding the format, utility, and development of the HAO, we discuss the roles of homology, "preferred terms", and "structural equivalency". We also outline the use of Universal Resource Identifiers (URIs) and posit that they are a key element necessary for increasing the objectivity and repeatability of science that references hymenopteran anatomy. Pragmatically, we detail a mechanism (the "URI table") by which authors can use URIs to link their published text to the HAO, and we describe an associated tool (the "Analyzer") to derive these tables. These tools, and others, are available through the HAO Portal website (http://portal.hymao.org). We conclude by discussing the future of the HAO with respect to digital publication, cross-taxon ontology alignment, the advent of semantic phenotypes, and community-based curation.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Objective: To characterize the PI component of long latency auditory evoked potentials (LLAEPs) in cochlear implant users with auditory neuropathy spectrum disorder (ANSD) and determine firstly whether they correlate with speech perception performance and secondly whether they correlate with other variables related to cochlear implant use. Methods: This study was conducted at the Center for Audiological Research at the University of Sao Paulo. The sample included 14 pediatric (4-11 years of age) cochlear implant users with ANSD, of both sexes, with profound prelingual hearing loss. Patients with hypoplasia or agenesis of the auditory nerve were excluded from the study. LLAEPs produced in response to speech stimuli were recorded using a Smart EP USB Jr. system. The subjects' speech perception was evaluated using tests 5 and 6 of the Glendonald Auditory Screening Procedure (GASP). Results: The P-1 component was detected in 12/14 (85.7%) children with ANSD. Latency of the P-1 component correlated with duration of sensorial hearing deprivation (*p = 0.007, r = 0.7278), but not with duration of cochlear implant use. An analysis of groups assigned according to GASP performance (k-means clustering) revealed that aspects of prior central auditory system development reflected in the P-1 component are related to behavioral auditory skills. Conclusions: In children with ANSD using cochlear implants, the P-1 component can serve as a marker of central auditory cortical development and a predictor of the implanted child's speech perception performance. (c) 2012 Elsevier Ireland Ltd. All rights reserved.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Introduction: In recent years, the benefits associated with the use of cochlear implants (CIs), especially with regard to speech perception, have proven to surpass those produced by the use of hearing aids, making CIs a highly efficient resource for patients with severe/profound hearing loss. However, few studies so far have assessed the satisfaction of adult users of CIs. Objective: To analyze the relationship between the level of speech perception and degree of satisfaction of adult users of CI. Method: This was a prospective cross-sectional study conducted in the Audiological Research Center (CPA) of the Hospital of Craniofacial Anomalies, University of São Paulo (HRAC/USP), in Bauru, São Paulo, Brazil. A total of 12 users of CIs with pre-lingual or post-lingual hearing loss participated in this study. The following tools were used in the assessment: a questionnaire, "Satisfaction with Amplification in Daily Life" (SADL), culturally adapted to Brazilian Portuguese, as well as its relationship with the speech perception results; a speech perception test under quiet conditions; and the Hearing in Noise Test (HINT)Brazil under free field conditions. Results: The participants in the study were on the whole satisfied with their devices, and the degree of satisfaction correlated positively with the ability to perceive monosyllabic words under quiet conditions. The satisfaction did not correlate with the level of speech perception in noisy environments. Conclusion: Assessments of satisfaction may help professionals to predict what other factors, in addition to speech perception, may contribute to the satisfaction of CI users in order to reorganize the intervention process to improve the users' quality of life.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Computer-assisted translation (or computer-aided translation or CAT) is a form of language translation in which a human translator uses computer software in order to facilitate the translation process. Machine translation (MT) is the automated process by which a computerized system produces a translated text or speech from one natural language to another. Both of them are leading and promising technologies in the translation industry; it therefore seems important that translation students and professional translators become familiar with this relatively new types of technology. Whether used together, not only might these two different types of systems reduce translation time, but also lead to a further improvement in the field of translation technologies. The dissertation consists of four chapters. The first one surveys the chronological development of MT and CAT tools, the emergence of pre-editing, post-editing and controlled language and the very last frontiers in this sector. The second one provide a general overview on the four main CAT tools that are used nowadays and tested hereto. The third chapter is dedicated to the experimentations that have been conducted in order to analyze and evaluate the performance of the four integrated systems that are the core subject of this dissertation. Finally, the fourth chapter deals with the issue of terminological equivalence in interlinguistic translation. The purpose of this dissertation is not to provide an objective and definitive solution to the complex issues that arise at any time in the field of translation technologies, this aim being well away from being achieved, but to supply information about the limits and potentiality that are typical of those instruments which are now essential to any professional translator.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Telephone communication is a challenge for many hearing-impaired individuals. One important technical reason for this difficulty is the restricted frequency range (0.3-3.4 kHz) of conventional landline telephones. Internet telephony (voice over Internet protocol [VoIP]) is transmitted with a larger frequency range (0.1-8 kHz) and therefore includes more frequencies relevant to speech perception. According to a recently published, laboratory-based study, the theoretical advantage of ideal VoIP conditions over conventional telephone quality has translated into improved speech perception by hearing-impaired individuals. However, the speech perception benefits of nonideal VoIP network conditions, which may occur in daily life, have not been explored. VoIP use cannot be recommended to hearing-impaired individuals before its potential under more realistic conditions has been examined.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Open-ended interviews of 90 min length of 38 patients were analyzed with respect to speech stylistics, shown by Schucker and Jacobs to differentiate individuals with type A personality features from those with type B. In our patients, Type A/B had been assessed by the Bortner Personality Inventory. The stylistics studied were: repeated words swallowed words, interruptions, simultaneous speech, silence latency (between question and answer) (SL), speed of speech, uneven speed of speech (USS), explosive words (PW), uneven speech volume (USV), and speech volume. Correlations between both raters for all speech categories were high. Positive correlations between extent of type A and SL (r = 0.33; p = 0.022), USS (r = 0.51; p = 0.002), PW (r = 0.46; p = 0.003) and USV (r = 0.39; p = 0.012) were found. Our results indicate that the speech in nonstress open-ended interviews of type A individuals tends to show a higher emotional tension (positive correlations for USS PW and USV) and is more controlled in conversation (positive correlation for SL).

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Audio-visual documents obtained from German TV news are classified according to the IPTC topic categorization scheme. To this end usual text classification techniques are adapted to speech, video, and non-speech audio. For each of the three modalities word analogues are generated: sequences of syllables for speech, “video words” based on low level color features (color moments, color correlogram and color wavelet), and “audio words” based on low-level spectral features (spectral envelope and spectral flatness) for non-speech audio. Such audio and video words provide a means to represent the different modalities in a uniform way. The frequencies of the word analogues represent audio-visual documents: the standard bag-of-words approach. Support vector machines are used for supervised classification in a 1 vs. n setting. Classification based on speech outperforms all other single modalities. Combining speech with non-speech audio improves classification. Classification is further improved by supplementing speech and non-speech audio with video words. Optimal F-scores range between 62% and 94% corresponding to 50% - 84% above chance. The optimal combination of modalities depends on the category to be recognized. The construction of audio and video words from low-level features provide a good basis for the integration of speech, non-speech audio and video.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

OBJECTIVE To evaluate the speech intelligibility in noise with a new cochlear implant (CI) processor that uses a pinna effect imitating directional microphone system. STUDY DESIGN Prospective experimental study. SETTING Tertiary referral center. PATIENTS Ten experienced, unilateral CI recipients with bilateral severe-to-profound hearing loss. INTERVENTION All participants performed speech in noise tests with the Opus 2 processor (omnidirectional microphone mode only) and the newer Sonnet processor (omnidirectional and directional microphone mode). MAIN OUTCOME MEASURE The speech reception threshold (SRT) in noise was measured in four spatial settings. The test sentences were always presented from the front. The noise was arriving either from the front (S0N0), the ipsilateral side of the CI (S0NIL), the contralateral side of the CI (S0NCL), or the back (S0N180). RESULTS The directional mode improved the SRTs by 3.6 dB (p < 0.01), 2.2 dB (p < 0.01), and 1.3 dB (p < 0.05) in the S0N180, S0NIL, and S0NCL situations, when compared with the Sonnet in the omnidirectional mode. There was no statistically significant difference in the S0N0 situation. No differences between the Opus 2 and the Sonnet in the omnidirectional mode were observed. CONCLUSION Speech intelligibility with the Sonnet system was statistically different to speech recognition with the Opus 2 system suggesting that CI users might profit from the pinna effect imitating directionality mode in noisy environments.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Este trabalho teve como objetivo os enunciados verbais e/ou não-verbais impressos em capas de livros - escritos por autores brasileiros e adaptados para o cinema ou para a televisão - que associam o livro à produção cinematográfica ou televisiva. Seu objetivo foi verificar se tais enunciados poderiam ou não ser classificados como paratexto - conforme é conceituado por Gerard Genette. A motivação para esta pesquisa surgiu pela constatação de que, em sendo aqueles enunciados construídos a partir de uma obra derivada de um livro, em que medida eles poderiam estar a serviço do texto principal? Para responder a essa questão, os enunciados foram analisados segundo os conceitos da análise do discurso e da análise retórica. Os resultados obtidos na análise permitiram concluir que alguns enunciados não se configuram como paratextuais e, com base nos conceitos da Teoria Crítica, possibilitam compreender, criticamente, os procedimentos editoriais com que o livro se relaciona com os demais produtos midiáticos.(AU)

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Este trabalho teve como objetivo os enunciados verbais e/ou não-verbais impressos em capas de livros - escritos por autores brasileiros e adaptados para o cinema ou para a televisão - que associam o livro à produção cinematográfica ou televisiva. Seu objetivo foi verificar se tais enunciados poderiam ou não ser classificados como paratexto - conforme é conceituado por Gerard Genette. A motivação para esta pesquisa surgiu pela constatação de que, em sendo aqueles enunciados construídos a partir de uma obra derivada de um livro, em que medida eles poderiam estar a serviço do texto principal? Para responder a essa questão, os enunciados foram analisados segundo os conceitos da análise do discurso e da análise retórica. Os resultados obtidos na análise permitiram concluir que alguns enunciados não se configuram como paratextuais e, com base nos conceitos da Teoria Crítica, possibilitam compreender, criticamente, os procedimentos editoriais com que o livro se relaciona com os demais produtos midiáticos.(AU)

Relevância:

90.00% 90.00%

Publicador:

Resumo:

In behavior reminiscent of the responsiveness of human infants to speech, young songbirds innately recognize and prefer to learn the songs of their own species. The acoustic and physiological bases for innate recognition were investigated in fledgling white-crowned sparrows lacking song experience. A behavioral test revealed that the complete conspecific song was not essential for innate recognition: songs composed of single white-crowned sparrow phrases and songs played in reverse elicited vocal responses as strongly as did normal song. In all cases, these responses surpassed those to other species’ songs. Although auditory neurons in the song nucleus HVc and the underlying neostriatum of fledglings did not prefer conspecific song over foreign song, some neurons responded strongly to particular phrase types characteristic of white-crowned sparrows and, thus, could contribute to innate song recognition.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

The pars triangular is a portion of Broca's area. The convolutions that form the inferior and caudal extent of the pars triangularis include the anterior horizontal and anterior ascending rami of the sylvian fissure, respectively. To learn if there are anatomic asymmetries of the pars triangularis, these convolutions were measured on volumetric magnetic resonance imaging scans of 11 patients who had undergone selective hemispheric anesthesia (Wada testing) to determine hemispheric speech and language lateralization. Of the 10 patients with language lateralized to the left hemisphere, 9 had a leftward asymmetry of the pars triangularis. The 1 patient with language lateralized to the right hemisphere had a significant rightward asymmetry of the pars triangularis. Our data suggest that asymmetries of the pars triangularis may be related to speech-language lateralization.