926 resultados para Expressive speech
Resumo:
In order to obtain more human like sounding humanmachine interfaces we must first be able to give them expressive capabilities in the way of emotional and stylistic features so as to closely adequate them to the intended task. If we want to replicate those features it is not enough to merely replicate the prosodic information of fundamental frequency and speaking rhythm. The proposed additional layer is the modification of the glottal model, for which we make use of the GlottHMM parameters. This paper analyzes the viability of such an approach by verifying that the expressive nuances are captured by the aforementioned features, obtaining 95% recognition rates on styled speaking and 82% on emotional speech. Then we evaluate the effect of speaker bias and recording environment on the source modeling in order to quantify possible problems when analyzing multi-speaker databases. Finally we propose a speaking styles separation for Spanish based on prosodic features and check its perceptual significance.
Resumo:
When designing human-machine interfaces it is important to consider not only the bare bones functionality but also the ease of use and accessibility it provides. When talking about voice-based inter- faces, it has been proven that imbuing expressiveness into the synthetic voices increases signi?cantly its perceived naturalness, which in the end is very helpful when building user friendly interfaces. This paper proposes an adaptation based expressiveness transplantation system capable of copying the emotions of a source speaker into any desired target speaker with just a few minutes of read speech and without requiring the record- ing of additional expressive data. This system was evaluated through a perceptual test for 3 speakers showing up to an average of 52% emotion recognition rates relative to the natural voice recognition rates, while at the same time keeping good scores in similarity and naturality.
Resumo:
This paper describes a module for the prediction of emotions in text chats in Spanish, oriented to its use in specific-domain text-to-speech systems. A general overview of the system is given, and the results of some evaluations carried out with two corpora of real chat messages are described. These results seem to indicate that this system offers a performance similar to other systems described in the literature, for a more complex task than other systems (identification of emotions and emotional intensity in the chat domain).
Resumo:
There is substantial evidence of the decreased functional capacity, especially everyday functioning, of people with psychotic disorder in clinical settings, but little research about it in the general population. The aim of the present study was to provide information on the magnitude of functional capacity problems in persons with psychotic disorder compared with the general population. It estimated the prevalence and severity of limitations in vision, mobility, everyday functioning and quality of life of persons with psychotic disorder in the Finnish population and determined the factors affecting them. This study is based on the Health 2000 Survey, which is a nationally representative survey of 8028 Finns aged 30 and older. The psychotic diagnoses of the participants were assessed in the Psychoses of Finland survey, a substudy of Health 2000. The everyday functioning of people with schizophrenia is studied widely, but one important factor, mobility has been neglected. Persons with schizophrenia and other non-affective psychotic disorders, but not affective psychoses had a significantly increased risk of having both self-reported and test-based mobility limitations as well as weak handgrip strength. Schizophrenia was associated independently with mobility limitations even after controlling for lifestyle-related factors and chronic medical conditions. Another significant factor associated with problems in everyday functioning in participants with schizophrenia was reduced visual acuity. Their vision was examined significantly less often during the five years before the visual acuity measurement than the general population. In general, persons with schizophrenia and other non-affective psychotic disorder had significantly more limitations in everyday functioning, deficits in verbal fluency and in memory than the general population. More severe negative symptoms, depression, older age, verbal memory deficits, worse expressive speech and reduced distance vision were associated with limitations in everyday functioning. Of all the psychotic disorders, schizoaffective disorder was associated with the largest losses of quality of life, and bipolar I disorder with equal or smaller losses than schizophrenia. However, the subjective loss of qualify of life associated with psychotic disorders may be smaller than objective disability, which warrants attention. Depressive symptoms were the most important determinant of poor quality of life in all psychotic disorders. In conclusion, subjects with psychotic disorders need regular somatic health monitoring. Also, health care workers should evaluate the overall quality of life and depression of subjects with psychotic disorders in order to provide them with the basic necessities of life.
Resumo:
A fim de motivar alunos dos Ensinos Fundamental e Médio a atentarem para o cunho expressivo dos fatos da língua e buscarem conhecer os grandes nomes da música nacional, este trabalho objetivou um estudo estilístico dos recursos linguístico-expressivos de textos musicais de Luiz Gonzaga do Nascimento Jr. o Gonzaguinha. Destacou-se, primeiramente, o viés social da música. Em seguida, pretendeu-se uma aproximação entre textos musicais e o conceito de poesia para, então, tratar mais cuidadosamente do papel didático da canção. Considerando a concepção de contracultura, defendida por Marilena Chauí (1990) como o contexto histórico do momento das produções musicais, partiu-se para explanações acerca da Estilística: sua história, seus âmbitos, seu alcance e possibilidades. Por fim, adentrou-se nas relações lingüísticas a partir de canções de tom ora de protesto inconformado, ora nacionalista-apaixonado do artista mencionado. Desenvolveu-se, portanto, uma abordagem estilística individual do autor a partir de apontamentos teóricos sobre aspectos sonoros, léxico-semânticos, morfossintáticos e enunciativos do processo discursivo expressivo. Com esse intento, elegeram-se composições que ilustraram as marcas lingüísticas e seu entrelaçamento com o plano do conteúdo
Resumo:
Dans ce mémoire, les contes de trois conteurs contemporains du Québec – Jos Gallant d’André Lemelin, Ti Pinge de Joujou Turenne et L’entrain à vapeur, de Fred Pellerin – font avant tout l’objet d’une lecture pragmatique afin de mieux comprendre comment le conteur, qui emploie le canevas en spectacle, transmet une fiction à un auditoire ou à un lectorat. L’étude présente d’abord une analyse comparative de chacune des prestations avec la version publiée d’un même récit et met ainsi en relief leurs points de convergence et de divergence. Selon l’hypothèse avancée, l’analyse de la prestation des conteurs qui suivent un canevas révèlerait comment s’y manifestent les dimensions performatives et les articulations du discours fictionnel. Corrélativement, l’examen des rapports entre le conteur et son public permet ensuite de s’interroger sur le statut du narrateur et de voir en quoi et comment, durant la performance, la fiction est partagée avec l’auditoire. L’analyse des énoncés performatifs, inspirés des travaux de Kerbrat-Orechionni et la dynamique de vectorisation proposée par Pavis pour l’étude de la gestuelle, des mimiques et de la voix, sont mises à contribution et visent également à dégager les outils pouvant servir à l’analyse des spectacles de contes. Au terme de cette recherche, l’auteure démontre les avantages liés au canevas, notamment en ce qui concerne les interactions qu’il favorise avec le public et dans la liberté qu’il procure, en permettant de modifier ou d’adapter le discours et les ressources expressives du conteur à chacune de ses représentations.
Resumo:
Les objectifs de ce programme de recherche étaient, d’une part, d’apporter une compréhension critique des techniques non-invasives utilisées dans la localisation et/ou la latéralisation des aires langagières et mnésiques en tenant compte de leurs avantages, de leurs limites propres ainsi que de leur pertinence dans un contexte clinique. D’autre part, d’approfondir notre compréhension de l’organisation cérébrale langagière auprès d’une population de sujets ayant une agénésie du corps calleux en utilisant un protocole de neuroimagerie. Afin de répondre à notre premier objectif, une revue critique de la littérature des méthodes de neuroimagerie utilisées pour la latéralisation et la localisation des aires cérébrales sous-tendant le traitement langagier et mnésique dans le contexte du bilan préchirurgical des patients épileptiques a été effectuée. Ce travail a permis d’identifier que certaines de ces nouvelles techniques et plus spécialement leur combinaison, montrent un potentiel réel dans ce contexte clinique. Cette recherche a également permis de mettre en lumière que ces méthodes ont encore un grand besoin d’être raffinées et standardisées avant d’être utilisées comme remplacement au test à l’amobarbital intracarotidien dans un contexte clinique sécuritaire. Afin de répondre à notre deuxième objectif, nous avons exploré les patrons de latéralisation du langage auprès de six sujets acalleux en utilisant un protocle d’imagerie par résonance magnétique fonctionnelle (IRMf). Les résultats indiquent que les individus ayant une agénésie du corps calleux montrent un patron d’activation cérébrale tout aussi latéralisé que nos deux groupes contrôles (QI apparié et QI élevé) lors du traitement du langage réceptif. Les sujets ayant une agénésie du corps calleux montrent également un patron de latéralisation comparable à leur groupe contrôle apparié pour le QI pour la tâche de langage expressif. Lorsque l’on compare les sujets ayant une agénésie du corps calleux au groupe contrôle de QI élevé, ces derniers montrent une latéralisation moins marquée uniquement pour la région frontale lors de la tâche de langage expressif. En conclusion, les résultats de cette étude ne supportent pas l’affirmation que le corps calleux jouerait un rôle inhibiteur essentiel afin de permettre un développement normal de la latéralisation hémisphérique pour le langage.
Resumo:
The significance of the body in electronic music parties as a sign for communicating and socializing among participants is the focus of this work. Qualitative research undertaken in this study seeks to investigate how sociability happens at raves and nightclubs in Natal/RN. Sociability is understood here as a play expression involving the dimensions of music, dance and party; the body, seen from a transdisciplinary approach, is understood as a symbolic instance, with its own meanings, as a result and a producer of social and as a cross between the cultural and the biological. The body has a communicative potential, is primary media. An intersection point between nature and culture, it serves as the seat of emotions and sociability, since it is through it that social relations are made. In electronic music parties, the body is interpreted based on its communication signs: clothing, accessories, body movements, tactile contact, body language, interactions between the public and dj, the dj and the public, gestures, expressive speech of emotions. Through such signs, body communication and a sense of community among participants develop sociability in the festive place and change the mood of the dancers. The Natal s electronic music parties young goer interacts on parties, adopts cheerful and receptive positions towards the other, maintains physical contact, values dance as a form of communication and lists happiness as the main feeling aroused in electronic music festivals. To achieve this result, a plurimetodological approach was used, which consisted of various methodological devices and various techniques of investigation: ethnographic observation, individual and informal interview techniques, photographic record of the scene, in-depth interview and application thirty questionnaires to patrons of electronic music parties
Resumo:
A fala apresenta aspectos paralinguísticos que não pertencem ao código linguístico convencional, mas contribuem significativamente para a unidade temática do discurso, Essas realizações se constituem em enunciados não-lexicalizados que funcionam que funcionam como atos de fala completos nas interações comunicativas interpessoais. Sobre essas emissões não-verbais, Campbell (2002a, 2002b, 2003 e 2004), Maekawa (2004), Fujie et. al (2004), Hoult (2004), Key (1958) apud Steimberg (1988) postulam que elas constribuem para a manifestação da fala expressiva. Para os autores, é justamente o fenômeno da paralinguagem que sinaliza informações sobre atitudes, opiniões e emoções do falante em relação ao interlocutor ou ao tópico discursivo. Nesse sentido, investigamos, neste trabalho, as manifestações paralinguísticas recorrentes em conversas informais para demonstrarmos seu papel expressivo na linguagem falada. Para tanto, fizemos um levantamento de 450 ocorrências de elementos paralinguísticos no processo de transcrição de amostras de falas do Português Regional Paraense produzidas em situações reais de conversação. Pressupondo que essas realizações não-verbais são caracterizadas por variações prosódicas, nós as submetemos a uma análise fonética por meio do software PRAAT. A partir dessa análise, constatamos a contribuição de duas propriedades: a frequência fundamental (F0) e o tempo de emissão, para a manifestação expressiva dos elementos paralinguísticos no discurso falado. Além disso, identificamos também a silabação como uma propriedade comum às realizações sonoras focalizadas. Após o processo de análise, fizemos a descrição do uso e do funcionamento desses elementos nas conversas, bem como da contribuição deles para a manifestação da fala expressiva. Os resultados nos mostram que os elementos paralinguísticos, além de contribuírem para a fluência do discurso falado, desempenham a função de sinalizar compreensão, interesse e/ou atenção, gerenciar relações interpessoais e expressar emoções, atitudes e afeto.
Resumo:
This essay asks whether there is a relation between action-serving and meaning-serving intentions. The idea that the intentions involved in meaning and action are nominally designated alike as intentionalities does not guarantee any special logical or conceptual connections between the intentionality of referential thoughts and thought-expressive speech acts with the intentionality of doing. The latter category is typified by overt physical actions in order to communicate by engaging in speech acts, but also includes at the origin of all artistic and symbolic expression such cerebral and linguistic doings as thinking propositional thoughts. There are exactly four possibilities by which meaning and action intentionalities might be related to be systematically investigated. Meaning-serving and action-serving intentionalities, topologically speaking, might exclude one another, partially overlap with one another, or subsume one in the other or the other in the one. The theoretical separation of the two ostensible categories of intendings is criticized, as is their partial overlap, in light of the proposal that thinking and artistic and symbolic expression are activities that favor the inclusion of paradigm meaning-serving intentions as among a larger domain of action-serving intentions. The only remaining alternative is then developed, of including action-serving intentions reductively in meaning-serving intentions, and is defended as offering in an unexpected way the most cogent universal reductive ontology in which the intentionality of doing generally relates to the specific intentionality of referring in thought to the objects of predications, and of its artistic and symbolic expression.
Resumo:
This demo concerns a recently developed prototype of an emotionally-sensitive autonomous HiFi Spoken Conversa- tional Agent, called NEMOHIFI. The baseline agent was developed by the Speech Technology Group (GTH) and has recently been integrated with an emotional engine called NEMO (Need-inspired Emotional Model) to enable it to adapt to users emotion and respond to the users using ap- propriate expressive speech. NEMOHIFI controls and man- ages the HiFi audio system, and for end users, its functions equate a remote control, except that instead of clicking, the user interacts with the agent using voice. A pairwise com- parison between the baseline (non-adaptive) and NEMO- HIFI is also presented.
Resumo:
This paper presents a complete system for expressive visual text-to-speech (VTTS), which is capable of producing expressive output, in the form of a 'talking head', given an input text and a set of continuous expression weights. The face is modeled using an active appearance model (AAM), and several extensions are proposed which make it more applicable to the task of VTTS. The model allows for normalization with respect to both pose and blink state which significantly reduces artifacts in the resulting synthesized sequences. We demonstrate quantitative improvements in terms of reconstruction error over a million frames, as well as in large-scale user studies, comparing the output of different systems. © 2013 IEEE.
Resumo:
The progress of a nationally representative sample of 3632 children was followed from early childhood through to primary school, using data from the Longitudinal Study of Australian Children (LSAC). The aim was to examine the predictive effects of different aspects of communicative ability, and of early vs. sustained identification of speech and language impairment, on children's achievement and adjustment at school. Four indicators identified speech and language impairment: parent-rated expressive language concern; parent-rated receptive language concern; use of speech-language pathology services; below average scores on the adapted Peabody Picture Vocabulary Test-III. School outcomes were assessed by teachers' ratings of language/literacy ability, numeracy/mathematical thinking and approaches to learning. Comparison of group differences, using ANOVA, provided clear evidence that children who were identified as having speech and language impairment in their early childhood years did not perform as well at school, two years later, as their non-impaired peers on all three outcomes: Language and Literacy, Mathematical Thinking, and Approaches to Learning. The effects of early speech and language status on literacy, numeracy, and approaches to learning outcomes were similar in magnitude to the effect of family socio-economic factors, after controlling for child characteristics. Additionally, early identification of speech and language impairment (at age 4-5) was found to be a better predictor of school outcomes than sustained identification (at aged 4-5 and 6-7 years). Parent-reports of speech and language impairment in early childhood are useful in foreshadowing later difficulties with school and providing early intervention and targeted support from speech-language pathologists and specialist teachers.
Resumo:
This article examines what is wrong with some expressive acts, ‘insults’. Their putative wrongfulness is distinguished from the causing of indirect harms, aggregated harms, contextual harms, and damaging misrepresentations. The article clarifies what insults are, making use of work by Neu and Austin, and argues that their wrongfulness cannot lie in the hurt that is caused to those at whom such acts are directed. Rather it must lie in what they seek to do, namely to denigrate the other. The causing of offence is at most evidence that an insult has been communicated; it is not independent grounds of proscription or constraint. The victim of an insult may know that she has been insulted but not accept or agree with the insult, and thereby submit to the insulter. Hence insults need not, as Waldron argues they do, occasion dignitary harms. They do not of themselves subvert their victims' equal moral status. The claim that hateful speech endorses inequality should not be conflated with a claim that such speech directly subverts equality.
Thus, ‘wounding words’ should not unduly trouble the liberal defender of free speech either on the grounds of preventing offence or on those of avoiding dignitary harms.
Resumo:
Background: In Portugal, the routine clinical practice of speech and language therapists (SLTs) in treating children with all types of speech sound disorder (SSD) continues to be articulation therapy (AT). There is limited use of phonological therapy (PT) or phonological awareness training in Portugal. Additionally, at an international level there is a focus on collecting information on and differentiating between the effectiveness of PT and AT for children with different types of phonologically based SSD, as well as on the role of phonological awareness in remediating SSD. It is important to collect more evidence for the most effective and efficient type of intervention approach for different SSDs and for these data to be collected from diverse linguistic and cultural perspectives. Aims: To evaluate the effectiveness of a PT and AT approach for treatment of 14 Portuguese children, aged 4.0–6.7 years, with a phonologically based SSD. Methods & Procedures: The children were randomly assigned to one of the two treatment approaches (seven children in each group). All children were treated by the same SLT, blind to the aims of the study, over three blocks of a total of 25 weekly sessions of intervention. Outcome measures of phonological ability (percentage of consonants correct (PCC), percentage occurrence of different phonological processes and phonetic inventory) were taken before and after intervention. A qualitative assessment of intervention effectiveness from the perspective of the parents of participants was included. Outcomes & Results: Both treatments were effective in improving the participants’ speech, with the children receiving PT showing a more significant improvement in PCC score than those receiving the AT. Children in the PT group also showed greater generalization to untreated words than those receiving AT. Parents reported both intervention approaches to be as effective in improving their children’s speech. Conclusions & Implications: The PT (combination of expressive phonological tasks, phonological awareness, listening and discrimination activities) proved to be an effective integrated method of improving phonological SSD in children. These findings provide some evidence for Portuguese SLTs to employ PT with children with phonologically based SSD