938 resultados para Speech and voice functions
Resumo:
In this paper we present the design and analysis of an intonation model for text-to-speech (TTS) synthesis applications using a combination of Relational Tree (RT) and Fuzzy Logic (FL) technologies. The model is demonstrated using the Standard Yorùbá (SY) language. In the proposed intonation model, phonological information extracted from text is converted into an RT. RT is a sophisticated data structure that represents the peaks and valleys as well as the spatial structure of a waveform symbolically in the form of trees. An initial approximation to the RT, called Skeletal Tree (ST), is first generated algorithmically. The exact numerical values of the peaks and valleys on the ST is then computed using FL. Quantitative analysis of the result gives RMSE of 0.56 and 0.71 for peak and valley respectively. Mean Opinion Scores (MOS) of 9.5 and 6.8, on a scale of 1 - -10, was obtained for intelligibility and naturalness respectively.
Resumo:
∗ The work is partially supported by NSFR Grant No MM 409/94.
Resumo:
One of the overarching questions in the field of infant perceptual and cognitive development concerns how selective attention is organized during early development to facilitate learning. The following study examined how infants' selective attention to properties of social events (i.e., prosody of speech and facial identity) changes in real time as a function of intersensory redundancy (redundant audiovisual, nonredundant unimodal visual) and exploratory time. Intersensory redundancy refers to the spatially coordinated and temporally synchronous occurrence of information across multiple senses. Real time macro- and micro-structural change in infants' scanning patterns of dynamic faces was also examined. ^ According to the Intersensory Redundancy Hypothesis, information presented redundantly and in temporal synchrony across two or more senses recruits infants' selective attention and facilitates perceptual learning of highly salient amodal properties (properties that can be perceived across several sensory modalities such as the prosody of speech) at the expense of less salient modality specific properties. Conversely, information presented to only one sense facilitates infants' learning of modality specific properties (properties that are specific to a particular sensory modality such as facial features) at the expense of amodal properties (Bahrick & Lickliter, 2000, 2002). ^ Infants' selective attention and discrimination of prosody of speech and facial configuration was assessed in a modified visual paired comparison paradigm. In redundant audiovisual stimulation, it was predicted infants would show discrimination of prosody of speech in the early phases of exploration and facial configuration in the later phases of exploration. Conversely, in nonredundant unimodal visual stimulation, it was predicted infants would show discrimination of facial identity in the early phases of exploration and prosody of speech in the later phases of exploration. Results provided support for the first prediction and indicated that following redundant audiovisual exposure, infants showed discrimination of prosody of speech earlier in processing time than discrimination of facial identity. Data from the nonredundant unimodal visual condition provided partial support for the second prediction and indicated that infants showed discrimination of facial identity, but not prosody of speech. The dissertation study contributes to the understanding of the nature of infants' selective attention and processing of social events across exploratory time.^
Resumo:
The purpose of this paper is to draw on research that discusses the relationship between interest and metacognitive functions and its effect on engaging students in the writing process. Results indicate students who are interested in their writing activities engage in metacognitive strategies, remain focused, and complete their tasks.
Resumo:
How do infants learn word meanings? Research has established the impact of both parent and child behaviors on vocabulary development, however the processes and mechanisms underlying these relationships are still not fully understood. Much existing literature focuses on direct paths to word learning, demonstrating that parent speech and child gesture use are powerful predictors of later vocabulary. However, an additional body of research indicates that these relationships don’t always replicate, particularly when assessed in different populations, contexts, or developmental periods.
The current study examines the relationships between infant gesture, parent speech, and infant vocabulary over the course of the second year (10-22 months of age). Through the use of detailed coding of dyadic mother-child play interactions and a combination of quantitative and qualitative data analytic methods, the process of communicative development was explored. Findings reveal non-linear patterns of growth in both parent speech content and child gesture use. Analyses of contingency in dyadic interactions reveal that children are active contributors to communicative engagement through their use of gestures, shaping the type of input they receive from parents, which in turn influences child vocabulary acquisition. Recommendations for future studies and the use of nuanced methodologies to assess changes in the dynamic system of dyadic communication are discussed.
Resumo:
This study investigates the Spanish indefinite pronoun uno (“one”). After a detailed analysis of its occurrences in authentic language, we find that its interpretation varies depending on the linguistic context. Therefore, we examine which elements of the context - we focus on the broader context, beyond the sentence – have an impact on its interpretation and develop a typology of the indefinite pronoun as to its interpretation. The pronoun may be interpreted as completely generic or specific (referring to the speaker, the listener or a third person). Its interpretation can also be located in an intermediate position between these interpretive extremes.In addition, we compare its use in various discursive genres - spontaneous conversations, academic essays and web forum - which are distinguished by the presence or absence of interactivity and of more or less subjectivity / intersubjectivity. The comparison shows that pronoun use depends on these characteristics.
Resumo:
Here we use two filtered speech tasks to investigate children’s processing of slow (<4 Hz) versus faster (∼33 Hz) temporal modulations in speech. We compare groups of children with either developmental dyslexia (Experiment 1) or speech and language impairments (SLIs, Experiment 2) to groups of typically-developing (TD) children age-matched to each disorder group. Ten nursery rhymes were filtered so that their modulation frequencies were either low-pass filtered (<4 Hz) or band-pass filtered (22 – 40 Hz). Recognition of the filtered nursery rhymes was tested in a picture recognition multiple choice paradigm. Children with dyslexia aged 10 years showed equivalent recognition overall to TD controls for both the low-pass and band-pass filtered stimuli, but showed significantly impaired acoustic learning during the experiment from low-pass filtered targets. Children with oral SLIs aged 9 years showed significantly poorer recognition of band pass filtered targets compared to their TD controls, and showed comparable acoustic learning effects to TD children during the experiment. The SLI samples were also divided into children with and without phonological difficulties. The children with both SLI and phonological difficulties were impaired in recognizing both kinds of filtered speech. These data are suggestive of impaired temporal sampling of the speech signal at different modulation rates by children with different kinds of developmental language disorder. Both SLI and dyslexic samples showed impaired discrimination of amplitude rise times. Implications of these findings for a temporal sampling framework for understanding developmental language disorders are discussed.
Resumo:
The current study is a post-hoc analysis of data from the original randomized control trial of the Play and Language for Autistic Youngsters (PLAY) Home Consultation program, a parent-mediated, DIR/Floortime based early intervention program for children with ASD (Solomon, Van Egeren, Mahone, Huber, & Zimmerman, 2014). We examined 22 children from the original RCT who received the PLAY program. Children were split into two groups (high and lower functioning) based on the ADOS module administered prior to intervention. Fifteen-minute parent-child video sessions were coded through the use of CHILDES transcription software. Child and maternal language, communicative behaviors, and communicative functions were assessed in the natural language samples both pre- and post-intervention. Results demonstrated significant improvements in both child and maternal behaviors following intervention. There was a significant increase in child verbal and non-verbal initiations and verbal responses in whole group analysis. Total number of utterances, word production, and grammatical complexity all significantly improved when viewed across the whole group of participants; however, lexical growth did not reach significance. Changes in child communicative function were especially noteworthy, and demonstrated a significant increase in social interaction and a significant decrease in non-interactive behaviors. Further, mothers demonstrated an increase in responsiveness to the child’s conversational bids, increased ability to follow the child’s lead, and a decrease in directiveness. When separated for analyses within groups, trends emerged for child and maternal variables, suggesting greater gains in use of communicative function in both high and low groups over changes in linguistic structure. Additional analysis also revealed a significant inverse relationship between maternal responsiveness and child non-interactive behaviors; as mothers became more responsive, children’s non-engagement was decreased. Such changes further suggest that changes in learned skills following PLAY parent training may result in improvements in child social interaction and language abilities.
Resumo:
Background: Long-term exposure to infrasound and low frequency noise (ILFN <500 Hz, including infrasound) can lead to the development of vibroacoustic disease (VAD). VAD is a systemic pathology characterized by the abnormal growth of extracellular matrices in the absence of inflammatory processes, namely of collagen and elastin, both of which are abundant in the basement membrane zone of the vocal folds. ILFN-exposed workers include pilots, cabin crewmembers, restaurant workers, ship machinists and, in previous studies, even though they did not present vocal symptoms, ILFN-exposed workers had significant different voice acoustic patterns (perturbation and temporal measures) when compared with normative population. Study Aims: The present study investigates the effects of age and years of occupational ILFN-exposure on voice acoustic parameters of 37 cabin crewmembers: 12 males and 25 females. Specifically, the goals of this study are to: 1) Verify if acoustic parameters change over the age and years of ILFN-exposure and 2) Determine if there is any interaction between age and years of ILFNexposure on voice acoustic parameters of crewmembers. Materials and Methods: Spoken phonatory tasks were recorded with a C420III PP AKG head-worn microphone and a DA-P1 Tascam DAT. Acoustic analyses were performed using KayPENTAX Computer Speech Lab and Multi-Dimensional Voice Program. Acoustic parameters included speaking fundamental frequency, perturbation measures (jitter, shimmer and harmonicto- noise ratio), temporal measures (maximum phonation time and s/z ratio) and voice tremor frequency. Results: One-way ANOVA analysis revealed that as the number of ILFN-exposure years increased male cabin crewmembers presented significant different shimmer values of /i/ as well as tremor frequency of /u/. Females presented significantly different jitter % of /i, a, O/ (p <0.05). Lastly, Two-way ANOVA analysis revealed that for females, there was a significant interaction between age and occupational ILFN-exposure for voice acoustic parameters, namely for jitter’s mean for /a, O/ and shimmer’s (%) mean for /a, i/ (p <0.05). Discussion and Conclusion: These perturbation measure patterns may be indicative of histological changes within the vocal folds as a result of ILFN-exposure. The results of this study suggest that voice acoustic analysis may be an important tool for confirming ILFN-induced health effects.