932 resultados para Speech and pioneering sports Colima
Resumo:
At present there is no standard assessment method for rating and comparing the quality of synthesized speech. This study assesses the suitability of Time Frequency Warping (TFW) modulation for use as a reference device for assessing synthesized speech. Time Frequency Warping modulation introduces timing errors into natural speech that produce perceptual errors similar to those found in synthetic speech. It is proposed that TFW modulation used in conjunction with a listening effort test would provide a standard assessment method for rating the quality of synthesized speech. This study identifies the most suitable TFW modulation variable parameter to be used for assessing synthetic speech and assess the results of several assessment tests that rate examples of synthesized speech in terms of the TFW variable parameter and listening effort. The study also attempts to identify the attributes of speech that differentiate synthetic, TFW modulated and natural speech.
Resumo:
Auditory processing disorder (APD) is diagnosed when a patient presents with listening difficulties which can not be explained by a peripheral hearing impairment or higher-order cognitive or language problems. This review explores the association between auditory processing disorder (APD) and other specific developmental disorders such as dyslexia and attention-deficit hyperactivity disorder. The diagnosis and aetiology of APD are similar to those of other developmental disorders and it is well established that APD often co-occurs with impairments of language, literacy, and attention. The genetic and neurological causes of APD are poorly understood, but developmental and behavioural genetic research with other disorders suggests that clinicians should expect APD to co-occur with other symptoms frequently. The clinical implications of co-occurring symptoms of other developmental disorders are considered and the review concludes that a multi-professional approach to the diagnosis and management of APD, involving speech and language therapy and psychology as well as audiology, is essential to ensure that children have access to the most appropriate range of support and interventions.
Resumo:
This paper presents a novel prosody model in the context of computer text-to-speech synthesis applications for tone languages. We have demonstrated its applicability using the Standard Yorùbá (SY) language. Our approach is motivated by the theory that abstract and realised forms of various prosody dimensions should be modelled within a modular and unified framework [Coleman, J.S., 1994. Polysyllabic words in the YorkTalk synthesis system. In: Keating, P.A. (Ed.), Phonological Structure and Forms: Papers in Laboratory Phonology III, Cambridge University Press, Cambridge, pp. 293–324]. We have implemented this framework using the Relational Tree (R-Tree) technique. R-Tree is a sophisticated data structure for representing a multi-dimensional waveform in the form of a tree. The underlying assumption of this research is that it is possible to develop a practical prosody model by using appropriate computational tools and techniques which combine acoustic data with an encoding of the phonological and phonetic knowledge provided by experts. To implement the intonation dimension, fuzzy logic based rules were developed using speech data from native speakers of Yorùbá. The Fuzzy Decision Tree (FDT) and the Classification and Regression Tree (CART) techniques were tested in modelling the duration dimension. For practical reasons, we have selected the FDT for implementing the duration dimension of our prosody model. To establish the effectiveness of our prosody model, we have also developed a Stem-ML prosody model for SY. We have performed both quantitative and qualitative evaluations on our implemented prosody models. The results suggest that, although the R-Tree model does not predict the numerical speech prosody data as accurately as the Stem-ML model, it produces synthetic speech prosody with better intelligibility and naturalness. The R-Tree model is particularly suitable for speech prosody modelling for languages with limited language resources and expertise, e.g. African languages. Furthermore, the R-Tree model is easy to implement, interpret and analyse.
Resumo:
Purpose: The purpose of this paper is to examine the quality of evidence collected during interview. Current UK national guidance on the interviewing of victims and witnesses recommends a phased approach, allowing the interviewee to deliver their free report before any questioning takes place, and stipulating that during this free report the interviewee should not be interrupted. Interviewers, therefore, often find it necessary during questioning to reactivate parts of the interviewee's free report for further elaboration. Design/methodology/approach: The first section of this paper draws on a collection of police interviews with women reporting rape, and discusses one method by which this is achieved - the indirect quotation of the interviewee by the interviewer - exploring the potential implications for the quality of evidence collected during this type of interview. The second section of the paper draws on the same data set and concerns itself with a particular method by which information provided by an interviewee has its meaning "fixed" by the interviewer. Findings: It is found that "formulating" is a recurrent practice arising from the need to clarify elements of the account for the benefit of what is termed the "overhearing audience" - in this context, the police scribe, CPS, and potentially the Court. Since the means by which this "fixing" is achieved necessarily involves the foregrounding of elements of the account deemed to be particularly salient at the expense of other elements which may be entirely deleted, formulations are rarely entirely neutral. Their production, therefore, has the potential to exert undue interviewer influence over the negotiated "final version" of interviewees' accounts. Originality/value: The paper highlights the fact that accurate re-presentations of interviewees' accounts are a crucial tool in ensuring smooth progression of interviews and that re-stated speech and formulation often have implications for the quality of evidence collected during significant witness interviews. © Emerald Group Publishing Limited.
Resumo:
Aim: Sex chromosome aneuploidies increase the risk of spoken or written language disorders but individuals with specific language impairment (SLI) or dyslexia do not routinely undergo cytogenetic analysis. We assess the frequency of sex chromosome aneuploidies in individuals with language impairment or dyslexia. Method: Genome-wide single nucleotide polymorphism genotyping was performed in three sample sets: a clinical cohort of individuals with speech and language deficits (87 probands: 61 males, 26 females; age range 4 to 23 years), a replication cohort of individuals with SLI, from both clinical and epidemiological samples (209 probands: 139 males, 70 females; age range 4 to 17 years), and a set of individuals with dyslexia (314 probands: 224 males, 90 females; age range 7 to 18 years). Results: In the clinical language-impaired cohort, three abnormal karyotypic results were identified in probands (proband yield 3.4%). In the SLI replication cohort, six abnormalities were identified providing a consistent proband yield (2.9%). In the sample of individuals with dyslexia, two sex chromosome aneuploidies were found giving a lower proband yield of 0.6%. In total, two XYY, four XXY (Klinefelter syndrome), three XXX, one XO (Turner syndrome), and one unresolved karyotype were identified. Interpretation: The frequency of sex chromosome aneuploidies within each of the three cohorts was increased over the expected population frequency (approximately 0.25%) suggesting that genetic testing may prove worthwhile for individuals with language and literacy problems and normal non-verbal IQ. Early detection of these aneuploidies can provide information and direct the appropriate management for individuals. © 2013 The Authors. Developmental Medicine & Child Neurology published by John Wiley & Sons Ltd on behalf of Mac Keith Press.
Resumo:
Purpose: Phonological accounts of reading implicate three aspects of phonological awareness tasks that underlie the relationship with reading; a) the language-based nature of the stimuli (words or nonwords), b) the verbal nature of the response, and c) the complexity of the stimuli (words can be segmented into units of speech). Yet, it is uncertain which task characteristics are most important as they are typically confounded. By systematically varying response-type and stimulus complexity across speech and non-speech stimuli, the current study seeks to isolate the characteristics of phonological awareness tasks that drive the prediction of early reading. Method: Four sets of tasks were created; tone stimuli (simple non-speech) requiring a non-verbal response, phonemes (simple speech) requiring a non-verbal response, phonemes requiring a verbal response, and nonwords (complex speech) requiring a verbal response. Tasks were administered to 570 2nd grade children along with standardized tests of reading and non-verbal IQ. Results: Three structural equation models comparing matched sets of tasks were built. Each model consisted of two 'task' factors with a direct link to a reading factor. The following factors predicted unique variance in reading: a) simple speech and non-speech stimuli, b) simple speech requiring a verbal response but not simple speech requiring a non-verbal-response, and c) complex and simple speech stimuli. Conclusions: Results suggest that the prediction of reading by phonological tasks is driven by the verbal nature of the response and not the complexity or 'speechness' of the stimuli. Findings highlight the importance of phonological output processes to early reading.
Resumo:
Yorick Wilks is a central figure in the fields of Natural Language Processing and Artificial Intelligence. His influence has extends to many areas of these fields and includes contributions to Machine Translation, word sense disambiguation, dialogue modeling and Information Extraction.This book celebrates the work of Yorick Wilks from the perspective of his peers. It consists of original chapters each of which analyses an aspect of his work and links it to current thinking in that area. His work has spanned over four decades but is shown to be pertinent to recent developments in language processing such as the Semantic Web.This volume forms a two-part set together with Words and Intelligence I, Selected Works by Yorick Wilks, by the same editors.
Resumo:
In this paper, we present syllable-based duration modelling in the context of a prosody model for Standard Yorùbá (SY) text-to-speech (TTS) synthesis applications. Our prosody model is conceptualised around a modular holistic framework. This framework is implemented using the Relational Tree (R-Tree) techniques. An important feature of our R-Tree framework is its flexibility in that it facilitates the independent implementation of the different dimensions of prosody, i.e. duration, intonation, and intensity, using different techniques and their subsequent integration. We applied the Fuzzy Decision Tree (FDT) technique to model the duration dimension. In order to evaluate the effectiveness of FDT in duration modelling, we have also developed a Classification And Regression Tree (CART) based duration model using the same speech data. Each of these models was integrated into our R-Tree based prosody model. We performed both quantitative (i.e. Root Mean Square Error (RMSE) and Correlation (Corr)) and qualitative (i.e. intelligibility and naturalness) evaluations on the two duration models. The results show that CART models the training data more accurately than FDT. The FDT model, however, shows a better ability to extrapolate from the training data since it achieved a better accuracy for the test data set. Our qualitative evaluation results show that our FDT model produces synthesised speech that is perceived to be more natural than our CART model. In addition, we also observed that the expressiveness of FDT is much better than that of CART. That is because the representation in FDT is not restricted to a set of piece-wise or discrete constant approximation. We, therefore, conclude that the FDT approach is a practical approach for duration modelling in SY TTS applications. © 2006 Elsevier Ltd. All rights reserved.
Resumo:
This paper presents a novel intonation modelling approach and demonstrates its applicability using the Standard Yorùbá language. Our approach is motivated by the theory that abstract and realised forms of intonation and other dimensions of prosody should be modelled within a modular and unified framework. In our model, this framework is implemented using the Relational Tree (R-Tree) technique. The R-Tree is a sophisticated data structure for representing a multi-dimensional waveform in the form of a tree. Our R-Tree for an utterance is generated in two steps. First, the abstract structure of the waveform, called the Skeletal Tree (S-Tree), is generated using tone phonological rules for the target language. Second, the numerical values of the perceptually significant peaks and valleys on the S-Tree are computed using a fuzzy logic based model. The resulting points are then joined by applying interpolation techniques. The actual intonation contour is synthesised by Pitch Synchronous Overlap Technique (PSOLA) using the Praat software. We performed both quantitative and qualitative evaluations of our model. The preliminary results suggest that, although the model does not predict the numerical speech data as accurately as contemporary data-driven approaches, it produces synthetic speech with comparable intelligibility and naturalness. Furthermore, our model is easy to implement, interpret and adapt to other tone languages.
Resumo:
This article considers how conscious use of dialect in writing is an intentional act and can be accounted for through the notion of enregisterment. It does this by exploring the value of dialect in social and ideological contexts in relation to a regional dialect of British speech, that of the Black Country in the West Midlands region of England. The article provides a summary of recent directions in sociolinguistic research and an overview of the Black Country speech community, including a summary of its distinctive linguistic variables. This description is then used as an external evaluation of the authenticity of written representations of Black Country speech and the items enregistered in writing. Analysis of three written texts taken from three different genres across a time span of 30 years reveals the extent to which identified linguistic features are drawn upon in each one of the three texts and the extent to which any one is enregisterd across all three. The article discusses the social and linguistic contexts within which the writing occurs by way of accounting for their enregisterment as markers of identity linked to region and place. It also considers the ways in which the texts juxtapose norms and values of those "within" the community with those from "outside" the community in ways that subvert traditional notions of linguistic hierarchy.
Resumo:
An estimated 30% of individuals with autism spectrum disorders (ASD) remain minimally verbal into late childhood, but research on cognition and brain function in ASD focuses almost exclusively on those with good or only moderately impaired language. Here we present a case study investigating auditory processing of GM, a nonverbal child with ASD and cerebral palsy. At the age of 8 years, GM was tested using magnetoencephalography (MEG) whilst passively listening to speech sounds and complex tones. Where typically developing children and verbal autistic children all demonstrated similar brain responses to speech and nonspeech sounds, GM produced much stronger responses to nonspeech than speech, particularly in the 65–165 ms (M50/M100) time window post-stimulus onset. GM was retested aged 10 years using electroencephalography (EEG) whilst passively listening to pure tone stimuli. Consistent with her MEG response to complex tones, GM showed an unusually early and strong response to pure tones in her EEG responses. The consistency of the MEG and EEG data in this single case study demonstrate both the potential and the feasibility of these methods in the study of minimally verbal children with ASD. Further research is required to determine whether GM's atypical auditory responses are characteristic of other minimally verbal children with ASD or of other individuals with cerebral palsy.
Resumo:
This study investigated the effects of an explicit individualized phonemic awareness intervention administered by a speech-language pathologist to 4 prekindergarten children with phonological speech sound disorders. Research has demonstrated that children with moderate-severe expressive phonological disorders are at-risk for poor literacy development because they often concurrently exhibit weaknesses in the development of phonological awareness skills (Rvachew, Ohberg, Grawburg, & Heyding, 2003).^ The research design chosen for this study was a single subject multiple probe design across subjects. After stable baseline measures, the participants received explicit instruction in each of the three phases separately and sequentially. Dependent measures included same-day tests for Phase I (Phoneme Identity), Phase II (Phoneme Blending), and Phase III (Phoneme Segmentation), and generalization and maintenance tests for all three phases.^ All 4 participants made substantial progress in all three phases. These skills were maintained during weekly and biweekly maintenance measures. Generalization measures indicated that the participants demonstrated some increases in their mean total number of correct responses in Phase II and Phase III baseline while the participants were in Phase I intervention, and more substantial increases in Phase III baseline while the participants were in Phase II intervention. Increased generalization from Phases II to III could likely be explained due to the response similarities in those two skills (Cooper, Heron, & Heward, 2007).^ Based upon the findings of this study, speech-language pathologists should evaluate phonological awareness in the children in their caseloads prior to kindergarten entry, and should allocate time during speech therapy to enhance phonological awareness and letter knowledge to support the development of both skills concurrently. Also, classroom teachers should collaborate with speech-language pathologists to identify at-risk students in their classrooms and successfully implement evidence-based phonemic awareness instruction. Future research should repeat this study including larger groups of children, children with combined speech and language delays, children of different ages, and ESOL students.^
Resumo:
This study investigated the effects of an explicit individualized phonemic awareness intervention administered by a speech-language pathologist to 4 prekindergarten children with phonological speech sound disorders. Research has demonstrated that children with moderate-severe expressive phonological disorders are at-risk for poor literacy development because they often concurrently exhibit weaknesses in the development of phonological awareness skills (Rvachew, Ohberg, Grawburg, & Heyding, 2003). The research design chosen for this study was a single subject multiple probe design across subjects. After stable baseline measures, the participants received explicit instruction in each of the three phases separately and sequentially. Dependent measures included same-day tests for Phase I (Phoneme Identity), Phase II (Phoneme Blending), and Phase III (Phoneme Segmentation), and generalization and maintenance tests for all three phases. All 4 participants made substantial progress in all three phases. These skills were maintained during weekly and biweekly maintenance measures. Generalization measures indicated that the participants demonstrated some increases in their mean total number of correct responses in Phase II and Phase III baseline while the participants were in Phase I intervention, and more substantial increases in Phase III baseline while the participants were in Phase II intervention. Increased generalization from Phases II to III could likely be explained due to the response similarities in those two skills (Cooper, Heron, & Heward, 2007). Based upon the findings of this study, speech-language pathologists should evaluate phonological awareness in the children in their caseloads prior to kindergarten entry, and should allocate time during speech therapy to enhance phonological awareness and letter knowledge to support the development of both skills concurrently. Also, classroom teachers should collaborate with speech-language pathologists to identify at-risk students in their classrooms and successfully implement evidence-based phonemic awareness instruction. Future research should repeat this study including larger groups of children, children with combined speech and language delays, children of different ages, and ESOL students
Resumo:
Sports tourism has received growing attention in academic research over the past two decades (Weed and Bull, 2009, Gibson, 2005) but greater understanding of the consumer is needed, particularly the factors influencing decisions to include sport as part of a leisure trip. This paper provides, through a focus on the sport of golf, insight into the characteristics of the sports tourist and how sports tourist behaviours influence the selection of locations deemed suitable for sports participation. This qualitative research employs a grounded theory methodology, underpinned by a constructivist epistemology, to evaluate twenty-six in-depth interviews with golf tourists. The findings propose a model which explains the relationship between golf tourist behaviours and destination selection. This identifies six strands which determine the relationship between the golf tourist, golf behaviours and destination selection (constructing the golf holiday, emotional rewards of taking a trip, total trip spend, amenities and support facilities, course characteristics and reputation of the destination). Furthermore it illuminates the complexity of these relationships through recognition of four spheres of influence (group dynamics, competition and ability, golfing capital and intermediaries). Discussion elucidates how this increased understanding of the golf tourist behaviours and destination selection might be applied to other sports, with conclusions exploring implications for the sports tourism industry and destinations.
Resumo:
After defining the “enunciative scheme” (sentence type) as a communicative unit, the imperative is characterized as a morphologized modality of appellative kind used when the following conditions occur: appellative meaning, 2nd person, future tense and absence of negation. In Spanish, any variation of any of these requirements determines that the subjunctive is used. We reject the idea that the imperative is a variant of subjunctive specialized in appellative function and that both modes share a desiderative morpheme. Working in this way means attributing to a morphological category of the verb a property that actually corresponds to the enunciative schemes (sentence types). We propose to integrate the imperative and subjunctive in the framework of what we call the “desiderative-appellative space”. This “space” brings together various grammatical or grammaticalized means based on the imperative and the subjunctive. Semantically, it is organized around a component of desirability (action appears as desirable) that, by varying several factors, configures a route that goes from a center (the imperative) to a periphery (the expression of desire).