19 results for Digit speech recognition

in Helda - Digital Repository of University of Helsinki


Relevance:

20.00% 20.00%

Publisher:

Abstract:

Speech rhythm is an essential part of speech processing. It is the outcome of the workings of a combination of linguistic and non-linguistic parameters, many of which also have other functions in speech. This study focusses on the acoustic and auditive realization of two linguistic parameters of rhythm: (1) sentence stress, and (2) speech rate and pausing. The aim was to find out how well Finnish comprehensive school pupils realize these two parameters in English and how native speakers of English react to Finnish pupils' English rhythm. The material was elicited by means of a story-telling task and questionnaires. Three female and three male pupils representing different levels of oral skills in English were selected as the experimental group. The control group consisted of two female and two male native speakers of English. The stories were analysed acoustically and auditorily with respect to interstress intervals, weak forms, fundamental frequency, pausing, and speech as well as articulation rate. In addition, 52 native speakers of English were asked to rate the intelligibility of the Finnish pupils' English with respect to speech rhythm and give their attitudes on what the pupils sounded like. Results showed that Finnish pupils can produce isochronous interstress intervals in English, but that too large a proportion of these intervals contain pauses. A closer analysis of the pauses revealed that Finnish pupils pause too frequently and in inappropriate places when they speak English. Frequent pausing was also found to cause slow speech rates. The findings of the fundamental frequency (F0) measurements indicate that Finnish pupils tend to make a slightly narrower F0 difference between stressed and unstressed syllables than the native speakers of English. Furthermore, Finnish pupils appear to know how to reduce the duration and quality of unstressed sounds, but they fail to do it frequently enough. 
Native listeners gave lower intelligibility and attitude scores to pupils with more anomalous speech rhythm. Finnish pupils' rhythm anomalies seemed to derive from various learning- or learner-related factors rather than from the differences between English and Finnish. This study demonstrates that pausing may be a more important component of English speech rhythm than sentence stress as far as Finnish adolescents are concerned and that interlanguage development is affected by various factors and characterised by jumps or periods of stasis. Other theoretical, methodological and pedagogical implications of the results are also discussed.
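
The abstract above separates speech rate from articulation rate. The following is a minimal sketch, assuming the common convention that speech rate counts syllables over the total speaking time (pauses included) while articulation rate excludes pause time; the thesis may define the measures differently. The function name and the sample figures are hypothetical and are not data from the study.

```python
def speech_and_articulation_rate(n_syllables, total_seconds, pause_seconds):
    """Return (speech_rate, articulation_rate) in syllables per second.

    Assumed convention (not taken from the thesis): speech rate includes
    pause time, articulation rate excludes it.
    """
    if total_seconds <= 0:
        raise ValueError("total_seconds must be positive")
    if not 0 <= pause_seconds < total_seconds:
        raise ValueError("pause_seconds must be in [0, total_seconds)")
    speech_rate = n_syllables / total_seconds
    articulation_rate = n_syllables / (total_seconds - pause_seconds)
    return speech_rate, articulation_rate


# Hypothetical sample: 120 syllables in 60 s, of which 20 s are pauses.
sr, ar = speech_and_articulation_rate(120, 60.0, 20.0)
print(f"speech rate: {sr:.2f} syll/s, articulation rate: {ar:.2f} syll/s")
```

Under this convention, adding pauses lowers the speech rate while leaving the articulation rate unchanged, which matches the reported link between frequent pausing and slow speech rates.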

Relevance:

20.00% 20.00%

Publisher:

Abstract:

This dissertation consists of four articles and an introduction. The five parts address the same topic, nonverbal predication in Erzya, from different perspectives. The work is at the same time linguistic typology and Uralic studies. The findings based on a large corpus of empirical Erzya data, which was collected using several different methods and included recordings of the spoken language, made it possible for the present study to apply, then test and finally discuss the previous theories based on cross-linguistic data. Erzya makes use of multiple predication patterns which vary from totally analytic to the morphologically very complex. Nonverbal predicate clause types are classified on the basis of propositional acts in clauses denoting class-membership, identity, property and location. The predicates of these clauses are nouns, adjectives and locational expressions, respectively. The following three predication strategies in Erzya nonverbal predication can be identified: i. the zero-copula construction, ii. the predicative suffix construction and iii. the copula construction. It has been suggested that verbs and nouns cannot be clearly distinguished on morphological grounds when functioning as predicates in Erzya. This study shows that even though predicativity must not be considered a sufficient tool for defining parts of speech in any language, the Erzya lexical classes of adjective, noun and verb can be distinguished from each other also in predicate position. The relative frequency and degree of obligation for using the predicative suffix construction decrease when moving left to right on the scale verb > adjective/locative > noun (> identificational statement). The predicative suffix is the main pattern in the present tense over the whole domain of nonverbal predication in Standard Erzya, but if it is replaced it is most likely to be with a zero-copula construction in a nominal predication. 
This study exploits the theory of (a)symmetry for the first time in order to describe verbal vs. nonverbal predication. It is shown that the asymmetry of paradigms and constructions differentiates the lexical classes. Asymmetrical structures are motivated by functional level asymmetry. Variation in predication as such adds to the complexity of the grammar. When symmetric structures are employed, the functional complexity of grammar decreases, even though morphological complexity increases. The genre affects the employment of predication strategies in Erzya. There are differences in the relative frequency of the patterns, and some patterns are totally lacking from some of the data. The clearest difference is that the past tense predicative suffix construction occurs relatively frequently in Standard Erzya, while it occurs infrequently in the other data. Also, the predicative suffixes of the present tense are used more regularly in written Standard Erzya than in any other genre. The genre also affects the incidence of the translative in uľ(ń)ems copula constructions. In translations from Russian to Erzya the translative case is employed relatively frequently in comparison to other data. This study reveals differences between the two Mordvinic languages Erzya and Moksha. The predicative suffixes (bound person markers) of the present tense are used more regularly in Moksha in all kinds of nonverbal predicate clauses compared to Erzya. It should further be observed that identificational statements are encoded with a predicative suffix in Moksha, but seldom in Erzya. Erzya clauses are more frequently encoded using zero-constructions, displaying agreement in number only.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Comprehension of a complex acoustic signal - speech - is vital for human communication, with numerous brain processes required to convert the acoustics into an intelligible message. In four studies in the present thesis, cortical correlates for different stages of speech processing in a mature linguistic system of adults were investigated. In two further studies, developmental aspects of cortical specialisation and its plasticity in adults were examined. In the present studies, electroencephalographic (EEG) and magnetoencephalographic (MEG) recordings of the mismatch negativity (MMN) response elicited by changes in repetitive unattended auditory events and the phonological mismatch negativity (PMN) response elicited by unexpected speech sounds in attended speech inputs served as the main indicators of cortical processes. Changes in speech sounds elicited the MMNm, the magnetic equivalent of the electric MMN, that differed in generator loci and strength from those elicited by comparable changes in non-speech sounds, suggesting intra- and interhemispheric specialisation in the processing of speech and non-speech sounds at an early automatic processing level. This neuronal specialisation for the mother tongue was also reflected in the more efficient formation of stimulus representations in auditory sensory memory for typical native-language speech sounds compared with those formed for unfamiliar, non-prototype speech sounds and simple tones. Further, adding a speech or non-speech sound context to syllable changes was found to modulate the MMNm strength differently in the left and right hemispheres. Following the acoustic-phonetic processing of speech input, phonological effort related to the selection of possible lexical (word) candidates was linked with distinct left-hemisphere neuronal populations. In summary, the results suggest functional specialisation in the neuronal substrates underlying different levels of speech processing. 
Subsequently, plasticity of the brain's mature linguistic system was investigated in adults, in whom representations for an aurally-mediated communication system, Morse code, were found to develop within the same hemisphere where representations for the native-language speech sounds were already located. Finally, recording and localization of the MMNm response to changes in speech sounds was successfully accomplished in newborn infants, encouraging future MEG investigations on, for example, the state of neuronal specialisation at birth.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Autism and Asperger syndrome (AS) are neurodevelopmental disorders characterised by deficient social and communication skills, as well as restricted, repetitive patterns of behaviour. The language development in individuals with autism is significantly delayed and deficient, whereas in individuals with AS, the structural aspects of language develop quite normally. Both groups, however, have semantic-pragmatic language deficits. The present thesis investigated auditory processing in individuals with autism and AS. In particular, the discrimination of and orienting to speech and non-speech sounds was studied, as well as the abstraction of invariant sound features from speech-sound input. Altogether five studies were conducted with auditory event-related brain potentials (ERP); two studies also included a behavioural sound-identification task. In three studies, the subjects were children with autism, in one study children with AS, and in one study adults with AS. In children with autism, even the early stages of sound encoding were deficient. In addition, these children had altered sound-discrimination processes characterised by enhanced spectral but deficient temporal discrimination. The enhanced pitch discrimination may partly explain the auditory hypersensitivity common in autism, and it may compromise the filtering of relevant auditory information from irrelevant information. Indeed, it was found that when sound discrimination required abstracting invariant features from varying input, children with autism maintained their superiority in pitch processing, but lost it in vowel processing. Finally, involuntary orienting to sound changes was deficient in children with autism in particular with respect to speech sounds. This finding is in agreement with previous studies on autism suggesting deficits in orienting to socially relevant stimuli. In contrast to children with autism, the early stages of sound encoding were fairly unimpaired in children with AS. 
However, sound discrimination and orienting were rather similarly altered in these children as in those with autism, suggesting correspondences in the auditory phenotype in these two disorders which belong to the same continuum. Unlike children with AS, adults with AS showed enhanced processing of duration changes, suggesting developmental changes in auditory processing in this disorder.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Schizophrenia is a severe psychotic disorder affecting 0.5-1% of the population. The disorder is characterized by hallucinations, delusions, disorganized behavior and speech, avolition, anhedonia, flattened affect and cognitive deficits. The etiology of the disorder is complex, with evidence for multiple genes contributing to the onset of the disorder along with environmental factors. DISC1 is one of the most promising candidate genes for schizophrenia. It codes for a protein which takes part in numerous molecular interactions along several pathways. This network, termed the DISC1 pathway, is evidently important for the development and maturation of the central nervous system from the embryo until young adulthood. Disruption of these pathways is thought to predispose to schizophrenia. In the present study, we have studied the DISC1 pathway in the etiology of schizophrenia in the Finnish population. We have utilized large Finnish samples: the schizophrenia family sample where DISC1 was originally shown to associate with schizophrenia and the Northern Finland birth cohort 1966 (NFBC66). Several DISC1 binding partners displayed evidence for association in the family sample along with DISC1. Through a genome-wide linkage study, we found, in the carriers of a certain DISC1 risk variant, a significant linkage signal to a locus where the DISC1 binding partner NDE1 is located. In a follow-up study, genetic markers in NDE1 displayed significant evidence for association with schizophrenia. Further exploration of association between 11 genes of the DISC1 pathway and schizophrenia led to recognition of novel variants in NDEL1, PDE4B and PDE4D that significantly either increased or decreased the risk for schizophrenia. Further, we found evidence that DISC1 itself has a significant role in human mental functioning even in the healthy population. 
Variants in DISC1 had a significant effect on anhedonia, which is a trait present in everybody but is in its severe form one of the main symptoms of schizophrenia and correlates with the risk of developing the disorder. Further, utilizing genome-wide marker data, we recognized three genes, MIR620, CCDC141 and LCT, that are closely related to the DISC1 pathway but whose effects on anhedonia were observable only in the individuals who carried these specific DISC1 variants. Our findings add significantly to the previous evidence for the involvement of DISC1 and the DISC1 pathway in the etiology of schizophrenia and psychosis. Our results support the concept of a number of DISC1 pathway related genes contributing to the etiology of schizophrenia along with DISC1 and provide new candidates for the studies of schizophrenia. Our findings also significantly increase the importance of DISC1 itself as having a role in psychological functioning in the general population.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

The study examines various uses of computer technology in the acquisition of information for visually impaired people. For this study, 29 visually impaired persons took part in a survey about their experiences concerning the acquisition of information and the use of computers, especially with a screen magnification program, a speech synthesizer and a braille display. According to the responses, the evolution of computer technology offers an important possibility for visually impaired people to cope with everyday activities and to interact with the environment. Nevertheless, the functionality of assistive technology needs further development to become more usable and versatile. Since the challenges of independent observation of the environment were emphasized in the survey, the study led to the development of a portable text vision system called Tekstinäkö. Contrary to typical stand-alone applications, the Tekstinäkö system was constructed by combining devices and programs that are readily available on the consumer market. As the system operates, pictures are taken by a digital camera and instantly transmitted to a text recognition program on a laptop computer, which reads the text aloud using a speech synthesizer. Visually impaired test users reported that even uncertain interpretations of the texts in the environment given by the Tekstinäkö system are at least a welcome addition to complete perception of the environment. It became clear that even with modest development work it is possible to bring new, useful and valuable methods to the everyday life of disabled people. The unconventional production process of the system appeared to be efficient as well. The achieved results and the proposed working model offer one suggestion for giving enough attention to the easily overlooked needs of people with special abilities. 
ACM Computing Classification System (1998): K.4.2 Social Issues: Assistive technologies for persons with disabilities; I.4.9 Image processing and computer vision: Applications
Keywords: Visually impaired, computer-assisted, information, acquisition, assistive technology, computer, screen magnification program, speech synthesizer, braille display, survey, testing, text recognition, camera, text, perception, picture, environment, transportation, guidance, independence, vision, disabled, blind, speech, synthesizer, braille, software engineering, programming, program, system, freeware, shareware, open source, Tekstinäkö, text vision, TopOCR, Autohotkey, computer engineering, computer science
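
The camera, text recognition and speech synthesis chain described in this abstract can be pictured as three interchangeable stages. The sketch below is only an illustration of that composition, not the Tekstinäkö implementation (which, per the keywords, combined off-the-shelf tools such as TopOCR and Autohotkey). The function `read_text_aloud` and the three callables passed to it are hypothetical stand-ins.

```python
from typing import Callable


def read_text_aloud(capture: Callable[[], bytes],
                    recognize: Callable[[bytes], str],
                    speak: Callable[[str], None]) -> str:
    """Run one capture -> recognize -> speak cycle and return the text.

    Each stage is injected, so any camera, OCR program or speech
    synthesizer could be plugged in (all hypothetical here).
    """
    image = capture()                 # e.g. a photo from a digital camera
    text = recognize(image).strip()   # e.g. output of a text recognition program
    if text:                          # stay silent when nothing was recognized
        speak(text)
    return text


# Stand-in stages for demonstration; no real devices are used.
spoken = []
result = read_text_aloud(
    capture=lambda: b"fake-image-bytes",
    recognize=lambda image: "  Bus stop 42  ",
    speak=spoken.append,
)
print(result)
```

Keeping the stages as plain callables mirrors the idea of assembling the system from readily available parts: any one stage can be swapped without touching the others.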

Relevance:

20.00% 20.00%

Publisher:

Abstract:

This study is part of an ongoing collaborative bipolar research project, the Jorvi Bipolar Study (JoBS). The JoBS is run by the Department of Mental Health and Alcohol Research of the National Public Health Institute, Helsinki, and the Department of Psychiatry, Jorvi Hospital, Helsinki University Central Hospital (HUCH), Espoo, Finland. It is a prospective, naturalistic cohort study of secondary level care psychiatric in- and outpatients with a new episode of bipolar disorder (BD). The second report also included 269 major depressive disorder (MDD) patients from the Vantaa Depression Study (VDS). The VDS was carried out in collaboration with the Department of Psychiatry of the Peijas Medical Care District. Using the Mood Disorder Questionnaire (MDQ), all in- and outpatients at the Department of Psychiatry at Jorvi Hospital who currently had a possible new phase of DSM-IV BD were sought. Altogether, 1630 psychiatric patients were screened, and 490 were interviewed using a semistructured interview (SCID-I/P). The patients included in the cohort (n=191) had at intake a current phase of BD. The patients were evaluated at intake and at 6- and 18-month interviews. Based on this study, BD is poorly recognized even in psychiatric settings. Of the BD patients with acute worsening of illness, 39% had never been correctly diagnosed. The classic presentations of BD with hospitalizations, manic episodes, and psychotic symptoms lead clinicians to correct diagnosis of BD I in psychiatric care. Time of follow-up elapsed in psychiatric care, but none of the clinical features, seemed to explain correct diagnosis of BD II, suggesting reliance on cross-sectional presentation of illness. Even though BD II was clearly less often correctly diagnosed than BD I, few other differences between the two types of BD were detected. 
BD I and II patients appeared to differ little in terms of clinical picture or comorbidity, and the prevalence of psychiatric comorbidity was strongly related to the current illness phase in both types. At the same time, the difference in outcome was clear. BD II patients spent about 40% more time depressed than BD I patients. Patterns of psychiatric comorbidity of BD and MDD differed somewhat qualitatively. Overall, MDD patients were likely to have more anxiety disorders and cluster A personality disorders, and bipolar patients to have more cluster B personality disorders. The adverse consequences of missing or delayed diagnosis are potentially serious. Thus, these findings strongly support the value of screening for BD in psychiatric settings, especially among the major depressive patients. Nevertheless, the diagnosis must be based on a clinical interview and follow-up of mood. Comorbidity, present in 59% of bipolar patients in a current phase, needs concomitant evaluation, follow-up, and treatment. To improve outcome in BD, treatment of bipolar depression is a major challenge for clinicians.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Speech has both auditory and visual components (heard speech sounds and seen articulatory gestures). During all perception, selective attention facilitates efficient information processing and enables concentration on high-priority stimuli. Auditory and visual sensory systems interact at multiple processing levels during speech perception and, further, the classical motor speech regions seem also to participate in speech perception. Auditory, visual, and motor-articulatory processes may thus work in parallel during speech perception, their use possibly depending on the information available and the individual characteristics of the observer. Because of their subtle speech perception difficulties possibly stemming from disturbances at elemental levels of sensory processing, dyslexic readers may rely more on motor-articulatory speech perception strategies than do fluent readers. This thesis aimed to investigate the neural mechanisms of speech perception and selective attention in fluent and dyslexic readers. We conducted four functional magnetic resonance imaging experiments, during which subjects perceived articulatory gestures, speech sounds, and other auditory and visual stimuli. Gradient echo-planar images depicting blood oxygenation level-dependent contrast were acquired during stimulus presentation to indirectly measure brain hemodynamic activation. Lip-reading activated the primary auditory cortex, and selective attention to visual speech gestures enhanced activity within the left secondary auditory cortex. Attention to non-speech sounds enhanced auditory cortex activity bilaterally; this effect showed modulation by sound presentation rate. 
A comparison between fluent and dyslexic readers' brain hemodynamic activity during audiovisual speech perception revealed stronger activation of predominantly motor speech areas in dyslexic readers during a contrast test that allowed exploration of the processing of phonetic features extracted from auditory and visual speech. The results show that visual speech perception modulates hemodynamic activity within auditory cortex areas once considered unimodal, and suggest that the left secondary auditory cortex specifically participates in extracting the linguistic content of seen articulatory gestures. They are strong evidence for the importance of attention as a modulator of auditory cortex function during both sound processing and visual speech perception, and point out the nature of attention as an interactive process (influenced by stimulus-driven effects). Further, they suggest heightened reliance on motor-articulatory and visual speech perception strategies among dyslexic readers, possibly compensating for their auditory speech perception difficulties.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

This paper describes a new flexible delexicalization method based on a glottal excited parametric speech synthesis scheme. The system utilizes inverse filtered glottal flow and all-pole modelling of the vocal tract. The method provides a possibility to retain and manipulate all relevant prosodic features of any kind of speech. Most importantly, the features include voice quality, which has not been properly modeled in earlier delexicalization methods. The functionality of the new method was tested in a prosodic tagging experiment aimed at providing word prominence data for a text-to-speech synthesis system. The experiment confirmed the usefulness of the method and further corroborated earlier evidence that linguistic factors influence the perception of prosodic prominence.
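
To make "all-pole modelling" and "inverse filtering" concrete, here is a minimal sketch using plain linear prediction (autocorrelation method with the Levinson-Durbin recursion). This is a generic textbook technique chosen for illustration; the paper's actual glottal inverse filtering procedure is not described in the abstract and is likely more elaborate. All function names and the synthetic test signal are assumptions.

```python
def autocorrelation(x, max_lag):
    """Biased autocorrelation sums r[0..max_lag] of the sequence x."""
    n = len(x)
    return [sum(x[i] * x[i + lag] for i in range(n - lag))
            for lag in range(max_lag + 1)]


def levinson_durbin(r, order):
    """Return (a, err) where A(z) = 1 + a[1] z^-1 + ... + a[order] z^-order
    is the prediction-error filter and 1/A(z) the all-pole model."""
    a = [1.0] + [0.0] * order
    err = r[0]
    for i in range(1, order + 1):
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / err                      # reflection coefficient
        new_a = a[:]
        for j in range(1, i):
            new_a[j] = a[j] + k * a[i - j]
        new_a[i] = k
        a = new_a
        err *= 1.0 - k * k
    return a, err


def inverse_filter(x, a):
    """Apply the FIR filter A(z) to x, giving the excitation estimate."""
    return [sum(a[k] * x[n - k] for k in range(len(a)) if n - k >= 0)
            for n in range(len(x))]


# Synthetic check: impulse response of a known all-pole filter,
# x[n] = 1.3 x[n-1] - 0.6 x[n-2] + impulse[n]  (coefficients are made up).
x = []
for n in range(400):
    prev1 = x[n - 1] if n >= 1 else 0.0
    prev2 = x[n - 2] if n >= 2 else 0.0
    x.append(1.3 * prev1 - 0.6 * prev2 + (1.0 if n == 0 else 0.0))

a, err = levinson_durbin(autocorrelation(x, 2), 2)
residual = inverse_filter(x, a)
print([round(c, 4) for c in a])
```

On this synthetic signal the estimated coefficients recover the filter that generated it, and the residual collapses back to the single impulse. For real speech the residual is only a rough stand-in for the glottal excitation.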

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Innate immunity and host defence are rapidly evoked by structurally invariant molecular motifs common to the microbial world, called pathogen associated molecular patterns (PAMPs). In addition to PAMPs, endogenous molecules released in response to inflammation and tissue damage, danger associated molecular patterns (DAMPs), are required for eliciting the response. The most important PAMPs of viruses are viral nucleic acids, their genome or its replication intermediates, whereas the identity and characteristics of virus infection-induced DAMPs are poorly defined. PAMPs and DAMPs engage a limited set of germ-line encoded pattern recognition receptors (PRRs) in immune and non-immune cells. Membrane-bound Toll-like receptors (TLRs), cytoplasmic retinoic acid inducible gene-I (RIG-I)-like receptors (RLRs) and nucleotide-binding oligomerization domain-like receptors (NLRs) are important PRRs involved in the recognition of the molecular signatures of viral infection, such as double-stranded ribonucleic acids (dsRNAs). Engagement of PRRs results in local and systemic innate immune responses which, when activated against viruses, evoke secretion of antiviral and pro-inflammatory cytokines, and programmed cell death, i.e., apoptosis, of the virus-infected cell. Macrophages are the central effector cells of innate immunity. They produce significant amounts of antiviral cytokines, called interferons (IFNs), and pro-inflammatory cytokines, such as interleukin (IL)-1β and IL-18. IL-1β and IL-18 are synthesized as inactive precursors, pro-IL-1β and pro-IL-18, that are processed by caspase-1 in a cytoplasmic multiprotein complex, called the inflammasome. After processing, these cytokines are biologically active and will be secreted. The signals and secretory routes that activate inflammasomes and the secretion of IL-1β and IL-18 during virus infections are poorly characterized. 
The main goal of this thesis was to characterize influenza A virus-induced innate immune responses and host-virus interactions in human primary macrophages during an infection. Methodologically, various techniques of cellular and molecular biology, as well as proteomic tools combined with bioinformatics, were utilized. Overall, the thesis provides interesting insights into inflammatory and antiviral innate immune responses, and has characterized host-virus interactions during influenza A virus-infection in human primary macrophages.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Prostate cancer is one of the most prevalent cancer types in men. The development of prostate tumors is known to require androgen exposure, and several pathways governing cell growth are deregulated in prostate tumorigenesis. Recent genetic studies have revealed that complex gene fusions and copy-number alterations are frequent in prostate cancer, a unique feature among solid tumors. These chromosomal aberrations are thought to arise as a consequence of faulty repair of DNA double strand breaks (DSB). Most repair mechanisms have been studied in detail in cancer cell lines, but how DNA damage is detected and repaired in normal differentiated human cells has not been widely addressed. The events leading to the gene fusions in prostate cancer are under rigorous study, as they not only shed light on the basic pathobiologic mechanisms but may also produce molecular targets for prostate cancer treatment and prevention. Prostate and seminal vesicles are part of the male reproductive system. They share similar structure and function but differ dramatically in their cancer incidence. Approximately fifty primary seminal vesicle carcinomas have been reported worldwide. Surprisingly, little is known about why seminal vesicles are resistant to neoplastic changes. As both tissues are androgen dependent, it is a mystery that androgen signaling would only lead to tumors in prostate tissue. In this work, we set up novel ex vivo human tissue culture models of prostate and seminal vesicles, and used them to study how DNA damage is recognized in normal epithelium. One of the major DNA damage-inducible pathways, mediated by the ATM kinase, was robustly activated in all main cell types of both tissues. Interestingly, we discovered that secretory epithelial cells had less histone variant H2A.X and after DNA damage lower levels of H2AX were phosphorylated on serine 139 (γH2AX) than in basal or stromal cells. 
γH2AX has been considered essential for efficient DSB repair, but as there were no significant differences in the γH2AX levels between the two tissues, it seems more likely that the role of γH2AX is less important in postmitotic cells. We also gained insight into the regulation of p53, an important transcription factor that protects genomic integrity via multiple mechanisms, in human tissues. DSBs did not lead to a pronounced activation of p53, but treatments causing transcriptional stress, on the other hand, were able to launch a notable p53 response in both tissue types. In general, ex vivo culturing of human tissues provided unique means to study differentiated cells in their relevant tissue context, and is suited for testing novel therapeutic drugs before clinical trials. In order to study how prostate and seminal vesicle epithelial cells are able to activate DNA damage-induced cell cycle checkpoints, we used primary cultures of prostate and seminal vesicle epithelial cells. To our knowledge, we are the first to report isolation of human primary seminal vesicle cells. Surprisingly, human prostate epithelial cells did not activate cell cycle checkpoints after DSBs, in part due to low levels of Wee1A, a kinase regulating CDK activity, while primary seminal vesicle epithelial cells possessed proficient cell cycle checkpoints and expressed high levels of Wee1A. Similarly, seminal vesicle cells showed a distinct activation of the p53 pathway after DSBs that did not occur in prostate epithelial cells. This indicates that p53 protein function is under different control mechanisms in the two cell types, which together with proficient cell cycle checkpoints may be crucial in protecting seminal vesicles from endogenous and exogenous DNA damaging factors and, as a consequence, from carcinogenesis. These data indicate that two very similar organs of the male reproductive system do not respond to DNA damage similarly. 
The differentiated, non-replicating cells of both tissues were able to recognize DSBs, but under proliferation human prostate epithelial cells had deficient activation of the DNA damage response. This suggests that prostate epithelium is most vulnerable to accumulating genomic aberrations under conditions where it needs to proliferate, for example after inflammatory cellular damage.