995 results for Semantic context
Abstract:
When speech is degraded, word report is higher for semantically coherent sentences (e.g., her new skirt was made of denim) than for anomalous sentences (e.g., her good slope was done in carrot). Such increased intelligibility is often described as resulting from "top-down" processes, reflecting an assumption that higher-level (semantic) neural processes support lower-level (perceptual) mechanisms. We used time-resolved sparse fMRI to test for top-down neural mechanisms, measuring activity while participants heard coherent and anomalous sentences presented in speech envelope/spectrum noise at varying signal-to-noise ratios (SNR). The timing of BOLD responses to more intelligible speech provides evidence of hierarchical organization, with earlier responses in peri-auditory regions of the posterior superior temporal gyrus than in more distant temporal and frontal regions. Despite Sentence content × SNR interactions in the superior temporal gyrus, prefrontal regions respond after auditory/perceptual regions. Although we cannot rule out top-down effects, this pattern is more compatible with a purely feedforward or bottom-up account, in which the results of lower-level perceptual processing are passed to inferior frontal regions. Behavioral and neural evidence that sentence content influences perception of degraded speech does not necessarily imply "top-down" neural processes.
Abstract:
Research on semantic processing has focused mainly on isolated units of language, which does not reflect the complexity of language. In order to understand how semantic information is processed in a wider context, the first goal of this thesis was to determine whether Swedish pre-school children are able to comprehend semantic context and whether that context is semantically built up over time. The second goal was to investigate how the brain distributes attentional resources, by means of brain activation amplitude and processing type. Swedish pre-school children were tested in a dichotic listening task with longer children’s narratives. The development of the event-related potential N400 component and its amplitude were used to investigate both goals. The decrease of the N400 in the attended and unattended channels indicated semantic comprehension and that semantic context was built up over time. The attended stimulus received more resources, was processed in more of a top-down manner and elicited a prominent N400 amplitude in contrast to the unattended stimulus. The N400 and the late positivity were more complex than expected, since endings of utterances longer than nine words were not accounted for. More research on wider linguistic context is needed in order to understand how the human brain comprehends natural language.
Abstract:
Conflicting findings regarding the ability of people with schizophrenia to maintain and update semantic contexts have been due, arguably, to vagaries within the experimental design employed (e.g. whether strongly or remotely associated prime-target pairs have been used, what delay between the prime and the target was employed, and what proportion of related prime-target pairs appeared) or to characteristics of the participant cohort (e.g. medication status, chronicity of illness). The aim of the present study was to examine how people with schizophrenia maintain and update contextual information over an extended temporal window by using multiple primes that were either remotely associated or unrelated to the target. Fourteen participants with schizophrenia and 12 healthy matched controls were compared across two stimulus onset asynchronies (SOAs) (short and long) and two relatedness proportions (RP) (high and low) in a crossed design. Analysis of variance statistics revealed significant two- and three-way interactions between Group and SOA, Group and Condition, SOA and RP, and Group, SOA and RP. The participants with schizophrenia showed evidence of enhanced remote priming at the short SOA and low RP, combined with a reduction in the time course over which context could be maintained. There was some sensitivity to biasing contextual information at the short SOA, although the mechanism over which context served to update information appeared to be different from that in the controls. The participants with schizophrenia showed marked performance decrements at the long SOA (both low and high RP). Indices of remote priming at the short (but not the long) SOA correlated with both clinical ratings of thought disorder and with increasing length of illness. The results support and extend the hypothesis that schizophrenia is associated with concurrent increases in tonic dopamine activity and decreases in phasic dopamine activity. (C) 2004 Elsevier Ireland Ltd. 
All rights reserved.
Abstract:
Recent empirical studies about the neurological executive nature of reading in bilinguals differ in their evaluations of the degree of selective manifestation in lexical access as implicated by data from early and late reading measures in the eye-tracking paradigm. Currently two scenarios are plausible: (1) Lexical access in reading is fundamentally language non-selective and top-down effects from semantic context can influence the degree of selectivity in lexical access; (2) Cross-lingual lexical activation is actuated via bottom-up processes without being affected by top-down effects from sentence context. In an attempt to test these hypotheses empirically, this study analyzed reader-text events arising when cognate facilitation and semantic constraint interact in a 2 × 2 factorially designed experiment tracking the eye movements of 26 Swedish-English bilinguals reading in their L2. Stimulus conditions consisted of high- and low-constraint sentences embedded with either a cognate or a non-cognate control word. The results showed clear signs of cognate facilitation in both early and late reading measures and in both sentence conditions. This evidence in favour of the non-selective hypothesis indicates that the manifestation of non-selective lexical access in reading is not constrained by top-down effects from semantic context.
Abstract:
Previous work examining context effects in children has been limited to semantic context. The current research examined the effects of grammatical priming of word-naming in fourth-grade children. In Experiment 1, children named both inflected and uninflected noun and verb target words faster when they were preceded by grammatically constraining primes than when they were preceded by neutral primes. Experiment 1 used a long stimulus onset asynchrony (SOA) interval of 750 msec. Experiment 2 replicated the grammatical priming effect at two SOA intervals (400 msec and 700 msec), suggesting that the grammatical priming effect does not reflect the operation of any gross strategic effects directly attributable to the long SOA interval employed in Experiment 1. Grammatical context appears to facilitate target word naming by constraining target word class. Further work is required to elucidate the loci of this effect.
Abstract:
Evidence for expectancy-based priming in the pronunciation task was provided in three experiments. In Experiments 1 and 2, a high proportion of associatively related trials produced greater associative priming and superior retrieval of primes in a subsequent test of memory for primes, whereas high- and low-proportion groups showed comparable repetition benefits in perceptual identification of previously presented primes. In Experiment 2, the low-proportion condition had few associatively related pairs but many identity pairs. In Experiment 3, identity priming was greater in a high- than a low-identity proportion group, with similar repetition benefits and prime retrieval responses for the two groups. These results indicate that when the prime-target relationship is salient, subjects strategically vary their processing of the prime according to the nature of the prime-target relationship.
Abstract:
Past multisensory experiences can influence current unisensory processing and memory performance. Repeated images are better discriminated if initially presented as auditory-visual pairs, rather than only visually. An experience's context thus plays a role in how well repetitions of certain aspects are later recognized. Here, we investigated factors during the initial multisensory experience that are essential for generating improved memory performance. Subjects discriminated repeated versus initial image presentations intermixed within a continuous recognition task. Half of initial presentations were multisensory, and all repetitions were only visual. Experiment 1 examined whether purely episodic multisensory information suffices for enhancing later discrimination performance by pairing visual objects with either tones or vibrations. We could therefore also assess whether effects can be elicited with different sensory pairings. Experiment 2 examined semantic context by manipulating the congruence between auditory and visual object stimuli within blocks of trials. Relative to images only encountered visually, accuracy in discriminating image repetitions was significantly impaired by auditory-visual, yet unaffected by somatosensory-visual multisensory memory traces. By contrast, this accuracy was selectively enhanced for visual stimuli with semantically congruent multisensory pasts and unchanged for those with semantically incongruent multisensory pasts. The collective results reveal opposing effects of purely episodic versus semantic information from auditory-visual multisensory events. Nonetheless, both types of multisensory memory traces are accessible for processing incoming stimuli and indeed result in distinct visual object processing, leading to either impaired or enhanced performance relative to unisensory memory traces. We discuss these results as supporting a model of object-based multisensory interactions.
Abstract:
In the early 1990s, legislation and the labour market organisations in Finland stipulated that temporary agency work was to be used only for temporary labour needs, such as substitute cover and peak workloads. In some sectors, agency work was prohibited by joint agreement between employers and employees. Although agency employment had already acquired a reputation as employment-contract profiteering in the 1980s, both in Finland and internationally, the amount of agency work in Finland began to grow in the mid-1990s and especially on entering the 2000s. Finnish academic researchers have shown little interest in agency work. The few earlier studies have concentrated mainly on the experiences of work communities and workers and on the terms and conditions of agency work. Agency work is thus still understood mainly as the worker's subjective experience. Agency work, however, is a matter not only of experiences but also of a societal power struggle in which discursive means are used to influence the phenomenon called agency work, more broadly the phenomenon called the labour market, and also citizens' conceptions of what is ”normal” in working life. The present study broadens the understanding of agency work by examining the phenomenon through the public conceptions and meanings constructed by legislation, news reporting and marketing. As my theoretical framework I use the theoretical traditions of governance and of labour process regulation. The way in which labour market change and new forms of employment are justified and made intelligible in politics, the media, legislation or coffee-table conversations at the workplace is at the same time the creation, delimitation and description of the values, meanings and agencies attached to working life. Talk about working life is therefore not only about the laws of economics, the functioning of the national economy or the competitiveness of companies, but also, and especially, about creating, defining and legitimising the actors who are allowed to act and who are rewarded in the field of working life.
From the perspective of regulation and governance, it is relevant to examine the concepts and meanings with which agency work is constructed as a phenomenon. My research questions are: 1) How and on what grounds was agency work constructed in Finland as a legitimate way to employ and to be employed? 2) What kinds of worker ideals are constructed in discussions related to agency work? As research material I examine documents related to legislation, news reporting in Helsingin Sanomat, the marketing materials of staffing agencies, and interviews with representatives of staffing agencies. As my method of analysis I use critical discourse analysis. This method makes it possible to examine speech and documents as social action through which different actors seek to take part in the processes of constructing, interpreting and defining the conceptions and courses of action that are accepted and recognised in society. As the main finding of my study I argue that agency work became a legitimate way to employ in Finland in the 1990s because it was conceptualised, in the discourses of both legislation and the media, above all as a solution to unemployment. On the other hand, agency work was conceptualised as a role only for marginal groups of workers (women and students), so that it did not touch the everyday life of male-dominated workplaces. As a solution to unemployment, agency work was at the same time naturalised as part of a more general labour market development about which ”nobody can do anything”. In the 2000s agency work continued its triumphal march and was established as a permanent phenomenon, because the reform of labour legislation institutionalised agency work within the collective agreement procedure, thereby confirming its ”respectability” and normality. Although the collective agreement issue was significant as a solution, in the case of agency work the collective agreement itself became more important than its content. Collective agreements, however, could not affect, for example, the agency worker's non-existent job security.
In addition, in the 2000s employer talk conceptualises agency work above all as a labour market alternative, a catalyst of the labour market that offers freedom and varied work experience. In this space of meanings, agency work is ”only” one way for workers to find employment and their own path into the labour market, and by no means a compulsion dictated by employers. The governance talk directed at workers, both in the media and in the employer interviews, in turn seeks to construct as the ideal worker an entrepreneurial managing director of his or her own life. The employers' discourses echo the demand that workers steer themselves and shape their worker identity so that it at least tolerates, and preferably actively seeks and values, flexibility, adaptability, variety and constant change. The actor in the labour market is expressly the individual, whose chances of success lie solely in his or her own hands. Emphasising the worker's role as an active agent, and as a coper who finds the ”right”, norm-conforming content in agency work, is a discursively governed attempt to steer workers to see both themselves as particular kinds of actors and the labour market as functioning in a particular way. Agency work is not only about workers' individual or isolated experiences. Agency work is the outcome of a societal struggle over meanings, driven by governing and regulation-seeking conceptualisations of employment, individual choice and the good of society as a whole. Seen from the perspective of governance and regulation, agency work has also meant a shift of regulation: from the collective regulation of the labour market by authorities and political actors, which emphasises equality, equal treatment and worker protection, to an employer-maintained, hegemonic, individual regulation of the worker's person and behaviour.
Abstract:
McDaniel, Robinson-Riegler, and Einstein (1998) recently reported findings in support of the proposal that prospective remembering is largely conceptually driven. In each of the three experiments they reported, however, the task in which the prospective memory target was encountered at test had a predominantly conceptual focus, thereby potentially facilitating retrieval of conceptually encoded features of the studied target event. We report two experiments in which we manipulated the dimension (perceptual or conceptual) along which a target event varied between study and test while using a processing task, at both study and test, compatible with the relevant dimension of target change. When the target was encountered in a sentence validity task at study and test, and the semantic context in which a target was encountered was changed between these two occasions, prospective remembering declined (Experiment 1). A similar decline occurred, using a readability rating task, when the perceptual context (font in which the word was printed) was altered (Experiment 2). These results indicate that both perceptual and conceptual processes can support prospective remembering.
Abstract:
This investigation moves beyond the traditional studies of word reading to identify how the production complexity of words affects reading accuracy in an individual with deep dyslexia (JO). We examined JO’s ability to read words aloud while manipulating both the production complexity of the words and the semantic context. The classification of words as either phonetically simple or complex was based on the Index of Phonetic Complexity. The semantic context was varied using a semantic blocking paradigm (i.e., semantically blocked and unblocked conditions). In the semantically blocked condition words were grouped by semantic categories (e.g., table, sit, seat, couch), whereas in the unblocked condition the same words were presented in a random order. JO’s performance on reading aloud was also compared to her performance on a repetition task using the same items. Results revealed a strong interaction between word complexity and semantic blocking for reading aloud but not for repetition. JO produced the greatest number of errors for phonetically complex words in the semantically blocked condition. This interaction suggests that semantic processes are constrained by output production processes which are exaggerated when derived from visual rather than auditory targets. This complex relationship between orthographic, semantic, and phonetic processes highlights the need for word recognition models to explicitly account for production processes.
Abstract:
The present article examines production and on-line processing of definite articles in Turkish-speaking sequential bilingual children acquiring English and Dutch as second languages (L2) in the UK and in the Netherlands, respectively. Thirty-nine 6–8-year-old L2 children and 48 monolingual (L1) age-matched children participated in two separate studies examining the production of definite articles in English and Dutch in conditions manipulating semantic context, that is, the anaphoric and the bridging contexts. Sensitivity to article omission was examined in the same groups of children using an on-line processing task involving article use in the same semantic contexts as in the production task. The results indicate that both L2 children and L1 controls are less accurate when definiteness is established by keeping track of the discourse referents (anaphoric) than when it is established via world knowledge (bridging). Moreover, despite variable production, all groups of children were sensitive to the omission of definite articles in the on-line comprehension task. This suggests that the errors of omission are not due to the lack of abstract syntactic representations, but could result from processes implicated in the spell-out of definite articles. The findings are in line with the idea that variable production in child L2 learners does not necessarily indicate lack of abstract representations (Haznedar and Schwartz, 1997).
Abstract:
This project was an experiment in widening the traditional borders of study in the field and looking at the phenomenon of Gothic taste in many genres and kinds of art. The Gothic taste was a major element in the cultural image of the Enlightenment both in western Europe and in Russia. It was an essential component in the world outlook of an educated person and without studying this phenomenon it is impossible to fully understand the thinking of artistic professionals, amateurs and users in Russian society in the 18th century. Mr. Khatchatourov first analysed the reasons for the importance of Gothic taste in the culture of the European Enlightenment and then studied its linguistic and lexicographic evolution in 18th century Russian culture. He sought to determine the semantic context which actively formed the human mind set in the Enlightenment, including potential users and producers of articles in the Gothic taste. He then looked at the process of absorption of this concept by those forms of art which express it most strongly, in particular architecture and the theatre. His study was based on a comprehensive historical and culturological study using a wide range of sources, a formal stylistic method approach considering the interaction of non-classical styles of the Enlightenment with the dominant classicism, and an iconographic approach which revealed the essential aspects in a new image synthesis of the culture of the Enlightenment.
Abstract:
Land degradation is intrinsically complex and involves decisions by many agencies and individuals. Land degradation mapping should therefore be used as a learning tool through which managers, experts and stakeholders can re-examine their views within a wider semantic context. In this paper, we introduce an analytical framework for mapping land degradation, developed by the World Overview of Conservation Approaches and Technologies (WOCAT) program, which aims to produce thematic maps that serve as a useful tool and include effective information on land degradation and conservation status. This methodology would thus provide an important background for decision-making on launching rehabilitation/remediation actions in high-priority intervention areas. As land degradation mapping is a problem-solving task that aims to provide clear information, this study implements the WOCAT mapping tool, which integrates a set of indicators to appraise the severity of land degradation across a representative watershed. The work focuses on the most relevant indicators for measuring the impacts of different degradation processes in the El Mkhachbiya catchment, situated in the Northwest of Tunisia, and on the actions taken to deal with them, based on an analysis of operating modes and degradation issues in different land use systems. The study aims to provide a database for the surveillance and monitoring of land degradation, in order to support stakeholders in making appropriate choices and in judging guidelines and suitable recommendations to remedy the situation and promote sustainable development. The approach is illustrated through a case study of an urban watershed in the Northwest of Tunisia. Results showed that the main land degradation drivers in the study area were related to natural processes, which were exacerbated by human activities.
The output of this analytical framework thus enabled better communication of land degradation issues and concerns in a way relevant to policymakers.
Abstract:
The last decade has witnessed major advances in the field of speech recognition technology. Commercial systems currently available are able to recognize continuous speech from multiple speakers, achieving acceptable error rates and without the need for explicit adaptation procedures. Despite the good moment this technology is enjoying, speech recognition is far from being a solved problem. Most of these recognition systems are tuned to particular domains, and their effectiveness depends significantly, among many other aspects, on the similarity between the language model used and the specific task for which it is being employed. This dependence becomes even more important in scenarios in which the statistical properties of the language vary over time, for example in application domains involving spontaneous speech and multiple topics. In recent years there has been a sustained effort to improve recognition systems for such domains. This has been done, among many other approaches, through automatic adaptation techniques. These techniques are applied to already existing systems, since porting a system to a new task or domain can be both time-consuming and costly. Adaptation techniques require additional sources of information, and spoken language can provide some of them. Speech not only conveys a message; it also conveys information about the context in which the spoken communication takes place (e.g. about the topic being discussed). Therefore, when we communicate through speech, it is possible to identify the elements of language that characterize the context and, at the same time, to track the changes that occur in these elements over time.
This information could be captured and exploited by means of information retrieval and machine learning techniques. Within the development of better automatic speech recognition systems, this could allow us to improve the adaptation of language models to the conditions of the context and thus make the recognition system more robust in domains with changing conditions (such as potential variations in vocabulary, style and topic). In this sense, the main contribution of this Thesis is the proposal and evaluation of a contextualization framework motivated by topic analysis and based on the dynamic, unsupervised adaptation of language models to make an automatic speech recognition system more robust. This adaptation builds on different approaches from the fields mentioned above (information retrieval and machine learning), through which we seek to identify the topics being discussed in an audio recording. This identification, therefore, makes it possible to adapt the language model according to the conditions of the context. The proposed contextualization framework can be divided into two main systems: a topic identification system and a dynamic language model adaptation system. This Thesis can be described in detail from the perspective of the particular contributions made in each of the fields that make up the proposed framework:
• Regarding the topic identification system, we have focused on improving document preprocessing techniques and on contributing to the definition of more robust criteria for the selection of index-terms.
– The efficiency of systems based on both information retrieval and machine learning techniques, and specifically of those systems devoted to the task of topic identification, depends to a large extent on the preprocessing mechanisms applied to the documents. Among the many operations that make up a preprocessing scheme, an adequate selection of indexing terms (index-terms) is crucial for establishing semantic and conceptual relationships between terms and documents. This process can also be impaired either by a poor choice of stopwords or by a lack of precision in the definition of lemmatization rules. In this regard, in this work we compare and evaluate different criteria for preprocessing the documents, as well as different strategies for selecting the index-terms. This allows us not only to reduce the size of the indexing structure but also to improve the topic identification process.
– One of the most important aspects regarding the performance of topic identification systems is the assignment of different weights to terms according to their contribution to the content of the document. In this work we evaluate and propose alternative approaches to traditional term weighting schemes (such as tf-idf) that allow us to improve the specificity of the terms and to discriminate better between the topics of the documents.
• Regarding the dynamic adaptation of language models, we have divided the contextualization process into several steps.
– For the generation of topic-based language models, we propose two types of approach: a supervised approach and an unsupervised approach. In the first, we rely on the topic labels that originally accompany the documents of the corpus we use.
Based on these labels, we group the documents that belong to the same topic and generate language models from these groups. However, one of the objectives of this Thesis is to evaluate whether using these labels to generate models is optimal in terms of recognizer performance. For this reason, we propose a second, unsupervised approach, in which the objective is to group the documents automatically into topic clusters, based on the semantic similarity between the documents. By means of clustering approaches we managed to improve the conceptual and semantic cohesion of each cluster, which in turn allowed us to refine the topic-based language models and improve the performance of the recognition system.
– We develop various strategies for generating a context-dependent language model. Our aim is for this model to reflect the semantic context of the speech, i.e. the most relevant topics being discussed. This model is generated by linear interpolation between those topic-based language models that are related to the most relevant topics. The estimation of the interpolation weights is based mainly on the result of the topic identification process.
– Finally, we propose a methodology for the dynamic adaptation of a general language model. The adaptation process takes into account not only the context-dependent model but also the information provided by the topic identification process. The scheme used for the adaptation is a linear interpolation between the general model and the context-dependent model. We also study different approaches for determining the interpolation weights between the two models.
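The two interpolation steps described above (topic-based models combined into a context-dependent model, which is then interpolated with the general model) can be illustrated with a minimal sketch. It is not the thesis's implementation: it assumes toy unigram models stored as dictionaries, takes the interpolation weights of the topic models to be the normalized topic identification scores, and uses a fixed `general_weight`. All function and parameter names are hypothetical.

```python
from typing import Dict, List

Unigram = Dict[str, float]  # word -> probability (toy unigram language model)


def normalize(scores: Dict[str, float]) -> Dict[str, float]:
    """Turn non-negative topic scores into weights that sum to 1 (assumed scheme)."""
    total = sum(scores.values())
    if total <= 0:
        raise ValueError("topic scores must have a positive sum")
    return {name: value / total for name, value in scores.items()}


def interpolate(models: List[Unigram], weights: List[float]) -> Unigram:
    """Linear interpolation: P(w) = sum_i weights[i] * P_i(w)."""
    if len(models) != len(weights):
        raise ValueError("one weight per model is required")
    if abs(sum(weights) - 1.0) > 1e-9:
        raise ValueError("interpolation weights must sum to 1")
    vocab = set().union(*models)  # union of the models' vocabularies
    return {
        w: sum(lam * m.get(w, 0.0) for lam, m in zip(weights, models))
        for w in vocab
    }


def adapt(
    general: Unigram,
    topic_models: Dict[str, Unigram],
    topic_scores: Dict[str, float],
    general_weight: float,
) -> Unigram:
    """Build a context-dependent model, then interpolate it with the general model."""
    topic_weights = normalize(topic_scores)
    names = sorted(topic_weights)
    context = interpolate(
        [topic_models[n] for n in names],
        [topic_weights[n] for n in names],
    )
    return interpolate([general, context], [general_weight, 1.0 - general_weight])
```

In a real recognizer the models would be n-gram models with smoothing and back-off, and the weights would be estimated rather than fixed; the sketch only shows that a weighted sum of proper distributions, with weights summing to 1, is again a proper distribution.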
Once the theoretical basis of our contextualization framework has been defined, we propose its application within an automatic speech recognition system. To this end, we focus on two aspects: the contextualization of the language models used by the system and the incorporation of semantic information into the topic-based adaptation process. In this Thesis we propose an experimental framework based on a ‘two-stage’ recognition architecture. In the first stage, we use systems based on information retrieval and machine learning techniques to identify the topics discussed in a transcription of an audio segment. This transcription is generated by the recognition system using a general language model. According to the relevance of the topics identified, the dynamic adaptation of the language model is carried out. In the second stage of the recognition architecture, we use this adapted model to recognize the audio segment again. To determine the benefits of the proposed framework, we evaluate each of the main systems mentioned above. This evaluation is carried out on speeches in the political domain using the EPPS (European Parliamentary Plenary Sessions) database from the European TC-STAR project. We analyze different metrics of system performance and evaluate the proposed improvements with respect to the baseline systems.
ABSTRACT
The last decade has witnessed major advances in speech recognition technology. Today’s commercial systems are able to recognize continuous speech from numerous speakers, with acceptable levels of error and without the need for an explicit adaptation procedure. Despite this progress, speech recognition is far from being a solved problem.
Most of these systems are tuned to a particular domain, and their efficacy depends significantly, among many other aspects, on the similarity between the language model used and the task being addressed. This dependence is even more important in scenarios where the statistical properties of the language fluctuate over time, for example, in application domains involving spontaneous and multitopic speech. In recent years there has been an increasing effort to enhance speech recognition systems for such domains. This has been done, among other approaches, by means of automatic adaptation techniques. These techniques are applied to existing systems, especially since porting a system to a new task or domain may be both time-consuming and expensive. Adaptation techniques require additional sources of information, and spoken language can provide some of them. Speech not only conveys a message; it also provides information on the context in which the spoken communication takes place (e.g. on the subject being talked about). Therefore, when we communicate through speech, it could be feasible to identify the elements of the language that characterize the context and, at the same time, to track the changes that occur in those elements over time. This information can be extracted and exploited through information retrieval and machine learning techniques. This allows us, in developing more robust speech recognition systems, to enhance the adaptation of language models to the conditions of the context, thus strengthening the recognition system for domains under changing conditions (such as potential variations in vocabulary, style and topic).
In this sense, the main contribution of this Thesis is the proposal and evaluation of a framework of topic-motivated contextualization based on the dynamic and unsupervised adaptation of language models for the enhancement of an automatic speech recognition system. This adaptation is based on a combined approach (drawing on both the information retrieval and machine learning fields) whereby we identify the topics that are being discussed in an audio recording. Topic identification therefore enables the system to adapt the language model according to the contextual conditions. The proposed framework can be divided into two major systems: a topic identification system and a dynamic language model adaptation system. This Thesis can be outlined from the perspective of the particular contributions made in each of the fields that compose the proposed framework: • Regarding the topic identification system, we have focused on enhancing the document preprocessing techniques and on contributing to the definition of more robust criteria for the selection of index-terms. – In both information retrieval and machine learning based approaches, the effectiveness of topic identification systems depends, to a large extent, on the preprocessing mechanisms applied to the documents. Among the many operations that make up the preprocessing procedures, an adequate selection of index-terms is critical for establishing conceptual and semantic relationships between terms and documents. This process can also be weakened by a poor choice of stopwords or a lack of precision in defining stemming rules. In this regard we compare and evaluate different criteria for preprocessing the documents, as well as for improving the selection of the index-terms. This allows us not only to reduce the size of the indexing structure but also to strengthen the topic identification process.
– One of the most crucial aspects of the performance of topic identification systems is the assignment of different weights to different terms depending on their contribution to the content of the document. In this sense we evaluate and propose alternative approaches to traditional weighting schemes (such as tf-idf) that allow us to improve the specificity of terms and to better identify the topics that are related to documents. • Regarding the dynamic language model adaptation, we divide the contextualization process into different steps. – We propose supervised and unsupervised approaches for the generation of topic-based language models. The first is intended to generate topic-based language models by grouping the documents in the training set according to the original topic labels of the corpus. Nevertheless, a goal of this Thesis is to evaluate whether or not the use of these labels to generate language models is optimal in terms of recognition accuracy. For this reason, we propose a second, unsupervised approach, in which the objective is to group the data in the training set into automatic topic clusters based on the semantic similarity between the documents. By means of clustering approaches we expect to obtain a more cohesive association of the documents that are related by similar concepts, thus improving the coverage of the topic-based language models and enhancing the performance of the recognition system. – We develop various strategies to create a context-dependent language model. Our aim is for this model to reflect the semantic context of the current utterance, i.e. the most relevant topics that are being discussed. This model is generated by means of a linear interpolation between the topic-based language models related to the most relevant topics. The estimation of the interpolation weights is based mainly on the outcome of the topic identification process.
– Finally, we propose a methodology for the dynamic adaptation of a background language model. The adaptation process takes into account the context-dependent model as well as the information provided by the topic identification process. The scheme used for the adaptation is a linear interpolation between the background model and the context-dependent one. We also study different approaches to determine the interpolation weights used in this adaptation scheme. Having defined the basis of our topic-motivated contextualization framework, we propose its application within an automatic speech recognition system. We focus on two aspects: the contextualization of the language models used by the system, and the incorporation of semantic information into a topic-based adaptation process. To achieve this, we propose an experimental framework based on a 'two-stage' recognition architecture. In the first stage of the architecture, information retrieval and machine learning techniques are used to identify the topics in a transcription of an audio segment. This transcription is generated by the recognition system using a background language model. The dynamic language model adaptation is then carried out according to the confidence in the topics that have been identified. In the second stage of the recognition architecture, the adapted language model is used to re-decode the utterance. To test the benefits of the proposed framework, we evaluate each of the major systems mentioned above. The evaluation is conducted on speeches in the political domain using the EPPS (European Parliamentary Plenary Sessions) database from the European TC-STAR project. We analyse several performance metrics that allow us to compare the proposed systems against the baseline ones.
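The two interpolation steps described in this abstract can be illustrated with a minimal Python sketch. It is not the thesis implementation. It assumes add-one-smoothed unigram models over a toy vocabulary, hypothetical topic scores standing in for the output of the topic identification stage, and a fixed adaptation weight `lam`. All names, data and values are invented for illustration.

```python
from collections import Counter


def unigram_lm(tokens, vocab, alpha=1.0):
    """Add-alpha smoothed unigram model over a fixed vocabulary (a simplification)."""
    counts = Counter(tokens)
    total = len(tokens) + alpha * len(vocab)
    return {w: (counts[w] + alpha) / total for w in vocab}


def interpolate(models, weights):
    """Linear interpolation: P(w) = sum_i weights[i] * P_i(w)."""
    if abs(sum(weights) - 1.0) > 1e-9:
        raise ValueError("interpolation weights must sum to 1")
    vocab = models[0].keys()
    return {w: sum(lam * m[w] for lam, m in zip(weights, models)) for w in vocab}


def normalize(scores):
    """Turn non-negative topic scores into weights that sum to 1."""
    total = sum(scores)
    return [s / total for s in scores]


# Toy data (hypothetical): two topic-based models and one background model.
vocab = ["budget", "tax", "fishing", "quota", "the"]
topic_models = {
    "economy": unigram_lm(["budget", "tax", "the", "tax"], vocab),
    "fisheries": unigram_lm(["fishing", "quota", "the", "quota"], vocab),
}
background = unigram_lm(["the", "budget", "fishing", "the", "tax", "quota"], vocab)

# Hypothetical scores from the first recognition pass plus topic identification.
topic_scores = {"economy": 0.8, "fisheries": 0.2}

# Step 1: context-dependent model, interpolating the topic-based models.
names = list(topic_models)
weights = normalize([topic_scores[n] for n in names])
context_lm = interpolate([topic_models[n] for n in names], weights)

# Step 2: adapted model, interpolating the background and context-dependent models.
lam = 0.5  # assumed fixed here; the abstract says several weighting approaches are studied
adapted_lm = interpolate([background, context_lm], [1 - lam, lam])
```

In this toy setting the adapted model gives more probability to economy-related words such as "tax" than the background model does, which is the intended effect of adapting toward the identified topics before the second decoding pass.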
Resumo:
The standard Kratzerian analysis of modal auxiliaries, such as ‘may’ and ‘can’, takes them to be univocal and context-sensitive. Our first aim is to argue for an alternative view, on which such expressions are polysemous. Our second aim is to thereby shed light on the distinction between semantic context-sensitivity and polysemy. To achieve these aims, we examine the mechanisms of polysemy and context-sensitivity and provide criteria with which they can be held apart. We apply the criteria to modal auxiliaries and show that the default hypothesis should be that they are polysemous, and not merely context-sensitive. We then respond to arguments against modal ambiguity (and thus against polysemy). Finally, we show why modal polysemy has significant philosophical implications.