947 results for semantic annotation
Abstract:
In this project we present a method for generating pedestrian image databases, which are required to train or validate example-based learning systems, in a virtual environment. We developed a platform that simulates the navigation of a camera through a virtual scene and retrieves the video stream together with its ground truth. Using this platform eliminates the annotation process needed to obtain ground truth in real environments, and working in a virtual environment reduces costs.
Abstract:
Past multisensory experiences can influence current unisensory processing and memory performance. Repeated images are better discriminated if initially presented as auditory-visual pairs, rather than only visually. An experience's context thus plays a role in how well repetitions of certain aspects are later recognized. Here, we investigated factors during the initial multisensory experience that are essential for generating improved memory performance. Subjects discriminated repeated versus initial image presentations intermixed within a continuous recognition task. Half of initial presentations were multisensory, and all repetitions were only visual. Experiment 1 examined whether purely episodic multisensory information suffices for enhancing later discrimination performance by pairing visual objects with either tones or vibrations. We could therefore also assess whether effects can be elicited with different sensory pairings. Experiment 2 examined semantic context by manipulating the congruence between auditory and visual object stimuli within blocks of trials. Relative to images only encountered visually, accuracy in discriminating image repetitions was significantly impaired by auditory-visual, yet unaffected by somatosensory-visual multisensory memory traces. By contrast, this accuracy was selectively enhanced for visual stimuli with semantically congruent multisensory pasts and unchanged for those with semantically incongruent multisensory pasts. The collective results reveal opposing effects of purely episodic versus semantic information from auditory-visual multisensory events. Nonetheless, both types of multisensory memory traces are accessible for processing incoming stimuli and indeed result in distinct visual object processing, leading to either impaired or enhanced performance relative to unisensory memory traces. We discuss these results as supporting a model of object-based multisensory interactions.
Abstract:
The completion of the human genome sequence is a prerequisite for fully understanding the biology of human beings. With this project achieved, scientists had to face another challenging task: understanding the meaning of the 3 billion letters composing this genome. As a logical continuation of the human genome project, the ENCODE (ENCyclopedia Of DNA Elements) consortium was formed with the aim of annotating all its functional elements. These elements include transcribed regions, transcription factor binding sites, DNAse I hypersensitive sites and histone modification marks. In the frame of my PhD thesis, I was involved in two sub-projects of ENCODE. Firstly, I developed and optimized a high-throughput method to validate gene models experimentally, which allowed me to assess the quality of the most recent manually curated annotation. This novel experimental validation pipeline is extremely effective, far more so than transcriptome profiling through RNA sequencing, which is becoming the norm. This targeted RT-PCR-seq approach is likewise particularly efficient in identifying novel exons, as we discovered unannotated exons in about 10% of the loci interrogated. Secondly, I participated in a study aiming to identify the gene boundaries of all genes on human chromosomes 21 and 22. This study led to the large-scale identification of chimeric transcripts composed of sequences coming from two distinct genes that can map far away from each other.
Abstract:
While the dynamics of lexical-semantic and lexical-phonological encoding in word production have been investigated in several event-related potential (ERP) studies, the estimated time course of phonological-phonetic encoding is the result of rather indirect evidence. We investigated the dynamics of phonological-phonetic encoding combining ERP analyses covering the entire encoding process in picture naming and word reading tasks by comparing ERP modulations in eight brain-damaged speakers presenting impaired phonological-phonetic encoding relative to 16 healthy controls. ERPs diverged between groups in terms of local waveform amplitude and global topography at ∼400ms after stimulus onset in the picture naming task and at ∼320-350ms in word reading and sustained until 100ms before articulation onset. These divergences appeared in later time windows than those found in patients with underlying lexical-semantic and lexical-phonological impairment in previous studies, providing evidence that phonological-phonetic encoding is engaged around 400ms in picture naming and around 330ms in word reading.
Abstract:
This thesis investigated the spatio-temporal brain mechanisms of three processes involved in recognizing environmental sounds produced by living (animal vocalisations) and man-made (manufactured) objects: their discrimination, their plasticity, and the involvement of action representations. Results showed rapid brain discrimination between these categories beginning at ~70ms. Then, beginning at ~150ms, effects of plasticity were observed, without any influence of the categories of sounds. Both of these processes, discrimination and repetition priming, involved brain structures located in the temporal and frontal lobes. Activation of brain areas BA21 and BA22 suggests access to semantic representations and/or representations linked to object manipulation. To investigate the involvement of action representations in sound recognition, analyses were restricted to sounds produced by man-made objects. Results suggest access to representations linked to actions functionally related to the sound rather than to representations linked to the action that produced the sound. These effects occurred at ~300ms post-stimulus onset and involved differential activity in brain regions attributed to the mirror neuron system. These data are discussed with regard to motor preparation of actions functionally linked to sounds. Collectively, these data showed a sequential progression of cerebral activity underlying the recognition of environmental sounds. The processes occurred first in a shared network of brain areas before propagating elsewhere and/or leading to differential activity in these structures. The cerebral responses observed in this work made it possible to establish a dynamic model of the discrimination of sounds produced by living and man-made objects.
Abstract:
The Eukaryotic Promoter Database (EPD) is an annotated non-redundant collection of eukaryotic POL II promoters for which the transcription start site has been determined experimentally. Access to promoter sequences is provided by pointers to positions in nucleotide sequence entries. The annotation part of an entry includes a description of the initiation site mapping data, exhaustive cross-references to the EMBL nucleotide sequence database, SWISS-PROT, TRANSFAC and other databases, as well as bibliographic references. EPD is structured in a way that facilitates dynamic extraction of biologically meaningful promoter subsets for comparative sequence analysis. WWW-based interfaces have been developed that enable the user to view EPD entries in different formats, to select and extract promoter sequences according to a variety of criteria, and to navigate to related databases exploiting different cross-references. The EPD web site also features yearly updated base frequency matrices for major eukaryotic promoter elements. EPD can be accessed at http://www.epd.isb-sib.ch
Abstract:
This PhD project aims to study paraphrasing, initially understood as the different ways in which the same content is expressed linguistically. We will go into that concept in depth trying to define and delimit its scope more accurately. In that sense, we also aim to discover which kind of structures and phenomena it covers. Although there exist some paraphrasing typologies, the great majority of them only apply to English, and focus on lexical and syntactic transformations. Our intention is to go further into this subject and propose a paraphrasing typology for Spanish and Catalan combining lexical, syntactic, semantic and pragmatic knowledge. We apply a bottom-up methodology trying to collect evidence of this phenomenon from the data. For this purpose, we are initially using the Spanish Wikipedia as our corpus. The internal structure of this encyclopedia makes it a good resource for extracting paraphrasing examples for our investigation. This empirical approach will be complemented with the use of linguistic knowledge, and by comparing and contrasting our results to previously proposed paraphrasing typologies in order to enlarge the possible paraphrasing forms found in our corpus. The fact that the same content can be expressed in many different ways presents a major challenge for Natural Language Processing (NLP) applications. Thus, research on paraphrasing has recently been attracting increasing attention in the fields of NLP and Computational Linguistics. The results obtained in this investigation would be of great interest in many of these applications.
Abstract:
The aim of this study is to analyse how some of the general monolingual dictionaries of Spanish published in the last ten years have treated the phenomenon of lexical collocations, that is, those word combinations which, from the standpoint of the norm, show certain combinatory restrictions, essentially semantic in nature, imposed by usage (Corpas: 1996). The dictionaries analysed were: the "Diccionario Salamanca de la lengua española", edited by Juan Gutiérrez (1996); the "Diccionario del español actual", by Manuel Seco, Olimpia Andrés and Gabino Ramos (1999); the twenty-second edition of the RAE dictionary (2001); and the "Gran diccionario de uso del español actual. Basado en el corpus Cumbre", edited by Aquilino Sánchez (2001). Our study is based on a corpus of 52 lexical collocations compiled from an analysis of the subentries listed under the letter "b" in each of the selected dictionaries. We then examined the entries for each of the elements that make up the collocation (base and collocate) to see whether the dictionaries studied record these same combinations in other parts of the dictionary entry, such as the definitions or the examples. In analysing the lexicographic information we focused on four aspects: a) the information contained in the front matter of each work; b) the placement of collocations within the dictionary entry; c) the assignment of the collocation to a particular entry; and d) grammatical labelling.
Abstract:
Signature databases are vital tools for identifying distant relationships in novel sequences and hence for inferring protein function. InterPro is an integrated documentation resource for protein families, domains and functional sites, which amalgamates the efforts of the PROSITE, PRINTS, Pfam and ProDom database projects. Each InterPro entry includes a functional description, annotation, literature references and links back to the relevant member database(s). Release 2.0 of InterPro (October 2000) contains over 3000 entries, representing families, domains, repeats and sites of post-translational modification encoded by a total of 6804 different regular expressions, profiles, fingerprints and Hidden Markov Models. Each InterPro entry lists all the matches against SWISS-PROT and TrEMBL (more than 1,000,000 hits from 462,500 proteins in SWISS-PROT and TrEMBL). The database is accessible for text- and sequence-based searches at http://www.ebi.ac.uk/interpro/. Questions can be emailed to interhelp@ebi.ac.uk.
Abstract:
In 1927 M. R. James published Latin Infancy Gospels, identified by him in two related but not identical manuscripts (one the British Library Arundel 404; the other from Hereford), together with a parallel text from the Irish manuscript known as the Leabhar Breac. Later research brought to light more manuscripts of this Latin work, and also of the Irish text. James recognized that his apocryphal Latin Infancy text was compiled from a combination of the Protevangelium of James and a hitherto unknown text which he named "The Source". Recent research has identified a full Latin translation of the Protevangelium of James. A hitherto unrecognized Irish Infancy Narrative has also been identified in the Dublin manuscript known as the Liber Flavus Fergusiorum. An in-depth study of this related tradition was called for. This has been carried out over the past ten years by an Irish team in conjunction with Professor Daniel Kaestli and AELAC. The fruits of this labour are published in these two volumes. Volume 13 has a general introduction with a historical sketch of New Testament apocrypha in Ireland and a history of research on the subject. This is followed by a comparison of the Infancy Narratives in the Leabhar Breac and the Liber Flavus Fergusiorum. There are special introductions to these Infancy texts, followed by critical editions of the Irish texts, accompanied by English translations and rich annotation. Next there is similar treatment of the Irish versified Narrative (from ca. 700) of the Childhood Deeds of Jesus (commonly known as the Infancy Narrative (or Gospel) of Thomas). There is then (in volume 14, but with continuous pagination) the edition and translation of an Irish thirteenth-century poem with elements from Infancy Narratives, and both Latin and Irish texts on the wonders at Christ's birth, accompanied by translations and notes.
The edition of the Irish material is followed by a critical edition of the full Arundel and Hereford forms of the Infancy Narrative (here referred to as the "J Compilation"), together with a detailed study of all the questions relating to this work. The volume concludes with a critical edition (by Rita Beyers) of the Latin text of the Protevangelium of James, accompanied by a detailed study of the work. The work contains a detailed study of the Latin translations of the Protevangelium of James and the transmission of this work in the West. The "J Compilation" (a combination of the Protevangelium and texts of Pseudo-Matthew) can be traced back in manuscript transmission to ca. 800, and must have originated some time earlier. Behind it stands an earlier "I Compilation" ("I" for Irish) without influence from Pseudo-Matthew, the form found in the Irish witnesses. It is argued that M. R. James's "Source" may be of Judaeo-Christian origin and may really be the Gospel of the Nazoreans. Among the indexes there is a list of all the Irish words found in the texts. This edition of the Irish and related Latin texts is a major contribution to the study of the apocryphal Infancy Narratives. It should also be of particular interest to Celtic scholars, to students of Irish ecclesiastical learning, and in general to all medievalists.
Abstract:
The respective roles of the medial temporal lobe (MTL) structures in memory are controversial. Some authors put forward a modular account according to which episodic memory and recollection-based processes are crucially dependent on the hippocampal formation whereas semantic acquisition and familiarity-based processes rely on the adjacent parahippocampal gyri. Others defend a unitary view. We report the case of VJ, a boy with developmental amnesia of most likely perinatal onset diagnosed at the age of 8. Magnetic resonance imaging (MRI), including quantitative volumetric measurements of the hippocampal formation and of the entorhinal, perirhinal, and temporopolar cortices, showed severe, bilateral atrophy of the hippocampal formation, fornix and mammillary bodies; by contrast, the perirhinal cortex was within normal range and the entorhinal and temporopolar cortices remained within two standard deviations (SDs) of the controls' mean. We examined the development of his semantic knowledge from childhood to his teenage years as well as his recognition and cued recall memory abilities. On tasks tapping semantic memory, VJ increased his raw scores across years at the same rate as children from large standardisation samples, except for one task; he achieved average performance, consistent with his socio-educational background. He performed within normal range on 74% of recognition tests and achieved average to above average scores on 42% of them despite very severe impairment on 82% of episodic recall tasks. Both faces and landscapes-scenes gave rise to above average scores when tested with coloured stimuli. Cued recall, although impaired, was largely superior to free recall. This case supports a modular account of the MTL with episodic, but not semantic, memory depending on the hippocampal formation.
Furthermore, the overall pattern of findings is consistent with evidence from both studies of brain-damaged patients and neuroimaging studies indicating that recollection requires an intact hippocampal formation and familiarity relies, at least partly, on the adjacent temporal lobe cortex.
Abstract:
Because we live in an extremely complex social environment, people require the ability to memorize hundreds or thousands of social stimuli. The aim of this study was to investigate the effect of multiple repetitions on the processing of names and faces varying in terms of pre-experimental familiarity. We measured both behavioral and electrophysiological responses to self-, famous and unknown names and faces in three phases of the experiment (in every phase, each type of stimuli was repeated a pre-determined number of times). We found that the negative brain potential in posterior scalp sites observed approximately 170 ms after the stimulus onset (N170) was insensitive to pre-experimental familiarity but showed slight enhancement with each repetition. The negative wave in the inferior-temporal regions observed at approximately 250 ms (N250) was affected by both pre-experimental (famous>unknown) and intra-experimental familiarity (the more repetitions, the larger N250). In addition, N170 and N250 for names were larger in the left inferior-temporal region, whereas right-hemispheric or bilateral patterns of activity for faces were observed. The subsequent presentations of famous and unknown names and faces were also associated with higher amplitudes of the positive waveform in the central-parietal sites analyzed in the 320-900 ms time-window (P300). In contrast, P300 remained unchanged after the subsequent presentations of self-name and self-face. Moreover, the P300 for unknown faces grew more quickly than for unknown names. The latter suggests that the process of learning faces is more effective than learning names, possibly because faces carry more semantic information.
Abstract:
In the semantic analysis of tenses, the term event is frequently used. For example, following REICHENBACH (1947, 288), many studies mention three "points", within which (E) represents the "point of the event". Besides, like MARTIN (1985, 25), it is commonly considered, often implicitly, that "the duration of the utterance can [...] ideally be reduced to an instant t0 [i.e. a point], [...] since within the utterance, truth conditions remain unaltered". However, events as well as utterances take time (cf. the use of "intervals" by GOSSELIN 1996). Here, I will analyze Present tense utterances such as "le ballon franchit la ligne" (the ball crosses/is crossing the line), for which the described event ("achievement" for VENDLER 1957; "réalisation instantanée" for VETTERS 1996) is shorter than the utterance that mentions it. I will show why the telic character (GAREY 1957) of achievements, unlike the other types of processes, makes it difficult to express an event contemporary to speech time since, against MARTIN's (1985, 25) idealization, truth conditions vary throughout the utterance. Taking encoding as the basis (somewhat following LEVELT 1989), I will argue that truth-condition variation can naturally lead to the over-represented use of past tenses (Passé composé in French) in child language for the expression of telic events during early acquisition (cf. e.g. WAGNER 2009).