94 resultados para Phonological and orthographic overlap

em Université de Lausanne, Switzerland


Relevância:

100.00% 100.00%

Publicador:

Resumo:

Ullman (2004) suggested that Specific Language Impairment (SLI) results from a general procedural learning deficit. In order to test this hypothesis, we investigated children with SLI via procedural learning tasks exploring the verbal, motor, and cognitive domains. Results showed that compared with a Control Group, the children with SLI (a) were unable to learn a phonotactic learning task, (b) were able but less efficiently to learn a motor learning task and (c) succeeded in a cognitive learning task. Regarding the motor learning task (Serial Reaction Time Task), reaction times were longer and learning slower than in controls. The learning effect was not significant in children with an associated Developmental Coordination Disorder (DCD), and future studies should consider comorbid motor impairment in order to clarify whether impairments are related to the motor rather than the language disorder. Our results indicate that a phonotactic learning but not a cognitive procedural deficit underlies SLI, thus challenging Ullmans' general procedural deficit hypothesis, like a few other recent studies.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

A nosological issue that has yet to be resolved relates to the diagnostic and clinical overlap of schizophrenia and schizoaffective disorder. Thus, the aim of this study was to compare, within a treated epidemiological cohort of first episode patients, the clinical characteristics of patients with schizophrenia (FES) or schizoaffective disorder (FESA). Medical fi le audit methodology was employed to collect information on 704 first episode psychosis patients (FEP), among which 283 patients had a fi nal diagnosis of FES and 64 patients with a fi nal diagnosis of FESA. These patients were treated at the Early Psychosis Prevention and Intervention Centre (EPPIC), Melbourne, Australia. Patients with FES were signifi cantly more likely to have a longer prodrome (P = .020), longer duration of untreated psychosis (P < .001), and earlier age of onset (P = .004) compared to FESA. At service entry, FESA patients had more severe levels of psychopathology (P = .020), which was due to the presence of manic symptoms (P < .001); consequently, requiring a greater number of inpatient admissions (P = .017). At discharge, depressive symptoms were more severe in those with FESA (P = .011). There are signifi cant differences in the phenomenology of schizophrenia and schizoaffective disorder during early illness course; supporting the notion that these are two discernable disorders.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Anaplastic large cell lymphoma (ALCL) is a main type of T-cell lymphomas and comprises three distinct entities: systemic anaplastic lymphoma kinase (ALK) positive, systemic ALK(-) and cutaneous ALK(-) ALCL (cALCL). Little is known about their pathogenesis and their cellular origin, and morphological and immunophenotypical overlap exists between ALK(-) ALCL and classical Hodgkin lymphoma (cHL). We conducted gene expression profiling of microdissected lymphoma cells of five ALK(+) and four ALK(-) systemic ALCL, seven cALCL and sixteen cHL, and of eight subsets of normal T and NK cells. The analysis supports a derivation of ALCL from activated T cells, but the lymphoma cells acquired a gene expression pattern hampering an assignment to a CD4(+), CD8(+) or CD30(+) T-cell origin. Indeed, ALCL display a down-modulation of many T-cell characteristic molecules. All ALCL types show significant expression of NFkappaB target genes and upregulation of genes involved in oncogenesis (e.g. EZH2). Surprisingly, few genes are differentially expressed between systemic and cALCL despite their different clinical behaviour, and between ALK(-) ALCL and cHL despite their different cellular origin. ALK(+) ALCL are characterized by expression of genes regulated by pathways constitutively activated by ALK. This study provides multiple novel insights into the molecular biology and pathogenesis of ALCL.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Abstract : Understanding how biodiversity is distributed is central to any conservation effort and has traditionally been based on niche modeling and the causal relationship between spatial distribution of organisms and their environment. More recently, the study of species' evolutionary history and relatedness has permeated the fields of ecology and conservation and, coupled with spatial predictions, provides useful insights to the origin of current biodiversity patterns, community structuring and potential vulnerability to extinction. This thesis explores several key ecological questions by combining the fields of niche modeling and phylogenetics and using important components of southern African biodiversity. The aims of this thesis are to provide comparisons of biodiversity measures, to assess how climate change will affect evolutionary history loss, to ask whether there is a clear link between evolutionary history and morphology and to investigate the potential role of relatedness in macro-climatic niche structuring. The first part of my thesis provides a fine scale comparison and spatial overlap quantification of species richness and phylogenetic diversity predictions for one of the most diverse plant families in the Cape Floristic Region (CFR), the Proteaceae. In several of the measures used, patterns do not match sufficiently to argue that species relatedness information is implicit in species richness patterns. The second part of my thesis predicts how climate change may affect threat and potential extinction of southern African animal and plant taxa. I compare present and future niche models to assess whether predicted species extinction will result in higher or lower V phylogenetic diversity survival than what would be experienced under random extinction processes. l find that predicted extinction will result in lower phylogenetic diversity survival but that this non-random pattern will be detected only after a substantial proportion of the taxa in each group has been lost. The third part of my thesis explores the relationship between phylogenetic and morphological distance in southern African bats to assess whether long evolutionary histories correspond to equally high levels of morphological variation, as predicted by a neutral model of character evolution. I find no such evidence; on the contrary weak negative trends are detected for this group, as well as in simulations of both neutral and convergent character evolution. Finally, I ask whether spatial and climatic niche occupancy in southern African bats is influenced by evolutionary history or not. I relate divergence time between species pairs to climatic niche and range overlap and find no evidence for clear phylogenetic structuring. I argue that this may be due to particularly high levels of micro-niche partitioning. Résumé : Comprendre la distribution de la biodiversité représente un enjeu majeur pour la conservation de la nature. Les analyses se basent le plus souvent sur la modélisation de la niche écologique à travers l'étude des relations causales entre la distribution spatiale des organismes et leur environnement. Depuis peu, l'étude de l'histoire évolutive des organismes est également utilisée dans les domaines de l'écologie et de la conservation. En combinaison avec la modélisation de la distribution spatiale des organismes, cette nouvelle approche fournit des informations pertinentes pour mieux comprendre l'origine des patterns de biodiversité actuels, de la structuration des communautés et des risques potentiels d'extinction. Cette thèse explore plusieurs grandes questions écologiques, en combinant les domaines de la modélisation de la niche et de la phylogénétique. Elle s'applique aux composants importants de la biodiversité de l'Afrique australe. Les objectifs de cette thèse ont été l) de comparer différentes mesures de la biodiversité, 2) d'évaluer l'impact des changements climatiques à venir sur la perte de diversité phylogénétique, 3) d'analyser le lien potentiel entre diversité phylogénétique et diversité morphologique et 4) d'étudier le rôle potentiel de la phylogénie sur la structuration des niches macro-climatiques des espèces. La première partie de cette thèse fournit une comparaison spatiale, et une quantification du chevauchement, entre des prévisions de richesse spécifique et des prédictions de la diversité phylogénétique pour l'une des familles de plantes les plus riches en espèces de la région floristique du Cap (CFR), les Proteaceae. Il résulte des analyses que plusieurs mesures de diversité phylogénétique montraient des distributions spatiales différentes de la richesse spécifique, habituellement utilisée pour édicter des mesures de conservation. La deuxième partie évalue les effets potentiels des changements climatiques attendus sur les taux d'extinction d'animaux et de plantes de l'Afrique australe. Pour cela, des modèles de distribution d'espèces actuels et futurs ont permis de déterminer si l'extinction des espèces se traduira par une plus grande ou une plus petite perte de diversité phylogénétique en comparaison à un processus d'extinction aléatoire. Les résultats ont effectivement montré que l'extinction des espèces liées aux changements climatiques pourrait entraîner une perte plus grande de diversité phylogénétique. Cependant, cette perte ne serait plus grande que celle liée à un processus d'extinction aléatoire qu'à partir d'une forte perte de taxons dans chaque groupe. La troisième partie de cette thèse explore la relation entre distances phylogénétiques et morphologiques d'espèces de chauves-souris de l'Afrique australe. ll s'agit plus précisément de déterminer si une longue histoire évolutive correspond également à des variations morphologiques plus grandes dans ce groupe. Cette relation est en fait prédite par un modèle neutre d'évolution de caractères. Aucune évidence de cette relation n'a émergé des analyses. Au contraire, des tendances négatives ont été détectées, ce qui représenterait la conséquence d'une évolution convergente entre clades et des niveaux élevés de cloisonnement pour chaque clade. Enfin, la dernière partie présente une étude sur la répartition de la niche climatique des chauves-souris de l'Afrique australe. Dans cette étude je rapporte temps de divergence évolutive (ou deux espèces ont divergé depuis un ancêtre commun) au niveau de chevauchement de leurs niches climatiques. Les résultats n'ont pas pu mettre en évidence de lien entre ces deux paramètres. Les résultats soutiennent plutôt l'idée que cela pourrait être I dû à des niveaux particulièrement élevés de répartition de la niche à échelle fine.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Background:  The relationship between phoneme awareness, rapid automatized naming (RAN), verbal short-term/working memory (ST/WM) and diagnostic category is investigated in control and dyslexic children, and the extent to which this depends on orthographic complexity. Methods:  General cognitive, phonological and literacy skills were tested in 1,138 control and 1,114 dyslexic children speaking six different languages spanning a large range of orthographic complexity (Finnish, Hungarian, German, Dutch, French, English). Results:  Phoneme deletion and RAN were strong concurrent predictors of developmental dyslexia, while verbal ST/WM and general verbal abilities played a comparatively minor role. In logistic regression models, more participants were classified correctly when orthography was more complex. The impact of phoneme deletion and RAN-digits was stronger in complex than in less complex orthographies. Conclusions:  Findings are largely consistent with the literature on predictors of dyslexia and literacy skills, while uniquely demonstrating how orthographic complexity exacerbates some symptoms of dyslexia.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This report presents systematic empirical annotation of transcript products from 399 annotated protein-coding loci across the 1% of the human genome targeted by the Encyclopedia of DNA elements (ENCODE) pilot project using a combination of 5' rapid amplification of cDNA ends (RACE) and high-density resolution tiling arrays. We identified previously unannotated and often tissue- or cell-line-specific transcribed fragments (RACEfrags), both 5' distal to the annotated 5' terminus and internal to the annotated gene bounds for the vast majority (81.5%) of the tested genes. Half of the distal RACEfrags span large segments of genomic sequences away from the main portion of the coding transcript and often overlap with the upstream-annotated gene(s). Notably, at least 20% of the resultant novel transcripts have changes in their open reading frames (ORFs), most of them fusing ORFs of adjacent transcripts. A significant fraction of distal RACEfrags show expression levels comparable to those of known exons of the same locus, suggesting that they are not part of very minority splice forms. These results have significant implications concerning (1) our current understanding of the architecture of protein-coding genes; (2) our views on locations of regulatory regions in the genome; and (3) the interpretation of sequence polymorphisms mapping to regions hitherto considered to be "noncoding," ultimately relating to the identification of disease-related sequence alterations.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Recent theory of physiology of language suggests a dual stream dorsal/ventral organization of speech perception. Using intra-cerebral Event-related potentials (ERPs) during pre-surgical assessment of twelve drug-resistant epileptic patients, we aimed to single out electrophysiological patterns during both lexical-semantic and phonological monitoring tasks involving ventral and dorsal regions respectively. Phonological information processing predominantly occurred in the left supra-marginal gyrus (dorsal stream) and lexico-semantic information occurred in anterior/middle temporal and fusiform gyri (ventral stream). Similar latencies were identified in response to phonological and lexico-semantic tasks, suggesting parallel processing. Typical ERP components were strongly left lateralized since no evoked responses were recorded in homologous right structures. Finally, ERP patterns suggested the inferior frontal gyrus as the likely final common pathway of both dorsal and ventral streams. These results brought out detailed evidence of the spatial-temporal information processing in the dual pathways involved in speech perception.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

How phenomena like helping, dispersal, or the sex ratio evolve depends critically on demographic and life-history factors. One phenotype that is of particular interest to biologists is genomic imprinting, which results in parent-of-origin-specific gene expression and thus deviates from the predictions of Mendel's rules. The most prominent explanation for the evolution of genomic imprinting, the kinship theory, originally specified that multiple paternity can cause the evolution of imprinting when offspring affect maternal resource provisioning. Most models of the kinship theory do not detail how population subdivision, demography, and life history affect the evolution of imprinting. In this work, we embed the classic kinship theory within an island model of population structure and allow for diverse demographic and life-history features to affect the direction of selection on imprinting. We find that population structure does not change how multiple paternity affects the evolution of imprinting under the classic kinship theory. However, if the degree of multiple paternity is not too large, we find that sex-specific migration and survival and generation overlap are the primary factors determining which allele is silenced. This indicates that imprinting can evolve purely as a result of sex-related asymmetries in the demographic structure or life history of a species.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Peripheral T-cell lymphomas (PTCLs) represent a heterogeneous group of more than 20 neoplastic entities derived from mature T cells and natural killer (NK) cells involved in innate and adaptive immunity. With few exceptions these malignancies, which may present as disseminated, predominantly extranodal or cutaneous, or predominantly nodal diseases, are clinically aggressive and have a dismal prognosis. Their diagnosis and classification is hampered by several difficulties, including a significant morphological and immunophenotypic overlap across different entities, and the lack of characteristic genetic alterations for most of them. Although there is increasing evidence that the cell of origin is a major determinant for the delineation of several PTCL entities, however, the cellular derivation of most entities remains poorly characterized and/or may be heterogeneous. The complexity of the biology and pathophysiology of PTCLs has been only partly deciphered. In recent years, novel insights have been gained from genome-wide profiling analyses. In this review, we will summarize the current knowledge on the pathobiological features of peripheral NK/T-cell neoplasms, with a focus on selected disease entities manifesting as tissue infiltrates primarily in extranodal sites and lymph nodes.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

BACKGROUND: The diagnostic and clinical overlap between schizophrenia and schizoaffective disorder is an important nosological issue in psychiatry that is yet to be resolved. The aim of this study was to compare the clinical and functional characteristics of an epidemiological treated cohort of first episode patients with an 18-month discharge diagnosis of schizophrenia (FES) or schizoaffective disorder (FESA). METHODS: This study was part of the larger First Episode Psychosis Outcome Study (FEPOS) which involved a medical file audit study of all 786 patients treated at the Early Psychosis Prevention and Intervention Centre between 1998 and 2000. Of this cohort, 283 patients had an 18-month discharge diagnosis of FES and 64 had a diagnosis of FESA. DSM-IV diagnoses and clinical and functional ratings were derived and validated by two consultant psychiatrists. RESULTS: Compared to FES patients, those with FESA were significantly more likely to have a later age of onset (p=.004), longer prodrome (p=.020), and a longer duration of untreated psychosis (p<.001). At service entry, FESA patients presented with a higher illness severity (p=.020), largely due to the presence of more severe manic symptoms (p<.001). FESA patients also had a greater number of subsequent inpatient admissions (p=.017), had more severe depressive symptoms (p=.011), and higher levels of functioning at discharge. DISCUSSION: The findings support the notion that these might be considered two discernable disorders; however, further research is required to ascertain the ways and extent to which these disorders are discriminable at presentation and over time.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The genetic aetiology of congenital hypopituitarism (CH) is not entirely elucidated. FGFR1 and PROKR2 loss-of-function mutations are classically involved in hypogonadotrophic hypogonadism (HH), however, due to the clinical and genetic overlap of HH and CH; these genes may also be involved in the pathogenesis of CH. Using a candidate gene approach, we screened 156 Brazilian patients with combined pituitary hormone deficiencies (CPHD) for loss-of-function mutations in FGFR1 and PROKR2. We identified three FGFR1 variants (p.Arg448Trp, p.Ser107Leu and p.Pro772Ser) in four unrelated patients (two males) and two PROKR2 variants (p.Arg85Cys and p.Arg248Glu) in two unrelated female patients. Five of the six patients harbouring the variants had a first-degree relative that was an unaffected carrier of it. Results of functional studies indicated that the new FGFR1 variant p.Arg448Trp is a loss-of-function variant, while p.Ser107Leu and p.Pro772Ser present signalling activity similar to the wild-type form. Regarding PROKR2 variants, results from previous functional studies indicated that p.Arg85Cys moderately compromises receptor signalling through both MAPK and Ca(2) (+) pathways while p.Arg248Glu decreases calcium mobilization but has normal MAPK activity. The presence of loss-of-function variants of FGFR1 and PROKR2 in our patients with CPHD is indicative of an adjuvant and/or modifier effect of these rare variants on the phenotype. The presence of the same variants in unaffected relatives implies that they cannot solely cause the phenotype. Other associated genetic and/or environmental modifiers may play a role in the aetiology of this condition.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The complexity of mammalian genome organization demands a complex interplay of DNA and proteins to orchestrate proper gene regulation. CTCF, a highly conserved, ubiquitously expressed protein has been postulated as a primary organizer of genome architecture because of its roles in transcriptional activation/repression, insulation and imprinting. Diverse regulatory functions are exerted through genome wide binding via a central eleven zinc finger DNA binding domain and an array of diverse protein-protein interactions through N- and C- terminal domains. CTCFL has been identified as a paralog of CTCF expressed only in spermatogenic cells of the testis. CTCF and CTCFL have a highly homologous DNA-binding domain, while the flanking amino acid sequences exhibit no significant similarity. Genome- wide mapping of CTCF binding sites has been carried out in many cell types, but no data exist for CTCFL apart from a few identified loci. The lack of high quality antibodies prompted us to generate an endogenously flag-tagged CTCFL mouse model using BAC recombination. IHC staining using anti-flag antibodies confirmed CTCFL localization to type Β spermatogonia and preleptotene spermatocytes and a mutually exclusive pattern of expression with CTCF. ChIP followed by high-throughput sequencing identified 10,382 binding sites showing 70% overlap but representing only 20% of CTCF sites. Consensus sequence analysis identified a significantly longer binding motif with prominently less ambiguity of base calling at every position. The significant difference between CTCF and CTCFL genomic binding patterns proposes that their binding to DNA is differentially regulated. Analysis of CTCFL binding to methylated regions on a genome wide scale identified approximately 1,000 loci. Methylation-independent binding of CTCFL might be at least one of the mechanisms that ensures distinct binding patterns of CTCF and CTCFL since CTCF binding is methylation- sensitive. Co-localization of CTCF with cohesin has been well established and analysis of CTCFL and SMC3 overlap identified around 3,300 binding sites from which two related but distinct consensus sequence motifs were derived. Because virtually all data for cohesin binding originate from mitotically proliferating cells, the anticipated overlap is expected to be considerably higher in meiotic cells. Meiosis-specific cohesin subunit Rec8 is specific for spermatocytes and 6 out of the 12 identified binding sites are also bound by CTCFL. In conclusion, this was the first genome-wide mapping of CTCFL binding sites in spermatocytes, the only cell type where CTCF is not expressed. CTCFL has a unique binding site repertoire distinct from CTCF, binds to methylated sequences and shows a significant overlap with cohesin binding sites. Future efforts will be oriented towards deciphering the role CTCFL plays in conversion of chromatin structure and function from mitotic to meiotic chromosomes. - La complexité de l'organisation du génome des mammifères exige une interaction particulière entre ADN et protéines pour orchestrer une régulation appropriée de l'expression des gènes. CTCFL, une protéine ubiquitaire très conservée, serait le principal organisateur de l'architecture du génome de par son rôle dans l'activation / la répression de la transcription, la protection et la localisation des gènes. Diverses régulations sont opérées, d'une part au travers d'interactions à différents endroits du génome par le biais d'un domaine protéique central de liaison à l'ADN à onze doigts de zinc, et d'autre part par des interactions protéine-protéine variées au niveau de leur domaine N- et C-terminal. CTCFL a été identifié comme un paralogue de CTCF exprimé uniquement dans les cellules spermatiques du testicule. CTCFL et CTCF ont un domaine de liaison à l'ADN très homologue, tandis que les séquences d'acides aminés situées de part et d'autre de ce domaine ne présentent aucune similitude. Une cartographie générale des sites de liaison au CTCF a été réalisée pour de nombreux types cellulaires, mais il n'existe aucune donnée pour CTCFL à l'exception de l'identification de quelques loci. L'absence d'anticorps de bonne qualité nous a conduit à générer un modèle murin portant un CTCFL endogène taggué grâce à un procédé de recombinaison BAC. Une coloration IHC à l'aide d'anticorps anti-FLAG a confirmé la présence de CTCFL au niveau des spermatogonies de type Β et des spermatocytes au stade préleptotène, et une distribution mutuellement exclusive avec CTCF. Une méthode de Chromatine Immunoprecipitation (ChIP) suivie d'un séquençage à haut débit a permis d'identifier 10.382 sites de liaison montrant 70% d'homologie mais ne représentant que 20% des sites CTCF. L'analyse de la séquence consensus révèle un motif de fixation à l'ADN nettement plus long et qui comporte bien moins de bases aléatoires à chaque position nucléotidique. La différence significative entre les séquences génomiques des sites de liaison au CTCF et CTCFL suggère que leur fixation à l'ADN est régulée différemment. Appliquée à l'échelle du génome, l'étude de l'interaction de CTCFL avec des régions méthylées de l'ADN a permis d'identifier environ 1.000 loci. Contrairement à CTCFL, la liaison de CTCF dépend de l'état de méthylation de l'ADN ; cette modification épigénétique constitue donc au moins un des mécanismes de régulation expliquant une localisation de CTCF et CTCFL à des sites distincts du génome. La co- localisation de CTCF avec la cohésine étant établie, l'analyse de la superposition des séquences de CTCFL avec la sous-unité SMC3 identifie environ 3.300 sites de liaison parmi lesquels deux mêmes motifs consensus distincts par leur séquence sont mis en évidence. La presque quasi-totalité des données sur la cohésine ayant été établie à partir de cellules en prolifération mitotique, il est probable que la similitude au sein des séquences consensus soit encore plus grande dans le cas des cellules en méiose. La sous-unité Rec8 de la cohésine propre à l'état de méiose est spécifiquement exprimée dans les spermatocytes. Or 6 des 12 sites de liaison identifiés sont également utilisés par CTCFL. Pour conclure, ce travail constitue la première cartographie à l'échelle du génome des sites de liaison de CTCFL dans les spermatocytes, seul type cellulaire où CTCFL n'est pas exprimé. CTCFL possède un répertoire unique de sites de fixation à l'ADN distinct de CTCF, se lie à des séquences méthylées et présente un nombre important de sites de liaison communs avec la cohésine. Les perspectives futures sont d'élucider le rôle de CTCFL dans le remodelage de la structure de la chromatine et de définir sa fonction dans le processus de méiose.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

RÉSUMÉ Cette thèse porte sur le développement de méthodes algorithmiques pour découvrir automatiquement la structure morphologique des mots d'un corpus. On considère en particulier le cas des langues s'approchant du type introflexionnel, comme l'arabe ou l'hébreu. La tradition linguistique décrit la morphologie de ces langues en termes d'unités discontinues : les racines consonantiques et les schèmes vocaliques. Ce genre de structure constitue un défi pour les systèmes actuels d'apprentissage automatique, qui opèrent généralement avec des unités continues. La stratégie adoptée ici consiste à traiter le problème comme une séquence de deux sous-problèmes. Le premier est d'ordre phonologique : il s'agit de diviser les symboles (phonèmes, lettres) du corpus en deux groupes correspondant autant que possible aux consonnes et voyelles phonétiques. Le second est de nature morphologique et repose sur les résultats du premier : il s'agit d'établir l'inventaire des racines et schèmes du corpus et de déterminer leurs règles de combinaison. On examine la portée et les limites d'une approche basée sur deux hypothèses : (i) la distinction entre consonnes et voyelles peut être inférée sur la base de leur tendance à alterner dans la chaîne parlée; (ii) les racines et les schèmes peuvent être identifiés respectivement aux séquences de consonnes et voyelles découvertes précédemment. L'algorithme proposé utilise une méthode purement distributionnelle pour partitionner les symboles du corpus. Puis il applique des principes analogiques pour identifier un ensemble de candidats sérieux au titre de racine ou de schème, et pour élargir progressivement cet ensemble. Cette extension est soumise à une procédure d'évaluation basée sur le principe de la longueur de description minimale, dans- l'esprit de LINGUISTICA (Goldsmith, 2001). L'algorithme est implémenté sous la forme d'un programme informatique nommé ARABICA, et évalué sur un corpus de noms arabes, du point de vue de sa capacité à décrire le système du pluriel. Cette étude montre que des structures linguistiques complexes peuvent être découvertes en ne faisant qu'un minimum d'hypothèses a priori sur les phénomènes considérés. Elle illustre la synergie possible entre des mécanismes d'apprentissage portant sur des niveaux de description linguistique distincts, et cherche à déterminer quand et pourquoi cette coopération échoue. Elle conclut que la tension entre l'universalité de la distinction consonnes-voyelles et la spécificité de la structuration racine-schème est cruciale pour expliquer les forces et les faiblesses d'une telle approche. ABSTRACT This dissertation is concerned with the development of algorithmic methods for the unsupervised learning of natural language morphology, using a symbolically transcribed wordlist. It focuses on the case of languages approaching the introflectional type, such as Arabic or Hebrew. The morphology of such languages is traditionally described in terms of discontinuous units: consonantal roots and vocalic patterns. Inferring this kind of structure is a challenging task for current unsupervised learning systems, which generally operate with continuous units. In this study, the problem of learning root-and-pattern morphology is divided into a phonological and a morphological subproblem. The phonological component of the analysis seeks to partition the symbols of a corpus (phonemes, letters) into two subsets that correspond well with the phonetic definition of consonants and vowels; building around this result, the morphological component attempts to establish the list of roots and patterns in the corpus, and to infer the rules that govern their combinations. We assess the extent to which this can be done on the basis of two hypotheses: (i) the distinction between consonants and vowels can be learned by observing their tendency to alternate in speech; (ii) roots and patterns can be identified as sequences of the previously discovered consonants and vowels respectively. The proposed algorithm uses a purely distributional method for partitioning symbols. Then it applies analogical principles to identify a preliminary set of reliable roots and patterns, and gradually enlarge it. This extension process is guided by an evaluation procedure based on the minimum description length principle, in line with the approach to morphological learning embodied in LINGUISTICA (Goldsmith, 2001). The algorithm is implemented as a computer program named ARABICA; it is evaluated with regard to its ability to account for the system of plural formation in a corpus of Arabic nouns. This thesis shows that complex linguistic structures can be discovered without recourse to a rich set of a priori hypotheses about the phenomena under consideration. It illustrates the possible synergy between learning mechanisms operating at distinct levels of linguistic description, and attempts to determine where and why such a cooperation fails. It concludes that the tension between the universality of the consonant-vowel distinction and the specificity of root-and-pattern structure is crucial for understanding the advantages and weaknesses of this approach.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The Gronnedal-Ika complex is dominated by layered nepheline syenites which were intruded by a xenolithic syenite and a central plug of calcite to calcite-siderite carbonatite. Aegirine-augite, alkali feldspar and nepheline are the major mineral phases in the syenites, along with rare calcite. Temperatures of 680-910degreesC and silica activities of 0.28-0.43 were determined for the crystallization of the syenites on the basis of mineral equilibria. Oxygen fugacities, estimated using titanomagnetite compositions, were between 2 and 5 log units above the fayalite-magnetite-quartz buffer during the magmatic stage. Chondrite-normalized REE patterns of magmatic calcite in both carbonatites and syenites are characterized by REE enrichment (La-CN-Yb-CN = 10-70). Calcite from the carbonatites has higher Ba (similar to5490 ppm) and lower HREE concentrations than calcite from the syenites (54-106 ppm Ba). This is consistent with the behavior of these elements during separation of immiscible silicate-carbonate liquid pairs. epsilon(Nd)(T = 1.30 Ga) values of clinopyroxenes from the syenites vary between +1.8 and +2.8, and epsilon(Nd)(T) values of whole-rock carbonatites range from +2.4 to +2.8. Calcite from the carbonatites has delta(18)O values of 7.8 to 8.6parts per thousand and delta(13)C values of -3.9 to -4.6parts per thousand. delta(18)O values of clinopyroxene separates from the nepheline syenites range between 4.2 and 4.9parts per thousand. The average oxygen isotopic composition of the nepheline syenitic melt was calculated based on known rock-water and mineral-water isotope fractionation to be 5.7 +/- 0.4parts per thousand. Nd and C-O isotope compositions are typical for mantle-derived rocks and do not indicate significant crustal assimilation for either syenite or carbonatite magmas. The difference in delta(18)O between calculated syenitic melts and carbonatites, and the overlap in epsilon(Nd) values between carbonatites and syenites, are consistent with derivation of the carbonatites from the syenites via liquid immiscibility.