7 resultados para Encyclopedias and dictionaries, Italian.
em Helda - Digital Repository of University of Helsinki
Resumo:
The present research is an investigation into the corpus of personal names and titles that are found in sources from the Middle Mongolian period, that is the time from the 13th to the beginning of the 15th century. The entry for every name or title has been divided into three parts: occurence(s) of a given name in Middle Mongolian sources (primary sources), etymology, and occurence(s) in sources other than Middle Mongolian (secondary sources). Culturally and lingistically the corpus can be divided into six sub-groups: Mongolian, Turkic (Old, Middle and Modern), Arabo-Persian (Islamic), Indo-Iranian and Tibetan (Buddhist), as well as Chinese. Among these, the largest group is formed by Mongolian and Turkic, followed by Chinese (mostly titles), Indo-Iranian, Arabo-Persian and Tibetan. With regard to the primary and secondary occurences the research is based mainly on primary sources including text-publications and dictionaries. Every name or title is documented as completely as possible within a Central Asian framework. However, due to the divergency of the sources available as well as diachronical importance, each sub-group has been dealt with slightly differently, but consistently. The corpus of investigated names and titles gives a fairly correct picture of the multi-ethnical composition of the Mongolian world-empire. It also shows the foreign influences on Mongolian names and titles, being in this respect a mirror of the influences that are visible in other parts of the Middle Mongolian culture too. Furthermore, the investigated corpus reflects the transitory stage of the 13th to 15th century in Central Asian history, and includes thus material from the past (Indo-Iranian, Old and Middle Turkic), and material that points to the future (Arabo-Persian, Tibetan, Modern Turkic).
Resumo:
There are numerous formats for writing spellcheckers for open-source systems and there are many descriptions for languages written in these formats. Similarly, for word hyphenation by computer there are TEX rules for many languages. In this paper we demonstrate a method for converting these spell-checking lexicons and hyphenation rule sets into finite-state automata, and present a new finite-state based system for writer’s tools used in current open-source software such as Firefox, OpenOffice.org and enchant via the spell-checking library voikko.
Resumo:
The work is based on the assumption that words with similar syntactic usage have similar meaning, which was proposed by Zellig S. Harris (1954,1968). We study his assumption from two aspects: Firstly, different meanings (word senses) of a word should manifest themselves in different usages (contexts), and secondly, similar usages (contexts) should lead to similar meanings (word senses). If we start with the different meanings of a word, we should be able to find distinct contexts for the meanings in text corpora. We separate the meanings by grouping and labeling contexts in an unsupervised or weakly supervised manner (Publication 1, 2 and 3). We are confronted with the question of how best to represent contexts in order to induce effective classifiers of contexts, because differences in context are the only means we have to separate word senses. If we start with words in similar contexts, we should be able to discover similarities in meaning. We can do this monolingually or multilingually. In the monolingual material, we find synonyms and other related words in an unsupervised way (Publication 4). In the multilingual material, we ?nd translations by supervised learning of transliterations (Publication 5). In both the monolingual and multilingual case, we first discover words with similar contexts, i.e., synonym or translation lists. In the monolingual case we also aim at finding structure in the lists by discovering groups of similar words, e.g., synonym sets. In this introduction to the publications of the thesis, we consider the larger background issues of how meaning arises, how it is quantized into word senses, and how it is modeled. We also consider how to define, collect and represent contexts. We discuss how to evaluate the trained context classi?ers and discovered word sense classifications, and ?nally we present the word sense discovery and disambiguation methods of the publications. This work supports Harris' hypothesis by implementing three new methods modeled on his hypothesis. The methods have practical consequences for creating thesauruses and translation dictionaries, e.g., for information retrieval and machine translation purposes. Keywords: Word senses, Context, Evaluation, Word sense disambiguation, Word sense discovery.
Resumo:
In this study I look at what people want to express when they talk about time in Russian and Finnish, and why they use the means they use. The material consists of expressions of time: 1087 from Russian and 1141 from Finnish. They have been collected from dictionaries, usage guides, corpora, and the Internet. An expression means here an idiomatic set of words in a preset form, a collocation or construction. They are studied as lexical entities, without a context, and analysed and categorized according to various features. The theoretical background for the study includes two completely different approaches. Functional Syntax is used in order to find out what general meanings the speaker wishes to convey when talking about time and how these meanings are expressed in specific languages. Conceptual metaphor theory is used for explaining why the expressions are as they are, i.e. what kind of conceptual metaphors (transfers from one conceptual domain to another) they include. The study has resulted in a grammatically glossed list of time expressions in Russian and Finnish, a list of 56 general meanings involved in these time expressions and an account of the means (constructions) that these languages have for expressing the general meanings defined. It also includes an analysis of conceptual metaphors behind the expressions. The general meanings involved turned out to revolve around expressing duration, point in time, period of time, frequency, sequence, passing of time, suitable time and the right time, life as time, limitedness of time, and some other notions having less obvious semantic relations to the others. Conceptual metaphor analysis of the material has shown that time is conceptualized in Russian and Finnish according to the metaphors Time Is Space (Time Is Container, Time Has Direction, Time Is Cycle, and the Time Line Metaphor), Time Is Resource (and its submapping Time Is Substance), Time Is Actor; and some characteristics are added to these conceptualizations with the help of the secondary metaphors Time Is Nature and Time Is Life. The limits between different conceptual metaphors and the connections these metaphors have with one another are looked at with the help of the theory of conceptual integration (the blending theory) and its schemas. The results of the study show that although Russian and Finnish are typologically different, they are very similar both in the needs of expression their speakers have concerning time, and in the conceptualizations behind expressing time. This study introduces both theoretical and methodological novelties in the nature of material used, in developing empirical methodology for conceptual metaphor studies, in the exactness of defining the limits of different conceptual metaphors, and in seeking unity among the different facets of time. Keywords: time, metaphor, time expression, idiom, conceptual metaphor theory, functional syntax, blending theory
Resumo:
Celiac disease, or gluten intolerance, is triggered by dietary glutens in genetically susceptible individuals and it affects approximately 1% of the Caucasian population. The best known genetic risk factors for celiac disease are HLA DQ2 and DQ8 heterodimers, which are necessary for the development of the disease. However, they alone are not sufficient for disease induction, other risk factors are required. This thesis investigated genetic factors for celiac disease, concentrating on susceptibility loci on chromosomes 5q31-q33, 19p13 and 2q12 previously reported in genome-wide linkage and association studies. In addition, a novel genotyping method for the detection of HLA DQ2 and DQ8 coding haplotypes was validated. This study was conducted using Finnish and Hungarian family materials, and Finnish, Hungarian and Italian case-control materials. Genetic linkage and association were analysed in these materials using candidate gene and fine-mapping approaches. The results confirmed linkage to celiac disease on the chromosomal regions 5q31-q33 and 19p13. Fine-mapping on chromosome 5q31-q33 revealed several modest associations in the region, and highlighted the need for further investigations to locate the causal risk variants. The MYO9B gene on chromosome 19p13 showed evidence for linkage and association particularly with dermatitis herpetiformis, the skin manifestation of celiac disease. This implies a potential difference in the genetic background of the intestinal and skin forms of the disease, although studies on larger samplesets are required. The IL18RAP locus on chromosome 2q12, shown to be associated with celiac disease in a previous genome-wide association study and a subsequent follow-up, showed association in the Hungarian population in this study. The expression of IL18RAP was further investigated in small intestinal tissue and in peripheral blood mononuclear cells. The results showed that IL18RAP is expressed in the relevant tissues. Two putative isoforms of IL18RAP were detected by Western blot analysis, and the results suggested that the ratios and total levels of these isoforms may contribute to the aetiology of celiac disease. A novel genotyping method for celiac disease-associated HLA haplotypes was also validated in this thesis. The method utilises single-nucleotide polymorphisms tagging these HLA haplotypes with high sensitivity and specificity. Our results suggest that this method is transferable between populations, and it is suitable for large-scale analysis. In conclusion, this doctorate study provides an insight into the roles of the 5q31-q33, MYO9B, IL18RAP and HLA loci in the susceptibility to celiac disease in the Finnish, Hungarian and Italian populations, highlighting the need for further studies at these genetic loci and examination of the function of the candidate genes.
Resumo:
Disadvantages of invariable cereal cropping, concern of nutrient leaching and prices of nitrogen (N) fertilizer have all increased during last decades. An undersown crop, which grows together with a main crop and after harvest, could mitigate all those questions. The aim of this study was to develop undersowing in Finnish conditions, so that it suits for spring cereal farming as well as possible and enhances taking care of soil and environment, especially when control of N is concerned. In total, 17 plant species were undersown in spring cereals during the field experiments between 1991-1999 at four sites in South and Central Finland, but after selection, eight of them were studied more thoroughly. Two legumes, one grass species and one mixture of them were included in long-term trials in order to study annually repeated undersowing. Further, simultaneous broadcasting of seeds instead of separate undersowing was studied. Grain yield response and the capacity of the undersown crop to absorb soil N or fix N from atmosphere, and the release of N were of greatest interest. Seeding rates of undersown crops and N fertilization rates during annually repeated undersowing were also studied. Italian ryegrass (Lolium multiflorum Lam., IR) absorbed soil nitrate N (NO3-N) most efficiently in autumn and timothy (Phleum pratense L.) in spring. The capacity of other grass species to absorb N was low, or it was insufficient considering the negative effect on grain yield. Red clover (Trifolium pratense L.) and white clover (Trifolium repens L.) suited well in annually repeated undersowing, supplying fixed N for cereals without markedly increased risk of N leaching. Autumn oriented growth rhythm of the studied legumes was optimal for undersowing, whereas the growth rhythm of grasses was less suited but varied between species. A model of adaptive undersowing system was outlined in order to emphasize allocation of measures according needs. After defining the goal of undersowing, many decisions are to be done. When diminishing N leaching is primarily sought, a mixture of IR and timothy is advantageous. Clovers suit for replacing N fertilization, as the positive residual effect is greater than the negative effect caused by competition. A mixture of legume and non legume is a good choice when increased diversity is the main target. Seeding rate is an efficient means for adjusting competition and N effects. Broadcasting with soil covering equipment can be used to establish an undersown crop. In addition, timing and method of cover crop termination have an important role in the outcome. Continuous observing of the system is needed as for instance conditions significantly affect growth of undersown crop and on the other hand N release from crop residues may increase in long run.