28 resultados para linguistic corpora


Relevância:

20.00% 20.00%

Publicador:

Resumo:

Linguistic modelling is a rather new branch of mathematics that is still undergoing rapid development. It is closely related to fuzzy set theory and fuzzy logic, but knowledge and experience from other fields of mathematics, as well as other fields of science including linguistics and behavioral sciences, is also necessary to build appropriate mathematical models. This topic has received considerable attention as it provides tools for mathematical representation of the most common means of human communication - natural language. Adding a natural language level to mathematical models can provide an interface between the mathematical representation of the modelled system and the user of the model - one that is sufficiently easy to use and understand, but yet conveys all the information necessary to avoid misinterpretations. It is, however, not a trivial task and the link between the linguistic and computational level of such models has to be established and maintained properly during the whole modelling process. In this thesis, we focus on the relationship between the linguistic and the mathematical level of decision support models. We discuss several important issues concerning the mathematical representation of meaning of linguistic expressions, their transformation into the language of mathematics and the retranslation of mathematical outputs back into natural language. In the first part of the thesis, our view of the linguistic modelling for decision support is presented and the main guidelines for building linguistic models for real-life decision support that are the basis of our modeling methodology are outlined. From the theoretical point of view, the issues of representation of meaning of linguistic terms, computations with these representations and the retranslation process back into the linguistic level (linguistic approximation) are studied in this part of the thesis. We focus on the reasonability of operations with the meanings of linguistic terms, the correspondence of the linguistic and mathematical level of the models and on proper presentation of appropriate outputs. We also discuss several issues concerning the ethical aspects of decision support - particularly the loss of meaning due to the transformation of mathematical outputs into natural language and the issue or responsibility for the final decisions. In the second part several case studies of real-life problems are presented. These provide background and necessary context and motivation for the mathematical results and models presented in this part. A linguistic decision support model for disaster management is presented here – formulated as a fuzzy linear programming problem and a heuristic solution to it is proposed. Uncertainty of outputs, expert knowledge concerning disaster response practice and the necessity of obtaining outputs that are easy to interpret (and available in very short time) are reflected in the design of the model. Saaty’s analytic hierarchy process (AHP) is considered in two case studies - first in the context of the evaluation of works of art, where a weak consistency condition is introduced and an adaptation of AHP for large matrices of preference intensities is presented. The second AHP case-study deals with the fuzzified version of AHP and its use for evaluation purposes – particularly the integration of peer-review into the evaluation of R&D outputs is considered. In the context of HR management, we present a fuzzy rule based evaluation model (academic faculty evaluation is considered) constructed to provide outputs that do not require linguistic approximation and are easily transformed into graphical information. This is achieved by designing a specific form of fuzzy inference. Finally the last case study is from the area of humanities - psychological diagnostics is considered and a linguistic fuzzy model for the interpretation of outputs of multidimensional questionnaires is suggested. The issue of the quality of data in mathematical classification models is also studied here. A modification of the receiver operating characteristics (ROC) method is presented to reflect variable quality of data instances in the validation set during classifier performance assessment. Twelve publications on which the author participated are appended as a third part of this thesis. These summarize the mathematical results and provide a closer insight into the issues of the practicalapplications that are considered in the second part of the thesis.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The National Library of Finland realizes the Digitization Project of Kindred Languages in 2012–15. The project is financially supported by the Kone Foundation. During this project the National Library of Finland has digitized and made available approximately 1200 monograph and more than 100 newspaper titles in several Uralic languages. The materials are available to both researchers and citizens in the National Library’s Fenno-Ugrica collection. The project will produce digitized materials in the Uralic languages as well as their development tools to support linguistic research and citizen science. The resulting materials will constitute the largest resource for the Uralic languages in the world. Through this project, researchers will gain access to corpora which they have not been able to study before and to which all users will have open access regardless of their place of residence. In my presentation, I will discuss 1) how we utilized the social media (Facebook, Twitter, VKontakte etc) to gain audience for our collection and 2) how the needs of researchers and laymen were met in crowdsourcing.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Ensikielen jälkeen opittavan kielen tutkimusta ja suomi toisena kielenä alaa sen osana ovat koko niiden olemassaolon ajan hallinneet samat peruskysymykset: millaista oppiminen on eri vaiheissa ja eri ympäristöissä, sekä mikä oppimisessa on yleistä ja toisaalta mikä riippuu opittavasta kielestä ja oppijoiden kielitaustasta. Sähköisten oppijankielen tutkimusaineistojen eli korpusten lisääntymisen myötä tutkijat voivat aiempaa helpommin tutkia näitä ilmiöitä määrällisesti ja tarkastella oppijankielen sisäistä vaihtelua ja sen suhdetta tyypilliseen ensikieliseen kielenkäyttöön kielen eri osa-alueilla käyttöpohjaisesti eli todelliseen kielenkäyttöön pohjautuen. Tekninen kehitys on tuonut mukanaan aineisto- eli korpusvetoisuuden kaltaisia uusia tapoja lähestyä tutkimusaineistoa, jolloin tyypillisiä tutkimuskysymyksiä ”Miksi?” ja ”Miten?” edeltää kysymys: ”Mikä?”. Tässä väitöskirjassa tarkastellaan edistyneiden suomenoppijoiden kirjoitettua akateemista kieltä ja suhteutetaan suomen oppimiselle ominaisia seikkoja käyttöpohjaisen mallin perusolettamuksiin. Aineisto on suomea toisena kielenä käyttävien opiskelijoiden tenttivastauksia, ja se on osa Edistyneiden suomenoppijoiden korpusta. Tutkimus on osin metodologinen, sillä väitöskirjassa esitellään ja siinä sovelletaan uutta korpusvetoista avainrakenneanalyysi-menetelmää, jonka avulla aineistoa lähestytään ilman hypoteeseja siitä, mitkä kielen ilmiöt ovat ominaisia edistyneelle oppijansuomelle. Tutkimus kuuluu kieliopin tutkimuksen piiriin, ja se nojaa kognitiivisen konstruktiokieliopin ajatukseen abstraktiudeltaan vaihtelevista konstruktioista kielijärjestelmän perusyksiköinä. Tulokset puoltavat menetelmän sovellettavuutta kielen oppimisen tutkimukseen, sillä sen avulla kyettiin tunnistamaan konstruktioita, jotka erottavat edistyneitä oppijoita ensikielisistä kirjoittajista (esim. modaaliset verbiketjut), eri ensikieliä puhuvia suomenoppijoita (esim. konjunktiot) sekä konstruktioita, joiden käyttö muuttuu ajan kuluessa (esim. preteriti ja preesens). Monet havaitut erot ovat akateemisen kirjoittamisen erityispiirteitä, mikä tukee ajatusta kielen käyttö- ja kontekstikohtaisesta oppimisesta. Tuloksia voidaan yhtäältä soveltaa akateemisen kielitaidon opetuksessa. Toisaalta menetelmää voidaan käyttää kielenoppimisen tutkimuksen ohella uusien näkökulmien kartoittamiseksi erilaisten tai eri-ikäisten tekstien tyypillisten ominaisuuksien ja erojen tutkimuksessa.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Principen om nationalismen där det politiska och det nationella är samspelt kan vara av markant betydelse för uppbyggande av autonomiska regimer. Likaså tillåter decentralicering och delegering av befogenheter för språk och utbildning (officiellt erkännande av språk, standardisering av språk, undervisningsspråk och relaterade läroplaner) formning av identiteter inom dessa autonomiska regimer. Resultatet är en ofullkomlig cirkulär relation där språk, samfund och politiska institutioner ömsesidigt och kontinuerligt formar varandra: lingvistiskt mångfald prägar och formger autonomiska ordningar och vice-versa. De juridiska implikationerna av territoriella och icke-territoriella former av autonomi är dock av en annan art. Emedan territoriell autonomi bygger på idéen om ett eventuellt inkluderande hemland för lingvistiska grupper, vars vistelseort är avgörande, förstärker den icke-territoriella autonomin idéen om ett exclusivt samfund bestående av själv-identifierade medlemmar som är kapabla till självstyre oavsett territoriella gränser. Denna avhandling utgör an analys av sådana juridiska implikationer genom komparativa och institutionella analyser. Avhandlingen föreslår som resultat en serie av normativa och pragmatiska rekommendationer inriktade på att främja demokratiseringsprocesser i linje med principer om multikulturalism.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

There are more than 7000 languages in the world, and many of these have emerged through linguistic divergence. While questions related to the drivers of linguistic diversity have been studied before, including studies with quantitative methods, there is no consensus as to which factors drive linguistic divergence, and how. In the thesis, I have studied linguistic divergence with a multidisciplinary approach, applying the framework and quantitative methods of evolutionary biology to language data. With quantitative methods, large datasets may be analyzed objectively, while approaches from evolutionary biology make it possible to revisit old questions (related to, for example, the shape of the phylogeny) with new methods, and adopt novel perspectives to pose novel questions. My chief focus was on the effects exerted on the speakers of a language by environmental and cultural factors. My approach was thus an ecological one, in the sense that I was interested in how the local environment affects humans and whether this human-environment connection plays a possible role in the divergence process. I studied this question in relation to the Uralic language family and to the dialects of Finnish, thus covering two different levels of divergence. However, as the Uralic languages have not previously been studied using quantitative phylogenetic methods, nor have population genetic methods been previously applied to any dialect data, I first evaluated the applicability of these biological methods to language data. I found the biological methodology to be applicable to language data, as my results were rather similar to traditional views as to both the shape of the Uralic phylogeny and the division of Finnish dialects. I also found environmental conditions, or changes in them, to be plausible inducers of linguistic divergence: whether in the first steps in the divergence process, i.e. dialect divergence, or on a large scale with the entire language family. My findings concerning Finnish dialects led me to conclude that the functional connection between linguistic divergence and environmental conditions may arise through human cultural adaptation to varying environmental conditions. This is also one possible explanation on the scale of the Uralic language family as a whole. The results of the thesis bring insights on several different issues in both a local and a global context. First, they shed light on the emergence of the Finnish dialects. If the approach used in the thesis is applied to the dialects of other languages, broader generalizations may be drawn as to the inducers of linguistic divergence. This again brings us closer to understanding the global patterns of linguistic diversity. Secondly, the quantitative phylogeny of the Uralic languages, with estimated times of language divergences, yields another hypothesis as to the shape and age of the language family tree. In addition, the Uralic languages can now be added to the growing list of language families studied with quantitative methods. This will allow broader inferences as to global patterns of language evolution, and more language families can be included in constructing the tree of the world’s languages. Studying history through language, however, is only one way to illuminate the human past. Therefore, thirdly, the findings of the thesis, when combined with studies of other language families, and those for example in genetics and archaeology, bring us again closer to an understanding of human history.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Today’s international business in highly related to crossing national, cultural and linguistic borders making communication and linguistic skills a vital part of the trade. The purpose of the study is to understand the role of linguistic skills in trust creation in international business relationships. Subobjectives are to discuss the importance of linguistic skills in international business context, to evaluate the strategic value of trust in business relationships and to analyze the extent to which linguistic skills affect trust formation. The scope is restricted to business-to-business markets. The theoretical background consists of different theories and previous studies related to trust and linguistic skills. Based on the theory a new LTS-framework is created to demonstrate a process model of linguistic skills affecting trust creation in international B2B relationships. This study is qualitative using interviews as a data collection method. Altogether eleven interviews were conducted between October 2014 and February 2015. All of the interviewees worked for organizations operating in the field of international business in B2B markets, spoke multiple languages and had a lot of experience in sales and negotiations. This study confirms that linguistic skills are an important part of international business. In many organizations English is used as lingua franca. However, there are several benefits of speaking the mother tongue of the customer. It makes people feel more relaxed and it makes the relationship more intimate and allows to continue developing it at a more personal level. From the strategic point of view trust creates competitive advantage to a company adding strategic value to the business. The data also supported the view that linguistic skills definitely impact the trust formation process. Quickness and easiness could be stated as the main benefits. It was seen that trust forms faster because both parties understand each other better and they become more open about information sharing within a shorter period of time. These findings and the importance of linguistic skills in trust creation should be acknowledged by organizations, especially regarding the human resource management. Boundary spanners are in key positions so special attention should be put into hiring and educating employees which then take care of company’s relationships. Eventually, these benefits are economical and affect to the profitability of the organization

Relevância:

20.00% 20.00%

Publicador:

Resumo:

In an increasingly multilingual world, English language has kept a marked predominance as a global language. In many countries, English is the primary choice for foreign language learning. There is a long history of research in English language learning. The same applies for research in reading. A main interest since the 1970s has been the reading strategy defined as inferencing or guessing the meaning of unknown words from context. Inferencing has ben widely researched, however, the results and conclusions seem to be mixed. While some agree that inferencing is a useful strategy, others doubt its usefulness. Nevertheless, most of the research seem to agree that the cultural background affects comprehension and inferencing. While most of these studies have been done with texts and contexts created by the researches, little has been done using natural prose. The present study will attempt to further clarify the process of inferencing and the effects of the text’s cultural context and the linguistic background of the reader using a text that has not been created by the researcher. The participants of the study are 40 international students from Turku, Finland. Their linguistic background was obtained through a questionnaire and proved to be diverse. Think aloud protocols were performed to investigate their inferencing process and find connections between their inferences, comments, the text, and their linguistic background. The results show that: some inferences were made based on the participants’ world knowledge, experience, other languages, and English language knowledge; other inferences and comments were made based on the text, its use of language and vocabulary, and few cues provided by the author. The results from the present study and previous research seem to show that: 1) linguistic background is a source of information for inferencing but is not a major source; 2) the cultural context of the text affected the inferences made by the participants according to their closeness or distance from it.

Relevância:

10.00% 10.00%

Publicador: