83 resultados para linguistic variation


Relevância:

20.00% 20.00%

Publicador:

Resumo:

The emerging technologies have recently challenged the libraries to reconsider their role as a mere mediator between the collections, researchers, and wider audiences (Sula, 2013), and libraries, especially the nationwide institutions like national libraries, haven’t always managed to face the challenge (Nygren et al., 2014). In the Digitization Project of Kindred Languages, the National Library of Finland has become a node that connects the partners to interplay and work for shared goals and objectives. In this paper, I will be drawing a picture of the crowdsourcing methods that have been established during the project to support both linguistic research and lingual diversity. The National Library of Finland has been executing the Digitization Project of Kindred Languages since 2012. The project seeks to digitize and publish approximately 1,200 monograph titles and more than 100 newspapers titles in various, and in some cases endangered Uralic languages. Once the digitization has been completed in 2015, the Fenno-Ugrica online collection will consist of 110,000 monograph pages and around 90,000 newspaper pages to which all users will have open access regardless of their place of residence. The majority of the digitized literature was originally published in the 1920s and 1930s in the Soviet Union, and it was the genesis and consolidation period of literary languages. This was the era when many Uralic languages were converted into media of popular education, enlightenment, and dissemination of information pertinent to the developing political agenda of the Soviet state. The ‘deluge’ of popular literature in the 1920s to 1930s suddenly challenged the lexical orthographic norms of the limited ecclesiastical publications from the 1880s onward. Newspapers were now written in orthographies and in word forms that the locals would understand. Textbooks were written to address the separate needs of both adults and children. New concepts were introduced in the language. This was the beginning of a renaissance and period of enlightenment (Rueter, 2013). The linguistically oriented population can also find writings to their delight, especially lexical items specific to a given publication, and orthographically documented specifics of phonetics. The project is financially supported by the Kone Foundation in Helsinki and is part of the Foundation’s Language Programme. One of the key objectives of the Kone Foundation Language Programme is to support a culture of openness and interaction in linguistic research, but also to promote citizen science as a tool for the participation of the language community in research. In addition to sharing this aspiration, our objective within the Language Programme is to make sure that old and new corpora in Uralic languages are made available for the open and interactive use of the academic community as well as the language societies. Wordlists are available in 17 languages, but without tokenization, lemmatization, and so on. This approach was verified with the scholars, and we consider the wordlists as raw data for linguists. Our data is used for creating the morphological analyzers and online dictionaries at the Helsinki and Tromsø Universities, for instance. In order to reach the targets, we will produce not only the digitized materials but also their development tools for supporting linguistic research and citizen science. The Digitization Project of Kindred Languages is thus linked with the research of language technology. The mission is to improve the usage and usability of digitized content. During the project, we have advanced methods that will refine the raw data for further use, especially in the linguistic research. How does the library meet the objectives, which appears to be beyond its traditional playground? The written materials from this period are a gold mine, so how could we retrieve these hidden treasures of languages out of the stack that contains more than 200,000 pages of literature in various Uralic languages? The problem is that the machined-encoded text (OCR) contains often too many mistakes to be used as such in research. The mistakes in OCRed texts must be corrected. For enhancing the OCRed texts, the National Library of Finland developed an open-source code OCR editor that enabled the editing of machine-encoded text for the benefit of linguistic research. This tool was necessary to implement, since these rare and peripheral prints did often include already perished characters, which are sadly neglected by the modern OCR software developers, but belong to the historical context of kindred languages and thus are an essential part of the linguistic heritage (van Hemel, 2014). Our crowdsourcing tool application is essentially an editor of Alto XML format. It consists of a back-end for managing users, permissions, and files, communicating through a REST API with a front-end interface—that is, the actual editor for correcting the OCRed text. The enhanced XML files can be retrieved from the Fenno-Ugrica collection for further purposes. Could the crowd do this work to support the academic research? The challenge in crowdsourcing lies in its nature. The targets in the traditional crowdsourcing have often been split into several microtasks that do not require any special skills from the anonymous people, a faceless crowd. This way of crowdsourcing may produce quantitative results, but from the research’s point of view, there is a danger that the needs of linguists are not necessarily met. Also, the remarkable downside is the lack of shared goal or the social affinity. There is no reward in the traditional methods of crowdsourcing (de Boer et al., 2012). Also, there has been criticism that digital humanities makes the humanities too data-driven and oriented towards quantitative methods, losing the values of critical qualitative methods (Fish, 2012). And on top of that, the downsides of the traditional crowdsourcing become more imminent when you leave the Anglophone world. Our potential crowd is geographically scattered in Russia. This crowd is linguistically heterogeneous, speaking 17 different languages. In many cases languages are close to extinction or longing for language revitalization, and the native speakers do not always have Internet access, so an open call for crowdsourcing would not have produced appeasing results for linguists. Thus, one has to identify carefully the potential niches to complete the needed tasks. When using the help of a crowd in a project that is aiming to support both linguistic research and survival of endangered languages, the approach has to be a different one. In nichesourcing, the tasks are distributed amongst a small crowd of citizen scientists (communities). Although communities provide smaller pools to draw resources, their specific richness in skill is suited for complex tasks with high-quality product expectations found in nichesourcing. Communities have a purpose and identity, and their regular interaction engenders social trust and reputation. These communities can correspond to research more precisely (de Boer et al., 2012). Instead of repetitive and rather trivial tasks, we are trying to utilize the knowledge and skills of citizen scientists to provide qualitative results. In nichesourcing, we hand in such assignments that would precisely fill the gaps in linguistic research. A typical task would be editing and collecting the words in such fields of vocabularies where the researchers do require more information. For instance, there is lack of Hill Mari words and terminology in anatomy. We have digitized the books in medicine, and we could try to track the words related to human organs by assigning the citizen scientists to edit and collect words with the OCR editor. From the nichesourcing’s perspective, it is essential that altruism play a central role when the language communities are involved. In nichesourcing, our goal is to reach a certain level of interplay, where the language communities would benefit from the results. For instance, the corrected words in Ingrian will be added to an online dictionary, which is made freely available for the public, so the society can benefit, too. This objective of interplay can be understood as an aspiration to support the endangered languages and the maintenance of lingual diversity, but also as a servant of ‘two masters’: research and society.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The National Library of Finland is implementing the Digitization Project of Kindred Languages in 2012–16. Within the project we will digitize materials in the Uralic languages as well as develop tools to support linguistic research and citizen science. Through this project, researchers will gain access to new corpora 329 and to which all users will have open access regardless of their place of residence. Our objective is to make sure that the new corpora are made available for the open and interactive use of both the academic community and the language societies as a whole. The project seeks to digitize and publish approximately 1200 monograph titles and more than 100 newspapers titles in various Uralic languages. The digitization will be completed by the early of 2015, when the Fenno-Ugrica collection would contain around 200 000 pages of editable text. The researchers cannot spend so much time with the material that they could retrieve a satisfactory amount of edited words, so the participation of a crowd in editing work is needed. Often the targets in crowdsourcing have been split into several microtasks that do not require any special skills from the anonymous people, a faceless crowd. This way of crowdsourcing may produce quantitative results, but from the research’s point of view, there is a danger that the needs of linguistic research are not necessarily met. Also, the number of pages is too high to deal with. The remarkable downside is the lack of shared goal or social affinity. There is no reward in traditional methods of crowdsourcing. Nichesourcing is a specific type of crowdsourcing where tasks are distributed amongst a small crowd of citizen scientists (communities). Although communities provide smaller pools to draw resources, their specific richness in skill is suited for the complex tasks with high-quality product expectations found in nichesourcing. Communities have purpose, identity and their regular interactions engenders social trust and reputation. These communities can correspond to research more precisely. Instead of repetitive and rather trivial tasks, we are trying to utilize the knowledge and skills of citizen scientists to provide qualitative results. Some selection must be made, since we are not aiming to correct all 200,000 pages which we have digitized, but give such assignments to citizen scientists that would precisely fill the gaps in linguistic research. A typical task would editing and collecting the words in such fields of vocabularies, where the researchers do require more information. For instance, there’s a lack of Hill Mari words in anatomy. We have digitized the books in medicine and we could try to track the words related to human organs by assigning the citizen scientists to edit and collect words with OCR editor. From the nichesourcing’s perspective, it is essential that the altruism plays a central role, when the language communities involve. Upon the nichesourcing, our goal is to reach a certain level of interplay, where the language communities would benefit on the results. For instance, the corrected words in Ingrian will be added onto the online dictionary, which is made freely available for the public and the society can benefit too. This objective of interplay can be understood as an aspiration to support the endangered languages and the maintenance of lingual diversity, but also as a servant of “two masters”, the research and the society.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Principen om nationalismen där det politiska och det nationella är samspelt kan vara av markant betydelse för uppbyggande av autonomiska regimer. Likaså tillåter decentralicering och delegering av befogenheter för språk och utbildning (officiellt erkännande av språk, standardisering av språk, undervisningsspråk och relaterade läroplaner) formning av identiteter inom dessa autonomiska regimer. Resultatet är en ofullkomlig cirkulär relation där språk, samfund och politiska institutioner ömsesidigt och kontinuerligt formar varandra: lingvistiskt mångfald prägar och formger autonomiska ordningar och vice-versa. De juridiska implikationerna av territoriella och icke-territoriella former av autonomi är dock av en annan art. Emedan territoriell autonomi bygger på idéen om ett eventuellt inkluderande hemland för lingvistiska grupper, vars vistelseort är avgörande, förstärker den icke-territoriella autonomin idéen om ett exclusivt samfund bestående av själv-identifierade medlemmar som är kapabla till självstyre oavsett territoriella gränser. Denna avhandling utgör an analys av sådana juridiska implikationer genom komparativa och institutionella analyser. Avhandlingen föreslår som resultat en serie av normativa och pragmatiska rekommendationer inriktade på att främja demokratiseringsprocesser i linje med principer om multikulturalism.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Picornaviruses are the most common human viruses and the identification of the picornaviruses is nowadays based on molecular techniques, for example, reverse transcriptase polymerase chain reaction (RT-PCR). One aim of this thesis was to improve the identification of picornaviruses, especially rhino- and enteroviruses, with a real-time assay format and, also, to improve the differentiation of the viruses with genus-specific locked nucleic acid (LNA) probes. Another aim was to identify and study the causative agent of the enterovirus epidemics that appeared in Finland during seasons 2008-2010. In this thesis, the first version of picornavirus qRT-PCR with a melting curve analysis was used in a study of rhinovirus transmission within families with a rhinovirus positive index child where rhinovirus infection was monitored in all family members. In conclusion, rhinoviruses spread effectively within families causing mostly symptomatic infections in children and asymptomatic infections in adults. To improve the differentiation between rhino- and enterovirus the picornavirus qRT-PCR was modified with LNA-incorporated probes. The LNA probes were validated with picornavirus prototypes and different clinical specimen types. The LNA probe-based picornavirus qRT-PCR was able to differentiate all rhino- and enteroviruses correctly, which makes it suitable for diagnostic use. Moreover, in this thesis enterovirus outbreaks were studied with a well-observed method to create a strain-specific qRT-PCR from the typing region VP1 protein. In a hand-foot-and-mouth-disease (HFMD) outbreak in 2008, the causative agent was identified as CV-A6 and when the molecular evolution of the new HFMD CV-A6 strain was studied it was found that CV-A6 was the emerging agent for HFMD and onychomadesis. Furthermore, unusual E-30 meningitis epidemics that apeared during seasons 2009 and 2010 were studied with strain-specific qRT-PCR. The E-30 affected mostly adolescents and was probably spread in sports teams.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

There are more than 7000 languages in the world, and many of these have emerged through linguistic divergence. While questions related to the drivers of linguistic diversity have been studied before, including studies with quantitative methods, there is no consensus as to which factors drive linguistic divergence, and how. In the thesis, I have studied linguistic divergence with a multidisciplinary approach, applying the framework and quantitative methods of evolutionary biology to language data. With quantitative methods, large datasets may be analyzed objectively, while approaches from evolutionary biology make it possible to revisit old questions (related to, for example, the shape of the phylogeny) with new methods, and adopt novel perspectives to pose novel questions. My chief focus was on the effects exerted on the speakers of a language by environmental and cultural factors. My approach was thus an ecological one, in the sense that I was interested in how the local environment affects humans and whether this human-environment connection plays a possible role in the divergence process. I studied this question in relation to the Uralic language family and to the dialects of Finnish, thus covering two different levels of divergence. However, as the Uralic languages have not previously been studied using quantitative phylogenetic methods, nor have population genetic methods been previously applied to any dialect data, I first evaluated the applicability of these biological methods to language data. I found the biological methodology to be applicable to language data, as my results were rather similar to traditional views as to both the shape of the Uralic phylogeny and the division of Finnish dialects. I also found environmental conditions, or changes in them, to be plausible inducers of linguistic divergence: whether in the first steps in the divergence process, i.e. dialect divergence, or on a large scale with the entire language family. My findings concerning Finnish dialects led me to conclude that the functional connection between linguistic divergence and environmental conditions may arise through human cultural adaptation to varying environmental conditions. This is also one possible explanation on the scale of the Uralic language family as a whole. The results of the thesis bring insights on several different issues in both a local and a global context. First, they shed light on the emergence of the Finnish dialects. If the approach used in the thesis is applied to the dialects of other languages, broader generalizations may be drawn as to the inducers of linguistic divergence. This again brings us closer to understanding the global patterns of linguistic diversity. Secondly, the quantitative phylogeny of the Uralic languages, with estimated times of language divergences, yields another hypothesis as to the shape and age of the language family tree. In addition, the Uralic languages can now be added to the growing list of language families studied with quantitative methods. This will allow broader inferences as to global patterns of language evolution, and more language families can be included in constructing the tree of the world’s languages. Studying history through language, however, is only one way to illuminate the human past. Therefore, thirdly, the findings of the thesis, when combined with studies of other language families, and those for example in genetics and archaeology, bring us again closer to an understanding of human history.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Today’s international business in highly related to crossing national, cultural and linguistic borders making communication and linguistic skills a vital part of the trade. The purpose of the study is to understand the role of linguistic skills in trust creation in international business relationships. Subobjectives are to discuss the importance of linguistic skills in international business context, to evaluate the strategic value of trust in business relationships and to analyze the extent to which linguistic skills affect trust formation. The scope is restricted to business-to-business markets. The theoretical background consists of different theories and previous studies related to trust and linguistic skills. Based on the theory a new LTS-framework is created to demonstrate a process model of linguistic skills affecting trust creation in international B2B relationships. This study is qualitative using interviews as a data collection method. Altogether eleven interviews were conducted between October 2014 and February 2015. All of the interviewees worked for organizations operating in the field of international business in B2B markets, spoke multiple languages and had a lot of experience in sales and negotiations. This study confirms that linguistic skills are an important part of international business. In many organizations English is used as lingua franca. However, there are several benefits of speaking the mother tongue of the customer. It makes people feel more relaxed and it makes the relationship more intimate and allows to continue developing it at a more personal level. From the strategic point of view trust creates competitive advantage to a company adding strategic value to the business. The data also supported the view that linguistic skills definitely impact the trust formation process. Quickness and easiness could be stated as the main benefits. It was seen that trust forms faster because both parties understand each other better and they become more open about information sharing within a shorter period of time. These findings and the importance of linguistic skills in trust creation should be acknowledged by organizations, especially regarding the human resource management. Boundary spanners are in key positions so special attention should be put into hiring and educating employees which then take care of company’s relationships. Eventually, these benefits are economical and affect to the profitability of the organization

Relevância:

20.00% 20.00%

Publicador:

Resumo:

In marine benthic communities, herbivores consume a considerable proportion of primary producer biomass and, thus, generate selection for the evolution of resistance traits. According to the theory of plant defenses, resistance traits are costly to produce and, consequently, inducible resistance traits are adaptive in conditions of variable herbivory, while in conditions of constant/strong herbivory constitutive resistance traits are selected for. The evolution of resistance plasticity may be constrained by the costs of resistance or lack of genetic variation in resistance. Furthermore, resource allocation to induced resistance may be affected by higher trophic levels preying on herbivores. I studied the resistance to herbivory of a foundation species, the brown alga Fucus vesiculosus. By using factorial field experiments, I explored the effects of herbivores and fish predators on growth and resistance of the alga in two seasons. I explored genetic variation in and allocation costs of resistance traits as well as their chemical basis and their effects on herbivore performance. Using a field experiment I tested if induced resistance spreads via water-borne cues from one individual to another in relevant ecological conditions. I found that in the northern Baltic Sea F. vesiculosus communities, strength of three trophic interactions strongly vary among seasons. The highly synchronized summer reproduction of herbivores promoted their escape from the top-down control of fish predators in autumn. This resulted into large grazing losses in algal stands. In spring, herbivore densities were low and regulated by fish, which, thus,enhanced algal growth. The resistance of algae to herbivory increased with an increase in constitutive phlorotannin content. Furthermore, individuals adopted induced resistance when grazed and when exposed to water-borne cues originating from grazing of conspecific algae both in the laboratory and in field conditions. Induced resistance was adopted to a lesser extent in the presence of fish predators. The results in this thesis indicate that inducible resistance in F. vesiculosus is an adaptation to varying herbivory in the northern Baltic Sea. The costs of resistance and strong seasonality of herbivory have likely contributed to the evolution of this defense strategy. My findings also show that fish predators have positive cascading effects on F. vesiculosus which arise via reduced herbivory but possibly also through reduced resource allocation to resistance. I further found evidence that the spread of resistance via water-borne cues also occurs in ecologically realistic conditions in natural marine sublittoral. Thus, water-borne induction may enable macroalgae to cope with the strong grazing pressure characteristic of marine benthic communities. The results presented here show that seasonality can have pronounced effects on the biotic interactions in marine benthic communities and thereafter influence the evolution of resistance traits in primary producers.