517 resultados para Multilingual


Relevância:

20.00% 20.00%

Publicador:

Resumo:

The identification of cognates between two distinct languages has recently start- ed to attract the attention of NLP re- search, but there has been little research into using semantic evidence to detect cognates. The approach presented in this paper aims to detect English-French cog- nates within monolingual texts (texts that are not accompanied by aligned translat- ed equivalents), by integrating word shape similarity approaches with word sense disambiguation techniques in order to account for context. Our implementa- tion is based on BabelNet, a semantic network that incorporates a multilingual encyclopedic dictionary. Our approach is evaluated on two manually annotated da- tasets. The first one shows that across different types of natural text, our method can identify the cognates with an overall accuracy of 80%. The second one, con- sisting of control sentences with semi- cognates acting as either true cognates or false friends, shows that our method can identify 80% of semi-cognates acting as cognates but also identifies 75% of the semi-cognates acting as false friends.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Community languages have a place in early years education for children in prior-to-school settings and students in formal schooling. Australian curricula treat the languages of the local community as both content and resource for learning. In this article we look at the potential of dual language and multiple language storybooks for teaching about linguistic diversity and developing literacy skills, highlighting the potential of paired and shared reading.

Relevância:

20.00% 20.00%

Publicador:

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This study addresses the issue of multilingualism in EU law. More specifically, it explores the implications of multilingualism for conceptualising legal certainty, a central principle of law both in domestic and EU legal systems. The main question addressed is how multilingualism and legal certainty may be reconciled in the EU legal system. The study begins with a discussion on the role of translation in drafting EU legislation and its implications for interpreting EU law at the European Court of Justice (ECJ). Uncertainty regarding the meaning of multilingual EU law and the interrelationship between multilingualism and ECJ methods of interpretation are explored. This analysis leads to questioning the importance of linguistic-semantic methods of interpretation, especially the role of comparing language versions for clarifying meaning and the ordinary meaning thesis, and to placing emphasis on other, especially the teleological, purpose-oriented method of interpretation. As regards the principle of legal certainty, the starting-point is a two-dimensional concept consisting of both formal and substantive elements; of predictability and acceptability. Formal legal certainty implies that laws and adjudication, in particular, must be predictable. Substantive legal certainty is related to rational acceptability of judicial decision-making placing emphasis on its acceptability to the legal community in question. Contrary to predictability that one might intuitively relate to linguistic-semantic methods of interpretation, the study suggests a new conception of legal certainty where purpose, telos, and other dynamic methods of interpretation are of particular significance for meaning construction in multilingual EU law. Accordingly, the importance of purposive, teleological interpretation as the standard doctrine of interpretation in a multilingual legal system is highlighted. The focus on rational, substantive acceptability results in emphasising discourse among legal actors among the EU legal community and stressing the need to give reasons in favour of proposed meaning in accordance with dynamic methods of interpretation including considerations related to purposes, aims, objectives and consequences. In this context, the role of ideal discourse situations and communicative action taking the form of interaction among the EU legal community in an ongoing dialogue especially in the preliminary ruling procedure is brought into focus. In order for this dialogue to function, it requires that the ECJ gives persuasive, convincing and acceptable reasons in justifying its decisions. This necessitates transparency, sincerity, and dialogue with the relevant audience.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

To effectively support today’s global economy, database systems need to manage data in multiple languages simultaneously. While current database systems do support the storage and management of multilingual data, they are not capable of querying across different natural languages. To address this lacuna, we have recently proposed two cross-lingual functionalities, LexEQUAL[13] and SemEQUAL[14], for matching multilingual names and concepts, respectively. In this paper, we investigate the native implementation of these multilingual functionalities as first-class operators on relational engines. Specifically, we propose a new multilingual storage datatype, and an associated algebra of the multilingual operators on this datatype. These components have been successfully implemented in the PostgreSQL database system, including integration of the algebra with the query optimizer and inclusion of a metric index in the access layer. Our experiments demonstrate that the performance of the native implementation is up to two orders-of-magnitude faster than the corresponding outsidethe- server implementation. Further, these multilingual additions do not adversely impact the existing functionality and performance. To the best of our knowledge, our prototype represents the first practical implementation of a crosslingual database query engine.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Today 80 % of the content on the Web is in English, which is spoken by only 8% of the World population and 5% of Indian population. There is wealth of useful content in the various languages of the world other than English, which can be made available on the Internet. But, to date, for various reasons most of it is not yet available on the Internet. India itself has 18 officially recognized languages and scores of dialects. Although the medium of instruction for most of the higher education and research in India is English, substantial amount of literature by way of novels, textbooks, scholarly information are being generated in the other languages in the country. Many of the e-governance initiatives are in the respective state languages. In the past, support for different languages by the operating systems and the software packages were not very encouraging. However, with the advent of Unicode technology, operating systems and software packages are supporting almost all the major languages of the world that have scripts. In the work reported in this paper, we have explained the configuration changes that are needed for Eprints.org software to store multilingual content and to create a multilingual user interface.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Scatter/Gather systems are increasingly becoming useful in browsing document corpora. Usability of the present-day systems are restricted to monolingual corpora, and their methods for clustering and labeling do not easily extend to the multilingual setting, especially in the absence of dictionaries/machine translation. In this paper, we study the cluster labeling problem for multilingual corpora in the absence of machine translation, but using comparable corpora. Using a variational approach, we show that multilingual topic models can effectively handle the cluster labeling problem, which in turn allows us to design a novel Scatter/Gather system ShoBha. Experimental results on three datasets, namely the Canadian Hansards corpus, the entire overlapping Wikipedia of English, Hindi and Bengali articles, and a trilingual news corpus containing 41,000 articles, confirm the utility of the proposed system.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

clarify the multilingual problem of place and oceanographic names in the region of the Okhotsk Sea, especially in Southern Sakhalin (Minami Karafuto) and in the Kuril Islands (Chishima Islands) areas. (PDF contains 61 pages)

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Máster y Doctorado en Sistemas Informáticos Avanzados, Informatika Fakultatea - Facultad de Informática