10 resultados para Aligned Corpus

em Archivo Digital para la Docencia y la Investigación - Repositorio Institucional de la Universidad del País Vasco


Relevância:

20.00% 20.00%

Publicador:

Resumo:

In this article we describe the methodology developed for the semiautomatic annotation of EPEC-RolSem, a Basque corpus labeled at predicate level following the PropBank-VerbNet model. The methodology presented is the product of detailed theoretical study of the semantic nature of verbs in Basque and of their similarities and differences with verbs in other languages. As part of the proposed methodology, we are creating a Basque lexicon on the PropBank-VerbNet model that we have named the Basque Verb Index (BVI). Our work thus dovetails the general trend toward building lexicons from tagged corpora that is clear in work conducted for other languages. EPEC-RolSem and BVI are two important resources for the computational semantic processing of Basque; as far as the authors are aware, they are also the first resources of their kind developed for Basque. In addition, each entry in BVI is linked to the corresponding verb-entry in well-known resources like PropBank, VerbNet, WordNet, Levin’s Classification and FrameNet. We have also implemented several automatic processes to aid in creating and annotating the BVI, including processes designed to facilitate the task of manual annotation.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

592 p. (versión en euskera) ; 117 p. (versión en alemán)

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The present corpus study aimed to examine whether Basque (OV) resorts more often than Spanish (VO) to certain grammatical operations, in order to minimi ze the number of arguments to be processed before the verb. Ueno & Polinsky (2009) argue that VO/OV languages use certain grammatical resources with different frequencies in order to facilitate real-time processing. They observe that both OV and VO languages in their sample (Japanese, Turkish and Spanish) have a similar frequency of use of subject pro-drop; however, they find that OV languages (Japanese, Turkish) use more intransitive sentences than VO languages (English, Spanish), and conclude this is an OV-specific strategy to facilitate processing. We conducted a comparative corpus study of Spanish (VO) and Basque (OV). Results show (a) that the fre- quency of use of subject pro-drop is higher in Basque than in Spanish; and (b) Basque does not use more intransitive sentences than Spanish; both languages have a similar frequency of intransitive sentences. Based on these findings, we conclude that the frequency of use of grammatical resources to facilitate the processing does not depend on a single typological trait (VO/OV) but it is modulated by the concurrence of other grammatical feature.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

More and more users aim at taking advantage of the existing Linked Open Data environment to formulate a query over a dataset and to then try to process the same query over different datasets, one after another, in order to obtain a broader set of answers. However, the heterogeneity of vocabularies used in the datasets on the one side, and the fact that the number of alignments among those datasets is scarce on the other, makes that querying task difficult for them. Considering this scenario we present in this paper a proposal that allows on demand translations of queries formulated over an original dataset, into queries expressed using the vocabulary of a targeted dataset. Our approach relieves users from knowing the vocabulary used in the targeted datasets and even more it considers situations where alignments do not exist or they are not suitable for the formulated query. Therefore, in order to favour the possibility of getting answers, sometimes there is no guarantee of obtaining a semantically equivalent translation. The core component of our proposal is a query rewriting model that considers a set of transformation rules devised from a pragmatic point of view. The feasibility of our scheme has been validated with queries defined in well known benchmarks and SPARQL endpoint logs, as the obtained results confirm.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Editores:Micaela Muñoz-Calvo; Carmen Buesa-Gómez

Relevância:

20.00% 20.00%

Publicador:

Resumo:

[EN] 3D microfluidic device fabrication methods are normally quite expensive and tedious. In this paper, we present an easy and cheap alternative wherein thin cyclic olefin polymer (COP) sheets and pressure sensitive adhesive(PSA) were used to fabricate hybrid 3D microfluidic structures, by the Origami technique, which enables the fabrication of microfluidic devices without the need of any alignment tool. The COP and PSA layers were both cut simultaneously using a portable, low-cost plotter allowing for rapid prototyping of a large variety of designs in a single production step. The devices were then manually assembled using the Origami technique by simply combining COP and PSA layers and mild pressure. This fast fabrication method was applied, as proof of concept, to the generation of a micromixer with a 3D-stepped serpentine design made of ten layers in less than 8 min. Moreover, the micromixer was characterized as a function of its pressure failure, achieving pressures of up to 1000 mbar. This fabrication method is readily accessible across a large range of potential end users, such as educational agencies (schools,universities), low-income/developing world research and industry or any laboratory without access to clean room facilities, enabling the fabrication of robust, reproducible microfluidic devices.