40 results for Linear Attention, Conditional Language Model, Natural Language Generation, FLAX, Rare diseases
Abstract:
Next-generation sequencing techniques such as exome sequencing can successfully detect all genetic variants in a human exome, and they have been useful, together with the implementation of variant filters, for identifying disease-causing mutations. Two filters are mainly used for mutation identification: low allele frequency and the computational annotation of the genetic variant. Bioinformatic tools that predict the effect of a given variant may make errors due to existing bias in databases, and they sometimes show limited agreement with one another. Advances in functional and comparative genomics are needed in order to properly annotate these variants. The goal of this study is twofold. First, to functionally annotate Common Variable Immunodeficiency (CVID) variants with the available bioinformatic methods in order to assess the reliability of these strategies. Second, as the development of new methods to reduce the number of candidate genetic variants is an active and necessary field of research, we explore the utility of gene function information at the organism level as a filter for rare disease gene identification. It has recently been proposed that only 10-15% of human genes are essential, and we would therefore expect severe rare diseases to be mostly caused by mutations in them. Our goal is to determine whether these rare and severe diseases are caused by deleterious mutations in these essential genes. If this hypothesis were true, using essential genes as a filter would be a useful way to identify disease-causing mutations.
Abstract:
Development of a system capable of processing natural-language queries entered by the user via the keyboard. The system can answer queries in Spanish related to an application domain represented by a relational database.
Abstract:
In this paper we present the theoretical and methodological foundations for the development of a multi-agent Selective Dissemination of Information (SDI) service model that applies Semantic Web technologies for specialized digital libraries. These technologies make it possible to achieve more efficient information management, improve agent–user communication processes, and facilitate accurate access to relevant resources. Other tools used are fuzzy linguistic modelling techniques (which ease the interaction between users and system) and natural language processing (NLP) techniques for semiautomatic thesaurus generation. Also, RSS feeds are used as “current awareness bulletins” to generate personalized bibliographic alerts.
Abstract:
Although paraphrasing is the linguistic mechanism underlying many plagiarism cases, little attention has been paid to its analysis in the framework of automatic plagiarism detection. Therefore, state-of-the-art plagiarism detectors find it difficult to detect cases of paraphrase plagiarism. In this article, we analyse the relationship between paraphrasing and plagiarism, paying special attention to which paraphrase phenomena underlie acts of plagiarism and which of them are detected by plagiarism detection systems. With this aim in mind, we created the P4P corpus, a new resource which uses a paraphrase typology to annotate a subset of the PAN-PC-10 corpus for automatic plagiarism detection. The results of the Second International Competition on Plagiarism Detection were analysed in the light of this annotation. The presented experiments show that (i) more complex paraphrase phenomena and a high density of paraphrase mechanisms make plagiarism detection more difficult, (ii) lexical substitutions are the paraphrase mechanisms used the most when plagiarising, and (iii) paraphrase mechanisms tend to shorten the plagiarized text. For the first time, the paraphrase mechanisms behind plagiarism have been analysed, providing critical insights for the improvement of automatic plagiarism detection systems.
Abstract:
Peer-reviewed
Abstract:
This paper investigates the economic value of Catalan knowledge for national and foreign first- and second-generation immigrants in Catalonia. Specifically, drawing on data from the “Survey on Living Conditions and Habits of the Catalan Population (2006)”, we quantify the expected earnings differential between individuals who are proficient in Catalan and those who are not, taking into account the potential endogeneity between knowledge of Catalan and earnings. The results indicate the existence of a positive return to knowledge of Catalan, with a 7.5% increase in earnings estimated by OLS; however, when we account for the presence of endogeneity, monthly earnings are around 18% higher for individuals who are able to speak and write Catalan. We also find that language and education are complementary inputs for generating earnings in Catalonia, given that knowledge of Catalan increases monthly earnings only for more educated individuals.
Abstract:
The study tested three analytic tools applied in SLA research (T-unit, AS-unit and Idea-unit) against FL learner monologic oral data. The objective was to analyse their effectiveness for assessing the complexity of learners' academic production in English. The data were learners' individual productions gathered during the implementation of a CLIL teaching sequence on Natural Sciences in a Catalan state secondary school. The analysis showed that only the AS-unit was easily applicable and highly effective in segmenting the data and taking complexity measures.
Abstract:
The goal of PANACEA is to bring together different advanced tools to build a Language Resources (LR) factory: a production line that automates the steps involved in the acquisition, production, updating and maintenance of the LRs that Machine Translation and other language technologies need.
Abstract:
Language Resources are a critical component for Natural Language Processing applications. Throughout the years many resources were manually created for the same task, but with different granularity and coverage information. To create richer resources for a broad range of potential reuses, information from all resources has to be joined into one. The high cost of comparing and merging different resources by hand has been a bottleneck for merging existing resources. With the objective of reducing human intervention, we present a new method for automating the merging of resources. We have addressed the merging of two verb subcategorization frame (SCF) lexica for Spanish. The results achieved, a new lexicon with enriched information and with conflicting information flagged, reinforce our idea that this approach can be applied to other NLP tasks.
Abstract:
Collaborative activities, in which students actively interact with each other, have proved to provide significant learning benefits. In Computer-Supported Collaborative Learning (CSCL), these collaborative activities are assisted by technologies. However, the use of computers does not guarantee collaboration, as free collaboration does not necessarily lead to fruitful learning. Therefore, practitioners need to design CSCL scripts that structure the collaborative settings so that they promote learning. However, not all teachers have the technical and pedagogical background needed to design such scripts. With the aim of assisting teachers in designing effective CSCL scripts, we propose a model to support the selection of reusable good practices (formulated as patterns) so that they can be used as a starting point for their own designs. This model is based on a pattern ontology that computationally represents the knowledge captured in a pattern language for the design of CSCL scripts. A preliminary evaluation of the proposed approach is provided with two examples based on a set of meaningful interrelated patterns computationally represented with the pattern ontology, and a paper prototyping experience carried out with two teachers. The results offer interesting insights towards the implementation of the pattern ontology in software tools.
Abstract:
Under the Dynamic Model of Multilingualism, multilinguals are especially vulnerable to language attrition. The aim of the present study was to verify whether this was the case and to observe whether the different linguistic skills (receptive vs. productive) and the different linguistic levels (syntactic, lexical, morphological, etc.) would be affected equally. Data were gathered longitudinally by means of a language test for the subject’s reading, writing, listening and speaking skills as well as her knowledge of grammar and vocabulary. Although the overall accuracy remained intact and no proof of attrition in the receptive skills was found, the productive skills, mainly fluency, were shown to have suffered from language attrition. This was demonstrated by an increase in the number of pauses, hesitations, repetitions and self-corrections, among others, and by a decrease in the percentage of error-free clauses and in clause length, in oral and written fluency respectively.
Abstract:
Contrary to what many parents and teachers think, and contrary to what has been found in contexts of naturalistic second language acquisition and immersion, starting language learning as early as possible does not seem to be the most effective option, especially for the most cognitively demanding tasks, such as writing. The Language Acquisition Research Group (GRAL) studied learners of English who started at the ages of eight and eleven and consistently found that in the long term, after the same number of hours of instruction, it was the older learners, who had started at eleven, who obtained better results in all oral and written English tests except in some phonetic recognition tests. The cognitive maturity of the older students and the different implicit and explicit learning mechanisms of younger children and adults help explain why merely starting language learning earlier does not seem sufficient to obtain better results.
Abstract:
Background: Recent research based on comparisons between bilinguals and monolinguals postulates that bilingualism enhances cognitive control functions, because the parallel activation of languages necessitates control of interference. In a novel approach we investigated two groups of bilinguals, distinguished by their susceptibility to cross-language interference, asking whether bilinguals with strong language control abilities ("non-switchers") have an advantage in executive functions (inhibition of irrelevant information, problem solving, planning efficiency, generative fluency and self-monitoring) compared to those bilinguals showing weaker language control abilities ("switchers"). Methods: 29 late bilinguals (21 women) were evaluated using various cognitive control neuropsychological tests [e.g., Tower of Hanoi, Ruff Figural Fluency Task, Divided Attention, Go/noGo] tapping executive functions as well as four subtests of the Wechsler Adult Intelligence Scale. The analysis involved t-tests (two independent samples). Non-switchers (n = 16) were distinguished from switchers (n = 13) by their performance observed in a bilingual picture-naming task. Results: The non-switcher group demonstrated a better performance on the Tower of Hanoi and Ruff Figural Fluency task, faster reaction time in a Go/noGo and Divided Attention task, and produced significantly fewer errors in the Tower of Hanoi, Go/noGo, and Divided Attention tasks when compared to the switchers. Non-switchers performed significantly better on two verbal subtests of the Wechsler Adult Intelligence Scale (Information and Similarity), but not on the Performance subtests (Picture Completion, Block Design). Conclusions: The present results suggest that bilinguals with stronger language control do indeed have a cognitive advantage in the administered tests involving executive functions, in particular inhibition, self-monitoring, problem solving, and generative fluency, and in two of the intelligence tests. What remains unclear is the direction of the relationship between executive functions and language control abilities.
Abstract:
Humanist writers were multifaceted and their writings eclectic, delving into a wide range of fields of enquiry. Many issues were raised and addressed, pursued or abandoned, often unsystematically. This heterogeneity has frequently led to the neglect of specific facets of authors who have gained renown in other fields. This I believe to be the case for Richard Mulcaster and Juan Luis Vives, whose contribution to language theory has been eclipsed by their relatively modern views on education. Their views on language merit more attention, if not for their originality as such, at least for the testimony they provide of a period in transition. The work of these authors shows how views on language evolved throughout the period and conveys a sense of its dynamic character. Profoundly conservative attitudes coexist with progressive ones and, though rooted in the past, they strain towards a new vision of the nature and functioning of language in human society.
Abstract:
Although the majority of English language teachers worldwide are non-native English speakers, no research was conducted on these teachers until recently. After the pioneering work of Robert Phillipson in 1992 and Peter Medgyes in 1994, nearly a decade had to elapse for more research to emerge on the issues relating to non-native English teachers. The publication in 1999 of George Braine's book Nonnative educators in English language teaching appears to have encouraged a number of graduate students and scholars to research this issue, with topics ranging from teachers' perceptions of their own identity to students' views and aspects of teacher education. This article compiles, classifies, and examines research conducted in the last two decades on this topic, placing a special emphasis on World Englishes concerns, methods of investigation, and areas in need of further attention.