711 resultados para Annotation informatisée
Resumo:
IARG-AnCora tiene como objetivo la anotación con papeles temáticos de los argumentos implícitos de las nominalizaciones deverbales en el corpus AnCora. Estos corpus servirán de base para los sistemas de etiquetado automático de roles semánticos basados en técnicas de aprendizaje automático. Los analizadores semánticos son componentes básicos en las aplicaciones actuales de las tecnologías del lenguaje, en las que se quiere potenciar una comprensión más profunda del texto para realizar inferencias de más alto nivel y obtener así mejoras cualitativas en los resultados.
Resumo:
One of the main concerns is the nature of the missing values. Let’s consider extremes for simplicity. If missing at random we have not to care about. But if missing shows structures that covariate with substantive variables we have to make decisions. There are, in fact, several options to take. We are speaking about one country, one mode. But if you go cross-cultural (or more precisely, cross-state nations) and mixed modes many questions raise. For example, the simple one. What are we comparing? Reports and books usually go straight into variables distributions and coefficient comparisons. This is possible because the annalist presume "tabula rasa" effect from data collections procedures. But this is not, frequently, the real situation. This paper will expose the mixed missing mode imprint in international surveys. This will help to evaluate how deal with this problem. Also, to consider the real meaning of observed cross-national differences.
Resumo:
Pochonia chlamydosporia is a worldwide-distributed soil fungus with a great capacity to infect and destroy the eggs and kill females of plant-parasitic nematodes. Additionally, it has the ability to colonize endophytically roots of economically-important crop plants, thereby promoting their growth and eliciting plant defenses. This multitrophic behavior makes P. chlamydosporia a potentially useful tool for sustainable agriculture approaches. We sequenced and assembled ∼41 Mb of P. chlamydosporia genomic DNA and predicted 12,122 gene models, of which many were homologous to genes of fungal pathogens of invertebrates and fungal plant pathogens. Predicted genes (65%) were functionally annotated according to Gene Ontology, and 16% of them found to share homology with genes in the Pathogen Host Interactions (PHI) database. The genome of this fungus is highly enriched in genes encoding hydrolytic enzymes, such as proteases, glycoside hydrolases and carbohydrate esterases. We used RNA-Seq technology in order to identify the genes expressed during endophytic behavior of P. chlamydosporia when colonizing barley roots. Functional annotation of these genes showed that hydrolytic enzymes and transporters are expressed during endophytism. This structural and functional analysis of the P. chlamydosporia genome provides a starting point for understanding the molecular mechanisms involved in the multitrophic lifestyle of this fungus. The genomic information provided here should also prove useful for enhancing the capabilities of this fungus as a biocontrol agent of plant-parasitic nematodes and as a plant growth-promoting organism.
Resumo:
This paper describes the automatic process of building a dependency annotated corpus based on Ancora constituent structures. The Ancora corpus already has a dependency structure information layer, but the new annotated data applies a purely syntactic orientation and offers in this way a new resource to the linguistic research community. The paper details the process of reannotating the corpus, the linguistic criteria used and the obtained results.
Resumo:
The great amount of text produced every day in the Web turned it as one of the main sources for obtaining linguistic corpora, that are further analyzed with Natural Language Processing techniques. On a global scale, languages such as Portuguese - official in 9 countries - appear on the Web in several varieties, with lexical, morphological and syntactic (among others) differences. Besides, a unified spelling system for Portuguese has been recently approved, and its implementation process has already started in some countries. However, it will last several years, so different varieties and spelling systems coexist. Since PoS-taggers for Portuguese are specifically built for a particular variety, this work analyzes different training corpora and lexica combinations aimed at building a model with high-precision annotation in several varieties and spelling systems of this language. Moreover, this paper presents different dictionaries of the new orthography (Spelling Agreement) as well as a new freely available testing corpus, containing different varieties and textual typologies.
Graphical Representation of the Changes of Sector for Particular Cases in the Ponchon Savarit Method
Resumo:
A graphical and systematic analysis of particular cases where the compositions of the streams developed in the rectification column coincide with one of the vapor (yGFk) or liquid (xGFk) portions generated from the GFk can be found in this material (i.e.: yGFk=yk+1,1 or xGFk=xk,NTk).
Resumo:
En el archivo del Quaid’Orsay de París he podido consultar la correspondencia que entre 1907 y 1909 mantuvieron el embajador francés en Madrid y su ministro de Asuntos Exteriores. Aunque existe bibliografía sobre los acontecimientos que condujeron a la campaña de 1909, la documentación de las cajas 88-95 (Affaires duRif) de la Correspondance politique et militaire posee una doble virtud: detalla esos acontecimientos y contribuye a esclarecer varios temas polémicos, razón por la que preparo un trabajo basado en esa correspondencia. Esta nota de investigación adelanta algunas de sus conclusiones. La he dividido en dos partes. En la primera, relato los sucesos que originaron la guerra y en la segunda utilizo la documentación para tratar dos temas controvertidos: el problema de las minas como desencadenante de la contienda y el papel que Alfonso XIII jugó en ella. La limitación de espacio que exige una nota de investigación me ha obligado a “comprimir” la primera parte y por la misma razón no he incluido notas a pie de página citando los documentos de donde proviene la información.
Resumo:
El análisis de citas bibliográficas que usa variaciones de métodos de conteo provoca deformaciones en la evaluación del impacto. Para enriquecer el cálculo de los factores de impacto se necesita entender el tipo de influencia de los aportes de un investigador sobre el autor que los menciona. Para ello, se requiere realizar análisis de contenido del contexto de las citas que permita obtener su función, polaridad e influencia. El presente artículo trata sobre la definición de un esquema de anotación tendiente a la creación de un corpus de acceso público que sea la base de trabajo colaborativo en este campo, con miras al desarrollo de sistemas que permitan llevar adelante tareas de análisis de contenido con el objetivo planteado.
Resumo:
Esta investigación fue financiada en parte por el Ministerio de Ciencia e Innovación (CGL2011-23658), Ministerio de Economía y Competitividad (CGL2012-31669) y Generalitat Valenciana (proyectos PROMETEO/2013/03412 y ACOMP/2014/140). A. R. H. agradece la beca predoctoral del programa Santiago Grisolía de la Generalitat Valenciana (GRISOLIA/2010/080).