943 results for "manipulación textual"


Relevance:

20.00% 20.00%

Publisher:

Abstract:

Objective: To synthesise recent research on the use of machine learning approaches to mining textual injury surveillance data.
Design: Systematic review.
Data sources: The electronic databases searched were PubMed, Cinahl, Medline, Google Scholar, and Proquest. The bibliographies of all relevant articles were examined, and associated articles were identified using a snowballing technique.
Selection criteria: For inclusion, articles were required to (a) use a health-related database, (b) focus on injury-related cases, and (c) use machine learning approaches to analyse textual data.
Methods: The papers identified through the search were screened, resulting in 16 papers selected for review. Articles were reviewed to describe the databases and methodology used, the strengths and limitations of different techniques, and the quality assurance approaches used. Due to heterogeneity between studies, meta-analysis was not performed.
Results: Occupational injuries were the focus of half of the machine learning studies, and the most common methods described were Bayesian probability or Bayesian network based methods, used either to predict injury categories or to extract common injury scenarios. Models were evaluated through comparison with gold standard data, content expert evaluation, or statistical measures of quality. Machine learning was found to provide high precision and accuracy when predicting a small number of categories, and was valuable for visualisation of injury patterns and prediction of future outcomes. However, difficulties related to generalizability, source data quality, complexity of models, and integration of content and technical knowledge were discussed.
Conclusions: The use of narrative text for injury surveillance has grown in popularity, complexity and quality over recent years. With advances in data mining techniques, increased capacity for analysis of large databases, and involvement of computer scientists in the injury prevention field, along with more comprehensive use and description of quality assurance methods in text mining approaches, it is likely that we will see continued growth and advancement in knowledge of text mining in the injury field.
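
As an illustration of the Bayesian category prediction mentioned in the Results, the following is a minimal sketch of a multinomial naive Bayes classifier over injury narratives. It is not taken from any of the reviewed studies: the function names, the whitespace tokenisation, the add-one smoothing and the four toy narratives are all assumptions made for the example.

```python
import math
from collections import Counter, defaultdict


def train_naive_bayes(examples):
    """Count documents and words per category from (narrative, category) pairs."""
    doc_counts = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for text, label in examples:
        doc_counts[label] += 1
        for word in text.lower().split():
            word_counts[label][word] += 1
            vocab.add(word)
    return doc_counts, word_counts, vocab


def predict(model, text):
    """Return the category with the highest log posterior for a narrative."""
    doc_counts, word_counts, vocab = model
    total_docs = sum(doc_counts.values())
    best_label, best_score = None, -math.inf
    for label in doc_counts:
        # Log prior estimated from category frequencies in the training data.
        score = math.log(doc_counts[label] / total_docs)
        total_words = sum(word_counts[label].values())
        for word in text.lower().split():
            if word not in vocab:
                continue  # words never seen in training carry no evidence
            # Add-one (Laplace) smoothing avoids zero probabilities.
            count = word_counts[label][word] + 1
            score += math.log(count / (total_words + len(vocab)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Hypothetical toy narratives, invented purely for illustration.
TOY_NARRATIVES = [
    ("worker fell from ladder", "fall"),
    ("slipped on wet floor and fell", "fall"),
    ("hand burned by hot oil", "burn"),
    ("burned arm on steam pipe", "burn"),
]

model = train_naive_bayes(TOY_NARRATIVES)
print(predict(model, "fell off ladder"))
```

With only a handful of categories and distinctive vocabulary, even this simple model separates the toy cases, which is consistent with the review's remark that accuracy is highest when few categories are predicted.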

Relevance:

20.00% 20.00%

Publisher:

Abstract:

In this thesis we present and evaluate two pattern matching based methods for answer extraction in textual question answering systems. A textual question answering system is a system that seeks answers to natural language questions from unstructured text. Textual question answering is an important research problem because, as the amount of natural language text in digital format keeps growing, the need for novel methods for pinpointing important knowledge in vast textual databases becomes more and more urgent. We concentrate on developing methods for the automatic creation of answer extraction patterns. A new type of extraction pattern is also developed. The pattern matching based approach chosen is interesting because of its language and application independence. The answer extraction methods are developed in the framework of our own question answering system. Publicly available datasets in English are used as training and evaluation data for the methods. The techniques developed are based on the well-known methods of sequence alignment and hierarchical clustering. The similarity metric used is based on edit distance. The main conclusion of the research is that answer extraction patterns consisting of the most important words of the question and of the following information extracted from the answer context (plain words, part-of-speech tags, punctuation marks and capitalization patterns) can be used in the answer extraction module of a question answering system. This type of pattern and the two new methods for generating answer extraction patterns provide average results when compared to those produced by other systems using the same dataset. However, most answer extraction methods in the question answering systems tested with the same dataset are both hand crafted and based on a system-specific and fine-grained question classification. The new methods developed in this thesis require no manual creation of answer extraction patterns.
As a source of knowledge, they require a dataset of sample questions and answers, as well as a set of text documents that contain answers to most of the questions. The question classification used in the training data is a standard one and provided already in the publicly available data.
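
To make the edit-distance-based similarity concrete, here is a minimal sketch of Levenshtein distance over arbitrary sequences (characters or tokens), together with a normalised similarity score. The abstract does not specify the thesis's exact metric, so the unit costs, the normalisation by the longer sequence, and the `<ANSWER>` placeholder token are assumptions for illustration only.

```python
def edit_distance(a, b):
    """Levenshtein distance between two sequences (strings or token lists)."""
    # previous[j] holds the distance between the first i-1 items of a
    # and the first j items of b.
    previous = list(range(len(b) + 1))
    for i, item_a in enumerate(a, start=1):
        current = [i]
        for j, item_b in enumerate(b, start=1):
            cost = 0 if item_a == item_b else 1
            current.append(min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # substitution or match
            ))
        previous = current
    return previous[-1]


def similarity(a, b):
    """Map edit distance to [0, 1]; 1.0 means the sequences are identical."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0
    return 1.0 - edit_distance(a, b) / longest


# Two hypothetical answer contexts, tokenised on whitespace.
context_1 = "was born in <ANSWER>".split()
context_2 = "was born on <ANSWER>".split()
print(edit_distance(context_1, context_2), similarity(context_1, context_2))
```

A pairwise similarity of this kind is the sort of input a hierarchical clustering step needs in order to group similar answer contexts before patterns are generalised from each cluster.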

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Concept inventory tests are one method to evaluate conceptual understanding and identify possible misconceptions. The multiple-choice question format, offering a choice between a correct selection and common misconceptions, can provide an assessment of students' conceptual understanding in various dimensions. Misconceptions of some engineering concepts exist due to a lack of mental frameworks, or schemas, for these types of concepts or conceptual areas. This study incorporated an open textual response component in a multiple-choice concept inventory test to capture written explanations of students' selections. The study's goal was to identify, through text analysis of student responses, the types and categorizations of concepts in these explanations that had not been uncovered by the distractor selections. The analysis of the textual explanations of a subset of the discrete-time signals and systems concept inventory questions revealed that students have difficulty conceptually explaining several dimensions of signal processing. This contributed to their inability to provide a clear explanation of the underlying concepts, such as mathematical concepts. The methods used in this study evaluate students' understanding of signals and systems concepts through their ability to express understanding in written text. This may present a bias for students with strong written communication skills. This study presents a framework for extracting and identifying the types of concepts students use to express their reasoning when answering conceptual questions.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

We address the task of mapping a given textual domain model (e.g., an industry-standard reference model) for a given domain (e.g., ERP) to the source code of an independently developed application in the same domain. This has applications in improving the understandability of an existing application, migrating it to a more flexible architecture, or integrating it with other related applications. We use the vector-space model to abstractly represent domain model elements as well as source-code artifacts. The key novelty in our approach is to leverage the relationships between source-code artifacts in a principled way to improve the mapping process. We describe experiments in which we apply our approach to the task of matching two real, open-source applications to corresponding industry-standard domain models. We demonstrate the overall usefulness of our approach, as well as the role of our propagation techniques in improving the precision and recall of the mapping task.
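
The vector-space step can be sketched as follows: each domain model element and each source-code artifact becomes a term-frequency vector, and every element is mapped to the artifact with the highest cosine similarity. This covers only the baseline matching; the propagation over relationships between artifacts that the authors describe is not reproduced. The tokeniser, the raw term-frequency weighting, and the `DOMAIN` and `CODE` samples are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Split on non-letters and camelCase boundaries, then lower-case."""
    spaced = re.sub(r"([a-z])([A-Z])", r"\1 \2", text)
    return [token.lower() for token in re.findall(r"[A-Za-z]+", spaced)]


def cosine(u, v):
    """Cosine similarity between two sparse term-frequency vectors."""
    dot = sum(u[term] * v[term] for term in u if term in v)
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def best_matches(domain_elements, code_artifacts):
    """Map each domain element name to the most similar code artifact name."""
    code_vectors = {
        name: Counter(tokenize(text)) for name, text in code_artifacts.items()
    }
    mapping = {}
    for element, description in domain_elements.items():
        vector = Counter(tokenize(description))
        mapping[element] = max(
            code_vectors, key=lambda name: cosine(vector, code_vectors[name])
        )
    return mapping


# Hypothetical domain model descriptions and source-code snippets.
DOMAIN = {
    "Purchase Order": "purchase order sent to a supplier",
    "Invoice": "invoice issued to a customer",
}
CODE = {
    "PurchaseOrderService": "class PurchaseOrderService createOrder supplier",
    "InvoiceDao": "class InvoiceDao findInvoice customer",
}

print(best_matches(DOMAIN, CODE))
```

Splitting identifiers on camelCase boundaries matters here because source-code vocabulary is mostly embedded in compound names, whereas domain models use plain words.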

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Introduction: One of the major topics of debate in Argentine legal scholarship in 2012 was the submission to the National Congress of the "Proyecto de Reforma, Unificación y Actualización de los Códigos Civil y Comercial de la Nación" (Bill for the Reform, Unification and Updating of the Civil and Commercial Codes of the Nation). Many voices have been raised both for and against the reform. While some consider that the proposed legislation represents progress in the field of human rights, going so far as to call it the "most participatory reform in history", others reject this idea and see major deficiencies in the Bill. In this paper we attempt to analyse one of the most controversial articles of the reform: Article 19, concerning the beginning of the existence of the human person. Our investigation does not end there, however: we go on to analyse another article that may pass unnoticed but has major implications when read in the light of Article 19. We refer to Article 57, which deals with practices intended to alter the genetic constitution of descendants. Finally, we seek to identify what implications this situation may have for the liability of health professionals.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

[ES] This paper studies the use of discourse markers and asyndeton as means of textual articulation between the various utterances that make up the "Progumnásmata" of Nicolao. The study makes it possible to observe whether there are differences between the two parts that make up Felten's edition, and whether Nicolao's use of particles differs from that of the other authors of "Progumnásmata".

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Eguíluz, Federico; Merino, Raquel; Olsen, Vickie; Pajares, Eterio; Santamaría, José Miguel (eds.)

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Eguíluz, Federico; Merino, Raquel; Olsen, Vickie; Pajares, Eterio; Santamaría, José Miguel (eds.)

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Raquel Merino Álvarez, José Miguel Santamaría, Eterio Pajares (eds.)

Relevance:

20.00% 20.00%

Publisher:

Abstract:

[EN] Measuring semantic similarity and relatedness between textual items (words, sentences, paragraphs or even documents) is a very important research area in Natural Language Processing (NLP). It has many practical applications in other NLP tasks, for instance Word Sense Disambiguation, Textual Entailment, Paraphrase Detection, Machine Translation and Summarization, as well as related tasks such as Information Retrieval or Question Answering. In this master's thesis we study different approaches to computing the semantic similarity between textual items. In the framework of the European PATHS project, we also evaluate a knowledge-based method on a dataset of cultural item descriptions. Additionally, we describe the work carried out for the Semantic Textual Similarity (STS) shared task of SemEval-2012. This work has involved supporting the creation of datasets for similarity tasks, as well as the organization of the task itself.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

This is a collection of 10 video tutorials that can be used as teaching material in basic phonetics courses at university level. Videos 1-3 cover aspects of recording: the types of microphones used, the kinds of spaces in which audio signals are usually captured, and the recorders typically used. Video 4 explores techniques for capturing and observing airflow and pressure data in aerodynamic phonetics. Videos 5-10 present the main uses of the program Praat (Boersma and Weenink, 2014) in current acoustic phonetics research, from the kind of information about the manner of articulation of consonants that can be identified in oscillograms to the creation of synthesised sound signals using procedures the program provides for that purpose, which can in turn be used in auditory perception experiments.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

429 pp. -- Original doctoral thesis defended at the Universidad Pública de Navarra (UPNA, Dpto. Ciencias de la Salud

Relevance:

20.00% 20.00%

Publisher:

Abstract:

[ES] This work deals with reconfigurable control systems. The aim is for the control system to keep controlling the process when the user requests a change or when a controller fails. The approach will be applied to the first station of the FMS 200 didactic handling cell in the laboratory of the Departamento de Ingeniería de Sistemas y Automática (DISA) at the ETSI de Bilbao.