894 resultados para Query expansion, Text mining, Information retrieval, Chinese IR


Relevância:

100.00% 100.00%

Publicador:

Resumo:

O transplante hepático é o tratamento de escolha para uma série de doenças terminais agudas e crônicas do fígado. Contudo, sua oferta tem sido restringida pela falta de doadores, o que tem provocado o aumento do número de pacientes em lista de espera. A escassez de órgãos condiciona a aceitação para transplante de enxertos provindos de doadores sem as melhores condições para tal – os chamados doadores marginais. O dano de isquemia/reperfusão (IR) é resultado dos fatores perioperatórios inerentes ao procedimento, incluindo as condições do doador. Quanto pior o doador, pior o órgão transplantado, e maior a possibilidade de desenvolvimento de disfunção primária do enxerto (DPE). DPE comumente é definida pela elevação das enzimas hepáticas. As aminotransferases, entretanto, podem alterar-se por outras complicações que não a lesão de isquemia/reperfusão. A histologia hepática, por sua vez, pode fornecer informações acerca da IR. Com o objetivo de estimar a extensão histológica do dano de preservação (necrose hepatocelular e neutrofilia sinusoidal), correlacioná-la a variáveis bioquímicas (índice de reperfusão: AST + ALT + LDH / 3) e avaliar a sua influência no período pós-operatório imediato (até 7 dias), foi realizado um estudo transversal com análise sistemática de 55 pacientes adultos que receberam seu primeiro enxerto hepático entre Setembro de 1996 e Dezembro de 1999. Foram comparados os fatores de risco relacionados ao doador, ao receptor, ao procedimento cirúrgico e ao período pós-operatório e analisadas as biópsias feitas antes e imediatamente após o procedimento cirúrgico. Houve dano de preservação em todos os pacientes estudados tanto por critérios anatomopatológicos quanto por critérios bioquímicos. Houve associação significativa entre os achados bioquímicos e histológicos (p=0,04; coeficiente gamma=0,49). A extensão da necrose hepatocitária parece ser o dado anatomopatológico isolado que melhor se relaciona ao índice de reperfusão (p=0,05; coeficiente gamma=0,48). Houve associação entre DPE e a histologia hepática (p=0,02). O índice bioquímico associou-se à DPE (p=0,001) e à incidência de insuficiência renal aguda (IRA) (p<0,0001). A mortalidade inicial foi maior nos pacientes com índice de reperfusão grave (p=0,002). O índice de reperfusão foi um fator de risco independente para a função do enxerto (p=0,004) e IRA (0,04). A sobrevida atuarial em 1 ano foi significativamente menor nos pacientes com dano de preservação grave (p=0,003). A análise da biópsia de reperfusão é capaz de detectar o dano de preservação sofrido pelo enxerto e se correlaciona às variáveis bioquímicas em sua estimativa.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Esta pesquisa tem como objetivo principal propor uma metodologia que agilize a construção de uma ferramenta no campo da Documentação. Trata-se da geração de uma base de dados terminológica com sustentação na terminologia utilizada pelo especialista em sua área de domínio. Ela se apóia nos pressupostos teóricos da Teoria da Enunciação, da Teoria Comunicativa da Terminologia e da Socioterminologia. Com esse referencial acredita-se ser possível assegurar a efetiva comunicação entre os Sistemas de Recuperação de Informação e os usuários, sendo o bibliotecário o mediador do processo comunicativo que tem origem no autor do texto indexado. Buscou-se o suporte da Terminografia e da Lingüística de Corpus pela possibilidade de coletar, tratar e armazenar um grande volume de informações de uma determinada área do saber.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

A implantação dos sistemas de notas fiscais eletrônicas proporcionou uma grande quantidade de dados para as administrações tributárias. Analisar esses dados e extrair informações importantes é um desafio. Esse trabalho buscou, por meio de técnicas de análise de dados e mineração de textos, identificar, a partir da descrição dos serviços prestados, notas emitidas incorretamente a fim de respaldar um melhor planejamento de fiscalizações.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The popularization of the Internet has stimulated the appearance of Search Engines that have as their objective aid the users in the Web information research process. However, it s common for users to make queries and receive results which do not satisfy their initial needs. The Information Retrieval in Context (IRiX) technique allows for the information related to a specific theme to be related to the initial user query, enabling, in this way, better results. This study presents a prototype of a search engine based on contexts built from linguistic gatherings and on relationships defined by the user. The context information can be shared with softwares and other tool users with the objective of promoting a socialization of contexts

Relevância:

100.00% 100.00%

Publicador:

Resumo:

In some applications with case-based system, the attributes available for indexing are better described as linguistic variables instead of receiving numerical treatment. In these applications, the concept of fuzzy hypercube can be applied to give a geometrical interpretation of similarities among cases. This paper presents an approach that uses geometrical properties of fuzzy hypercube space to make indexing and retrieval processes of cases.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

One way to organize knowledge and make its search and retrieval easier is to create a structural representation divided by hierarchically related topics. Once this structure is built, it is necessary to find labels for each of the obtained clusters. In many cases the labels have to be built using only the terms in the documents of the collection. This paper presents the SeCLAR (Selecting Candidate Labels using Association Rules) method, which explores the use of association rules for the selection of good candidates for labels of hierarchical document clusters. The candidates are processed by a classical method to generate the labels. The idea of the proposed method is to process each parent-child relationship of the nodes as an antecedent-consequent relationship of association rules. The experimental results show that the proposed method can improve the precision and recall of labels obtained by classical methods. © 2010 Springer-Verlag.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

A comparative evaluation was made of the use of natural language versus two specialized indexing languages, aiming to demonstrate the influence of the availability of indexing languages on the functioning of information retrieval systems. The study was conducted within the ambit of the construction of search strategies by subject in online university library catalogs. The precision ratio was calculated to determine the accuracy of each indexing language in subjectbased information retrieval. From the comparative evaluation of the use of indexing languages, it was concluded that the term specificity required by the user during retrieval was more satisfactory when the query was made through controlled languages, whose availability and simplicity is also an indispensable requisite.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

One way to organize knowledge and make its search and retrieval easier is to create a structural representation divided by hierarchically related topics. Once this structure is built, it is necessary to find labels for each of the obtained clusters. In many cases the labels must be built using all the terms in the documents of the collection. This paper presents the SeCLAR method, which explores the use of association rules in the selection of good candidates for labels of hierarchical document clusters. The purpose of this method is to select a subset of terms by exploring the relationship among the terms of each document. Thus, these candidates can be processed by a classical method to generate the labels. An experimental study demonstrates the potential of the proposed approach to improve the precision and recall of labels obtained by classical methods only considering the terms which are potentially more discriminative. © 2012 - IOS Press and the authors. All rights reserved.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Pós-graduação em Ciência da Informação - FFC

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Pós-graduação em Ciência da Informação - FFC

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The classification of texts has become a major endeavor with so much electronic material available, for it is an essential task in several applications, including search engines and information retrieval. There are different ways to define similarity for grouping similar texts into clusters, as the concept of similarity may depend on the purpose of the task. For instance, in topic extraction similar texts mean those within the same semantic field, whereas in author recognition stylistic features should be considered. In this study, we introduce ways to classify texts employing concepts of complex networks, which may be able to capture syntactic, semantic and even pragmatic features. The interplay between various metrics of the complex networks is analyzed with three applications, namely identification of machine translation (MT) systems, evaluation of quality of machine translated texts and authorship recognition. We shall show that topological features of the networks representing texts can enhance the ability to identify MT systems in particular cases. For evaluating the quality of MT texts, on the other hand, high correlation was obtained with methods capable of capturing the semantics. This was expected because the golden standards used are themselves based on word co-occurrence. Notwithstanding, the Katz similarity, which involves semantic and structure in the comparison of texts, achieved the highest correlation with the NIST measurement, indicating that in some cases the combination of both approaches can improve the ability to quantify quality in MT. In authorship recognition, again the topological features were relevant in some contexts, though for the books and authors analyzed good results were obtained with semantic features as well. Because hybrid approaches encompassing semantic and topological features have not been extensively used, we believe that the methodology proposed here may be useful to enhance text classification considerably, as it combines well-established strategies. (c) 2012 Elsevier B.V. All rights reserved.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Except the article forming the main content most HTML documents on the WWW contain additional contents such as navigation menus, design elements or commercial banners. In the context of several applications it is necessary to draw the distinction between main and additional content automatically. Content extraction and template detection are the two approaches to solve this task. This thesis gives an extensive overview of existing algorithms from both areas. It contributes an objective way to measure and evaluate the performance of content extraction algorithms under different aspects. These evaluation measures allow to draw the first objective comparison of existing extraction solutions. The newly introduced content code blurring algorithm overcomes several drawbacks of previous approaches and proves to be the best content extraction algorithm at the moment. An analysis of methods to cluster web documents according to their underlying templates is the third major contribution of this thesis. In combination with a localised crawling process this clustering analysis can be used to automatically create sets of training documents for template detection algorithms. As the whole process can be automated it allows to perform template detection on a single document, thereby combining the advantages of single and multi document algorithms.