66 resultados para Ontologies (Information Retrieval)
em Repositório Institucional UNESP - Universidade Estadual Paulista "Julio de Mesquita Filho"
Resumo:
Pós-graduação em Ciência da Informação - FFC
Resumo:
In some applications with case-based system, the attributes available for indexing are better described as linguistic variables instead of receiving numerical treatment. In these applications, the concept of fuzzy hypercube can be applied to give a geometrical interpretation of similarities among cases. This paper presents an approach that uses geometrical properties of fuzzy hypercube space to make indexing and retrieval processes of cases.
Resumo:
The need for the representation of both semantics and common sense and its organization in a lexical database or knowledge base has motivated the development of large projects, such as Wordnets, CYC and Mikrokosmos. Besides the generic bases, another approach is the construction of ontologies for specific domains. Among the advantages of such approach there is the possibility of a greater and more detailed coverage of a specific domain and its terminology. Domain ontologies are important resources in several tasks related to the language processing, especially in those related to information retrieval and extraction in textual bases. Information retrieval or even question and answer systems can benefit from the domain knowledge represented in an ontology. Besides embracing the terminology of the field, the ontology makes the relationships among the terms explicit. Copyright 2007 ACM.
Resumo:
This paper carries out a descriptive study on Portuguese adjectives. Our aim is to describe the semantics of the legal domain adjectives in order to construct an ontology which may improve Information Retrieval Systems. For this, we present an approach based on valency and semantic relations. The ontology proposed here is a first step aiming to build a legal ontology based on top-level concepts. © AEPIA.
Resumo:
Pós-graduação em Ciência da Informação - FFC
Resumo:
Introduction: In the Web environment, there is a need for greater care with regard to the processing of descriptive and thematic information. The concern with the recovery of information in computer systems precedes the development of the first personal computers. Models of information retrieval have been and are today widely used in databases specific to a field whose scope is known. Objectives: Verify how the issue of relevance is treated in the main computer models of information retrieval and, especially, as the issue is addressed in the future of the Web, the called Semantic Web. Methodology: Bibliographical research. Results: In the classical models studied here, it was realized that the main concern is retrieving documents whose description is closest to the search expression used by the user, which does not necessarily imply that this really needs. In semantic retrieval is the use of ontologies, feature that extends the user's search for a wider range of possible relevant options. Conclusions: The relevance is a subjective judgment and inherent to the user, it will depend on the interaction with the system and especially the fact that he expects to recover in your search. Systems that are based on a model of relevance are not popular, because it requires greater interaction and depend on the user's disposal. The Semantic Web is so far the initiative more efficient in the case of information retrieval in the digital environment.
Resumo:
The indexing process aims to represent synthetically the informational content of documents by a set of terms whose meanings indicate the themes or subjects treated by them. With the emergence of the Web, research in automatic indexing received major boost with the necessity of retrieving documents from this huge collection. The traditional indexing languages, used to translate the thematic content of documents in standardized terms, always proved efficient in manual indexing. Ontologies open new perspectives for research in automatic indexing, offering a computer-process able language restricted to a particular domain. The use of ontologies in the automatic indexing process allows using a specific domain language and a logical and conceptual framework to make inferences, and whose relations allow an expansion of the terms extracted directly from the text of the document. This paper presents techniques for the construction and use of ontologies in the automatic indexing process. We conclude that the use of ontologies in the indexing process allows to add not only new feature to the indexing process, but also allows us to think in new and advanced features in an information retrieval system.
Resumo:
This paper reports a research to evaluate the potential and the effects of use of annotated Paraconsistent logic in automatic indexing. This logic attempts to deal with contradictions, concerned with studying and developing inconsistency-tolerant systems of logic. This logic, being flexible and containing logical states that go beyond the dichotomies yes and no, permits to advance the hypothesis that the results of indexing could be better than those obtained by traditional methods. Interactions between different disciplines, as information retrieval, automatic indexing, information visualization, and nonclassical logics were considered in this research. From the methodological point of view, an algorithm for treatment of uncertainty and imprecision, developed under the Paraconsistent logic, was used to modify the values of the weights assigned to indexing terms of the text collections. The tests were performed on an information visualization system named Projection Explorer (PEx), created at Institute of Mathematics and Computer Science (ICMC - USP Sao Carlos), with available source code. PEx uses traditional vector space model to represent documents of a collection. The results were evaluated by criteria built in the information visualization system itself, and demonstrated measurable gains in the quality of the displays, confirming the hypothesis that the use of the para-analyser under the conditions of the experiment has the ability to generate more effective clusters of similar documents. This is a point that draws attention, since the constitution of more significant clusters can be used to enhance information indexing and retrieval. It can be argued that the adoption of non-dichotomous (non-exclusive) parameters provides new possibilities to relate similar information.
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
Pós-graduação em Ciência da Informação - FFC
Resumo:
Na atualidade a atribuição dos descritores de assuntos ou indexação do conteúdo dos livros, nem sempre está associada ao contexto concreto de cada biblioteca, provocando, em muitos casos, que a recuperação por assuntos não resulte adequada. Neste trabalho analisam-se os principais desafios e perspectivas da indexação dos livros, os avanços de análises de assuntos nos catálogos de bibliotecas, examinam-se procedimentos, instrumentos, regras e condutas utilizadas nas análises e representação do conteúdo dos livros. Também se mostra a interação entre o ensino, a pesquisa e a atuação profissional necessária para que os estudantes possam desenvolver competências na análise, na representação e na procura da informação, assim como os princípios - provavelmente menos evidentes- da organização do conhecimento. Este trabalho coloca em evidência que as políticas de gestão da informação, mais quantitativas que qualitativas, deixam num segundo plano o processamento intelectual do conteúdo prejudicando, desta maneira, a recuperação por assuntos através do catalogo da biblioteca. Finalmente, se recolhe uma serie de propostas docentes relacionadas com a atribuição de descritores de assuntos em contextos bibliotecários.
Resumo:
Avaliou-se o uso de linguagem documentária alfabética de catálogos coletivos, na perspectiva das bibliotecas universitárias e no contexto sociocognitivo dos indexadores e dos usuários. Concluiu-se que o uso adequado de linguagens documentárias de áreas científicas especializadas faz-se por meio da avaliação quanto à atualização, especificidade e compatibilidade para atender às necessidades de indexação e recuperação da informação.
Resumo:
Este estudo apresenta uma síntese bibliográfica sobre as metodologias de avaliação que foram propostas por pesquisadores internacionais e nacionais e utilizadas por indexadores de instituições de ensino e/ou pesquisas atuantes em unidades de informação e/ou centros de documentação, bem como aquelas que foram analisadas pelas opiniões dos próprios usuários da informação registrada e disponibilizada em inúmeros sistemas de informações, com enfoques nas abordagens quantitativa, qualitativa e qualitativa/cognitiva, respectivamente.
Resumo:
A política de indexação deve ser constituída de estratégias que permitam o alcance dos objetivos de recuperação do sistema de informação. O indexador tem a função primordial de compreender o documento ao realizar uma análise conceitual que represente adequadamente seu conteúdo. Utilizando a leitura como evento social/protocolo verbal em grupo, nosso objetivo é contribuir com a literatura sobre política de indexação e apresentar propostas de ensino de política de indexação direcionadas a alunos de graduação e pós-graduação, além de uma experiência de educação à distância com vistas à formação do bibliotecário em serviço. Os resultados obtidos demonstraram que a metodologia pode ser utilizada por sistemas de informação para que se tenha acesso ao conhecimento do indexador. Conclui que o indexador deve ser o alvo de investimento dos sistemas de informação e sugere aos sistemas de informação que a experiência do indexador também seja utilizada como parâmetro para política de indexação.
Resumo:
The indexing automation has been discussed by researches in the area of Information Science however the discussions have not been so clear on the use of indexing software. Thus, it is necessary to know the indexing software, as well as its application in the analysis of documentary contents. To do so, it is proposed, here, to investigate both the consistency of indexing and the exhaustiveness and precision of the information retrieval, by means of comparative analysis between SISA (Sistema de Indizacion Semi-Automatico) automatic index and BIREME ( Centro Latino-Americano e do Caribe de Informação em Ciencias da Saude) manual indexing. The aim of this paper is to contribute to the theoretical development of the indexing automation and the improvement of SISA. Thus, SISA application and evaluation was used based on the calculation of the consistency indexes between the two types of indexing, and the calculation of the exhaustiveness and precision indexes in information retrieval, by means of searching into BDSISA and BIREME databases, composed by descriptors taken from SISA and manual indexing respectively. The differences among the terms used in scientific papers comparing to the DeCS ones were the main difficult factor to achieve higher consistency indexes in the indexing. These differences influenced the exhaustiveness and precision indexes in the information retrieval, showing that it is necessary to improve the documentary language used by SISA software and to incorporate linguistic methods.