924 results for Information Retrieval, Document Databases, Digital Libraries
Abstract:
This work aims to build an adaptable frame-based system for processing Dravidian languages. There are about 17 languages in this family, spoken by the people of South India. Karaka relations are one of the most important features of Indian languages. They are the semantico-syntactic relations between verbs and other related constituents in a sentence. The karaka relations and surface case endings are analyzed for meaning extraction. This approach is comparable with the broad class of case-based grammars. The efficiency of this approach is put to the test in two applications: machine translation and a natural language interface (NLI) for information retrieval from databases. The system mainly consists of a morphological analyzer, a local word grouper, a parser for the source language and a sentence generator for the target language. The main contribution of this work is an elegant and compact account of the relation between vibhakthi and karaka roles in Dravidian languages. The same basic mapping also explains simple and complex sentences in these languages, which suggests that the solution is not just ad hoc but has a deeper underlying unity. This methodology could be extended to other free word order languages. Since the frames designed for meaning representation are general, they are adaptable to other languages in this group and to other applications.
Abstract:
Paper published in the proceedings of the National Conference "Education in the Information Society", Plovdiv, May 2011.
Abstract:
In the last few years, research on textual information systems has developed widely. The goal is to improve these systems so as to allow easy localization, treatment and access to information stored in digital format (digital databases, document databases, and so on). There are many applications focused on information access (for example, Web search systems like Google or Altavista). However, these applications have problems when they must access cross-language information, or when they need to show information in a language different from that of the query. This paper explores the use of syntactic-semantic patterns as a method for accessing multilingual information, and reviews, in the case of Information Retrieval, where it is possible and useful to employ patterns for the multilingual and interactive aspects. On the one hand, the multilingual aspects studied are those related to accessing documents in languages different from that of the query, as well as the automatic translation of the document, i.e. a pattern-based machine translation system. On the other hand, this paper examines in depth the interactive aspects related to reformulating a query based on the syntactic-semantic pattern of the request.
Abstract:
Existing theories of semantic cognition propose models of cognitive processing occurring in a conceptual space, where ‘meaning’ is derived from the spatial relationships between concepts’ mapped locations within the space. Information visualisation is a growing area of research within the field of information retrieval, and methods for presenting database contents visually in the form of spatial data management systems (SDMSs) are being developed. This thesis combined these two areas of research to investigate the benefits associated with employing spatial-semantic mapping (documents represented as objects in two- and three-dimensional virtual environments are proximally mapped dependent on the semantic similarity of their content) as a tool for improving retrieval performance and navigational efficiency when browsing for information within such systems. Positive effects associated with the quality of document mapping were observed; improved retrieval performance and browsing behaviour were witnessed when mapping was optimal. It was also shown that using a third dimension for virtual environment (VE) presentation provides sufficient additional information regarding the semantic structure of the environment that performance is increased in comparison to using two dimensions for mapping. A model that describes the relationship between retrieval performance and browsing behaviour was proposed on the basis of the findings. Individual differences were not found to have any observable influence on retrieval performance or browsing behaviour when mapping quality was good. The findings from this work have implications both for cognitive modelling of semantic information and for designing and testing information visualisation systems. These implications are discussed in the conclusions of this work.
Abstract:
The increasing amount of available semistructured data demands efficient mechanisms to store, process, and search an enormous corpus of data to encourage its global adoption. Current techniques to store semistructured documents either map them to relational databases, or use a combination of flat files and indexes. These two approaches result in a mismatch between the tree-structure of semistructured data and the access characteristics of the underlying storage devices. Furthermore, the inefficiency of XML parsing methods has slowed down the large-scale adoption of XML into actual system implementations. The recent development of lazy parsing techniques is a major step towards improving this situation, but lazy parsers still have significant drawbacks that undermine the massive adoption of XML. Once the processing (storage and parsing) issues for semistructured data have been addressed, another key challenge to leverage semistructured data is to perform effective information discovery on such data. Previous works have addressed this problem in a generic (i.e. domain independent) way, but this process can be improved if knowledge about the specific domain is taken into consideration. This dissertation had two general goals: The first goal was to devise novel techniques to efficiently store and process semistructured documents. This goal had two specific aims: We proposed a method for storing semistructured documents that maps the physical characteristics of the documents to the geometrical layout of hard drives. We developed a Double-Lazy Parser for semistructured documents which introduces lazy behavior in both the pre-parsing and progressive parsing phases of the standard Document Object Model's parsing mechanism. The second goal was to construct a user-friendly and efficient engine for performing Information Discovery over domain-specific semistructured documents. 
This goal also had two aims: We presented a framework that exploits the domain-specific knowledge to improve the quality of the information discovery process by incorporating domain ontologies. We also proposed meaningful evaluation metrics to compare the results of search systems over semistructured documents.
Abstract:
Training in information competencies, or information literacy, is one of the current challenges facing university libraries, given the access to vast information resources that digital media make possible; this access requires users to better understand and apply selection and assessment criteria in order to retrieve the most relevant, highest-quality information for their needs. In this context, Ibero-American university libraries (Latin America, Spain and Portugal) have been slowly incorporating this training, either through direct training programs offered by the library or through collaborative work with teachers and schools within university curricula, as a whole or in specific disciplines. This text finds that, judging from the information currently displayed on the websites of universities (HEIs) in Costa Rica, only a very small percentage of university libraries appear to be taking action at level 1 or 2 of information literacy incorporation: a large share still focus their programs and processes on traditional user training, while another large share, unfortunately, report no actions at all from the training perspective that any library should have.
Abstract:
This paper presents the results of my action research. I was involved in establishing and running a digital library that was founded by the government of South Korea. The process involved understanding the relationship between the national IT infrastructure and the success factors of the digital library. In building the national IT infrastructure, a digital library system was implemented; it combines all existing digitized university libraries and can provide overseas information, such as foreign journal articles, instantly and freely to every Korean researcher. An empirical survey was conducted as part of the action research; the survey determined user satisfaction with the newly established national digital library. After obtaining the survey results, I suggested that the current way of running the nationwide government-owned digital library should be retained. (C) 2002 Elsevier Science B.V. All rights reserved.
Abstract:
Dissertation submitted in partial fulfilment of the requirements for the Degree of Master of Science in Geospatial Technologies.
Abstract:
This work offers an approach to the treatment of electronic documents (especially Internet resources) in the context of the library of a distance-learning university. It also describes the experience of the Library of the Universitat Oberta de Catalunya in creating electronic pages with the tables of contents of journals and monographs that form part of the library's collection. All these value-added services are aimed directly at users who contact the library from the UOC Virtual Campus.
Abstract:
Given the existing challenges in accessing print-based publications in developing countries, digital libraries are seen as a good alternative. Thus, it is important to understand how such libraries are used in these contexts, especially when compared with the usage of traditional libraries. This paper analyzes and compares the usage of the digital and traditional libraries of the University Jean Piaget of Cape Verde, aiming at understanding the way they are used, and the relation between the access to the existing information resources in these two libraries.
Abstract:
Peer-reviewed
Abstract:
This research project is a contribution to the global field of information retrieval, specifically, to develop tools to enable information access in digital documents. We recognize the need to provide the user with flexible access to the contents of large, potentially complex digital documents, with means other than a search function or a handful of metadata elements. The goal is to produce a text browsing tool offering a maximum of information based on a fairly superficial linguistic analysis. We are concerned with a type of extensive single-document indexing, and not indexing by a set of keywords (see Klement, 2002, for a clear distinction between the two). The desired browsing tool would not only give at a glance the main topics discussed in the document, but would also present relationships between these topics. It would also give direct access to the text (via hypertext links to specific passages). The present paper, after reviewing previous research on this and similar topics, discusses the methodology and the main characteristics of a prototype we have devised. Experimental results are presented, as well as an analysis of remaining hurdles and potential applications.
Abstract:
Conceptual Information Systems provide a multi-dimensional, conceptually structured view on data stored in relational databases. By restricting the expressiveness of the retrieval language, they allow the visualization of sets of related queries in conceptual hierarchies, hence supporting the search for something of which one has no precise description but only a vague idea. Information Retrieval is considered as the process of finding, out of a large set of objects, specific objects (documents etc.) which fit some description. In some data analysis and knowledge discovery applications, the dual task is of interest: the analyst needs to determine, for a subset of objects, a description for this subset. In this paper we discuss how Conceptual Information Systems can be extended to also support the second task.
Digital libraries in Architecture and Urbanism: a study on digital information architecture
Abstract:
The goal of this paper was to survey the state of the art of Digital Libraries in Architecture and Urbanism in Higher Education Institutions (IES), through conceptualizations and by showing the importance of Digital Libraries in disseminating information and easing its transfer. Questions of digital information architecture, usability, digital preservation and accessibility were addressed. The research was carried out on the websites of Brazilian universities, first to identify the institutions that offered the Architecture and Urbanism course, focusing on postgraduate education. After these were identified, the research proceeded by analyzing the contents, storage, dissemination and access to information of these libraries. It was found that digital libraries are increasingly becoming part of organizations and educational institutions, focusing on knowledge dissemination by releasing in digital form information that may be needed by the institution or the individual. The physical and computational restructuring of the Board of Studies and Research in Architecture and Urbanism (Câmara de Estudos e Pesquisa em Arquitetura e Urbanismo, CEPAU), of the Architecture and Urbanism Course of the Federal University of Rio Grande do Norte (UFRN), was also monitored, showing the need to install a Digital Library to integrate the databases of PPGAU's research groups, which today remain independent, with no interface among themselves. Architecture and Urbanism was chosen as the research area because there is a gap and little documentation about digital libraries in this area.
Abstract:
Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)