924 resultados para Information Retrieval, Document Databases, Digital Libraries
Resumo:
The need to teach information literacy skills to undergraduate students is often framed as a 21st century concern, but debate over the value and practice of teaching this set of skills can be found as far back as the early 1900’s. This article reviews the history of information literacy instruction in academic libraries from its origins to the present, examines the current state of information literacy instruction in academic libraries, and explores possible future directions that this instruction may take. Looking to the past, present and future shows that while library instruction has evolved, many central concerns remain unanswered.
Resumo:
The explosion of multimedia digital content and the development of technologies that go beyond traditional broadcast and TV have rendered access to such content important for all end-users of these technologies. While originally developed for providing access to multimedia digital libraries, video search technologies assume now a more demanding role. In this paper, we attempt to shed light onto this new role of video search technologies, looking at the rapid developments in the related market, the lessons learned from state of art video search prototypes developed mainly in the digital libraries context and the new technological challenges that have risen. We focus on one of the latter, i.e., the development of cross-media decision mechanisms, drawing examples from REVEAL THIS, an FP6 project on the retrieval of video and language for the home user. We argue, that efficient video search holds a key to the usability of the new ”pervasive digital video” technologies and that it should involve cross-media decision mechanisms.
Resumo:
The Digital Public Library of America (DPLA) is a digital library that strives to serve the public through digital collections accumulated from a wide variety of partners. Our chosen topic for the DPLA exhibit project is Perspectives on the Vietnam War. The Vietnam War remains a controversial topic of national interest, making it a topic of depth and of many perspectives. Our goals with this exhibit were to gather different perspectives of the war through personal stories, the media, presidential administrations of the war, military personnel, and the general public, including famous figures. We strove to demonstrate the variety of perspectives on the Vietnam War through a variation of digital objects and content that will be engaging for users: both black and white and color photos, videos, and audio files. Furthermore, we wanted to ensure that our digital materials are of high quality, properly documented, and easy to search and find thus all of our objects are from DPLA and are from usable original sources. This poster will describe our processes for organizational, object selection, building our exhibit, attainment of our goals, and detailed steps of our overall operation. The poster will also include details about the minor issues and bumps that occurred while reaching our final product as well as the team members’ perspectives on the project as a whole including: problems, words to for the wise, and triumphs.
Resumo:
Este estudio analiza la importancia de las TIC al servicio de las bibliotecas en general y, en concreto, el potencial de las bibliotecas digitales. En este trabajo se reflexiona sobre la importancia de las bibliotecas digitales no sólo como repositorios de contenidos, sino también como centros de creación de conocimiento. Se reflexiona sobre la importancia de la constitución de bibliotecas digitales especializadas en áreas culturales relacionadas y sobre el hecho de que sean multilingües, a fin de preservar los contenidos en sus lenguas originales, al tiempo que debe trabajarse con la traducción multilingüe (de y a muchas lenguas) como herramienta fundamental para la mejora de la difusión y conocimiento del patrimonio que se contiene en tales bibliotecas. En este sentido se explican las características de la Biblioteca Digital Plurilingüe del Mediterráneo-IVITRA.
Resumo:
"November 1994."
Resumo:
Includes bibliographical references.
Resumo:
Document classification is a supervised machine learning process, where predefined category labels are assigned to documents based on the hypothesis derived from training set of labelled documents. Documents cannot be directly interpreted by a computer system unless they have been modelled as a collection of computable features. Rogati and Yang [M. Rogati and Y. Yang, Resource selection for domain-specific cross-lingual IR, in SIGIR 2004: Proceedings of the 27th annual international conference on Research and Development in Information Retrieval, ACM Press, Sheffied: United Kingdom, pp. 154-161.] pointed out that the effectiveness of document classification system may vary in different domains. This implies that the quality of document model contributes to the effectiveness of document classification. Conventionally, model evaluation is accomplished by comparing the effectiveness scores of classifiers on model candidates. However, this kind of evaluation methods may encounter either under-fitting or over-fitting problems, because the effectiveness scores are restricted by the learning capacities of classifiers. We propose a model fitness evaluation method to determine whether a model is sufficient to distinguish positive and negative instances while still competent to provide satisfactory effectiveness with a small feature subset. Our experiments demonstrated how the fitness of models are assessed. The results of our work contribute to the researches of feature selection, dimensionality reduction and document classification.
Resumo:
Music similarity query based on acoustic content is becoming important with the ever-increasing growth of the music information from emerging applications such as digital libraries and WWW. However, relative techniques are still in their infancy and much less than satisfactory. In this paper, we present a novel index structure, called Composite Feature tree, CF-tree, to facilitate efficient content-based music search adopting multiple musical features. Before constructing the tree structure, we use PCA to transform the extracted features into a new space sorted by the importance of acoustic features. The CF-tree is a balanced multi-way tree structure where each level represents the data space at different dimensionalities. The PCA transformed data and reduced dimensions in the upper levels can alleviate suffering from dimensionality curse. To accurately mimic human perception, an extension, named CF+-tree, is proposed, which further applies multivariable regression to determine the weight of each individual feature. We conduct extensive experiments to evaluate the proposed structures against state-of-art techniques. The experimental results demonstrate superiority of our technique.
Resumo:
The main aim of the proposed approach presented in this paper is to improve Web information retrieval effectiveness by overcoming the problems associated with a typical keyword matching retrieval system, through the use of concepts and an intelligent fusion of confidence values. By exploiting the conceptual hierarchy of the WordNet (G. Miller, 1995) knowledge base, we show how to effectively encode the conceptual information in a document using the semantic information implied by the words that appear within it. Rather than treating a word as a string made up of a sequence of characters, we consider a word to represent a concept.
Resumo:
During the MEMORIAL project time an international consortium has developed a software solution called DDW (Digital Document Workbench). It provides a set of tools to support the process of digitisation of documents from the scanning up to the retrievable presentation of the content. The attention is focused to machine typed archival documents. One of the important features is the evaluation of quality in each step of the process. The workbench consists of automatic parts as well as of parts which request human activity. The measurable improvement of 20% shows the approach is successful.
Resumo:
Content creation and presentation are key activities in a multimedia digital library (MDL). The proper design and intelligent implementation of these services provide a stable base for overall MDL functionality. This paper presents the framework and the implementation of these services in the latest version of the “Virtual Encyclopaedia of Bulgarian Iconography” multimedia digital library. For the semantic description of the iconographical objects a tree-based annotation template is implemented. It provides options for autocompletion, reuse of values, bilingual entering of data, automated media watermarking, resizing and conversing. The paper describes in detail the algorithm for automated appearance of dependent values for different characteristics of an iconographical object. An algorithm for avoiding duplicate image objects is also included. The service for automated appearance of new objects in a collection after their entering is included as an important part of the content presentation. The paper also presents the overall service-based architecture of the library, covering its main service panels, repositories and their relationships. The presented vision is based on a long-term observation of the users’ preferences, cognitive goals, and needs, aiming to find an optimal functionality solution for the end users.
Resumo:
This article briefly reviews the software developments for digital presentation and preservation of Bulgarian folklore treasure created within the project “Knowledge Technologies for Creation of Digital Presentation and Significant Repositories of Folklore Heritage” by teams of the Institute of Mathematics and Informatics.