805 results for controlled vocabularies


Relevance: 60.00%

Abstract:

Presentation at Open Repositories 2014, Helsinki, Finland, June 9-13, 2014

Relevance: 60.00%

Abstract:

Climate modeling is a complex process, requiring accurate and complete metadata in order to identify, assess and use climate data stored in digital repositories. The preservation of such data is increasingly important given the development of increasingly complex models to predict the effects of global climate change. The EU METAFOR project has developed a Common Information Model (CIM) to describe climate data and the models and modelling environments that produce this data. There is a wide degree of variability between different climate models and modelling groups. To accommodate this, the CIM has been designed to be highly generic and flexible, with extensibility built in. METAFOR describes the climate modelling process simply as "an activity undertaken using software on computers to produce data." This process has been described as separate UML packages (and, ultimately, XML schemas). This fairly generic structure can be paired with more specific "controlled vocabularies" in order to restrict the range of valid CIM instances. The CIM will aid digital preservation of climate models as it will provide an accepted standard structure for the model metadata. Tools to write and manage CIM instances, and to allow convenient and powerful searches of CIM databases, are also under development. Community buy-in of the CIM has been achieved through a continual process of consultation with the climate modelling community, and through the METAFOR team’s development of a questionnaire that will be used to collect the metadata for the Intergovernmental Panel on Climate Change’s (IPCC) Coupled Model Intercomparison Project Phase 5 (CMIP5) model runs.
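The idea of pairing a generic structure with a controlled vocabulary to restrict the range of valid instances can be sketched as follows. This is a minimal illustration only, not the METAFOR tooling: the `ModelComponent` record, its field names, and the vocabulary entries are all hypothetical and are not taken from the CIM.

```python
from dataclasses import dataclass

# Hypothetical controlled vocabulary: the allowed values for each restricted field.
# These entries are illustrative and are NOT taken from the CIM or CMIP5.
CONTROLLED_VOCABULARY = {
    "component_type": {"atmosphere", "ocean", "sea_ice", "land_surface"},
    "calendar": {"gregorian", "360_day", "noleap"},
}


@dataclass
class ModelComponent:
    """A generic record: by itself it accepts any string in every field."""
    name: str
    component_type: str
    calendar: str


def validate(component, vocabulary=CONTROLLED_VOCABULARY):
    """Return the (field, value) pairs whose value is not in the vocabulary."""
    errors = []
    for field_name, allowed in vocabulary.items():
        value = getattr(component, field_name)
        if value not in allowed:
            errors.append((field_name, value))
    return errors


valid = ModelComponent("demo-ocean", "ocean", "noleap")
invalid = ModelComponent("demo-ocean", "oceans", "noleap")
print(validate(valid))    # no violations
print(validate(invalid))  # reports the misspelled component type
```

The generic record stays unchanged when the vocabulary evolves; only the lookup table is edited, which is one way to keep the structure flexible while still constraining instances.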

Relevance: 60.00%

Abstract:

The Metafor project has developed a common information model (CIM) using the ISO19100 series formalism to describe numerical experiments carried out by the Earth system modelling community, the models they use, and the simulations that result. Here we describe the mechanism by which the CIM was developed, and its key properties. We introduce the conceptual and application versions and the controlled vocabularies developed in the context of supporting the fifth Coupled Model Intercomparison Project (CMIP5). We describe how the CIM has been used in experiments to describe model coupling properties and describe the near term expected evolution of the CIM.

Relevance: 60.00%

Abstract:

Interoperability of water quality data depends on the use of common models, schemas and vocabularies. However, terms are usually collected during different activities and projects in isolation from one another, resulting in vocabularies that have the same scope being represented with different terms, using different formats and formalisms, and published through various access methods. Significantly, most water quality vocabularies conflate multiple concepts in a single term, e.g. quantity kind, units of measure, substance or taxon, medium and procedure. This bundles information associated with separate elements from the OGC Observations and Measurements (O&M) model into a single slot. We have developed a water quality vocabulary, formalized using RDF, and published as Linked Data. The terms were extracted from existing water quality vocabularies. The observable property model is inspired by O&M but aligned with existing ontologies. The core is an OWL ontology that extends the QUDT ontology for Unit and QuantityKind definitions. We add classes to generalize the QuantityKind model, and properties for explicit description of the conflated concepts. The key elements are defined to be sub-classes or sub-properties of SKOS elements, which enables a SKOS view to be published through standard vocabulary APIs, alongside the full view. QUDT terms are re-used where possible, supplemented with additional Unit and QuantityKind entries required for water quality. Along with items from separate vocabularies developed for objects, media, and procedures, these are linked into definitions in the actual observable property vocabulary. Definitions of objects related to chemical substances are linked to items from the Chemical Entities of Biological Interest (ChEBI) ontology. Mappings to other vocabularies, such as DBPedia, are in separately maintained files.
By formalizing the model for observable properties, and clearly labelling the separate concerns, water quality observations from different sources may be more easily merged and also transformed to O&M for cross-domain applications.
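As a rough illustration of why separating the conflated concerns helps with merging, the sketch below splits two source terms into explicit facets and compares them. It uses plain Python rather than RDF/OWL, and every term, facet value, and function name here is a hypothetical example, not an entry from the published vocabulary.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ObservableProperty:
    """One slot per concern instead of a single conflated term."""
    quantity_kind: str
    unit: str
    substance_or_taxon: Optional[str] = None
    medium: Optional[str] = None
    procedure: Optional[str] = None


# Hypothetical conflated terms from two imaginary source vocabularies.
DECOMPOSED = {
    "NO3_water_mgL": ObservableProperty(
        "mass concentration", "mg/L", "nitrate", "water"),
    "Nitrate (dissolved) ug/L": ObservableProperty(
        "mass concentration", "ug/L", "nitrate", "water"),
    "Turbidity NTU": ObservableProperty(
        "turbidity", "NTU", None, "water"),
}


def same_observable(a, b):
    """True when two properties measure the same thing, ignoring unit and procedure."""
    return (a.quantity_kind, a.substance_or_taxon, a.medium) == (
        b.quantity_kind, b.substance_or_taxon, b.medium)


a = DECOMPOSED["NO3_water_mgL"]
b = DECOMPOSED["Nitrate (dissolved) ug/L"]
print(same_observable(a, b))  # the two source terms describe the same property
print(a.unit == b.unit)       # but they still need a unit conversion to merge
```

With the facets explicit, matching records across sources becomes a field comparison, and the remaining differences (here, the unit) are visible rather than hidden inside the term label.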

Relevance: 60.00%

Abstract:

Graduate Program in Information Science - FFC

Relevance: 60.00%

Abstract:

Graduate Program in Information Science - FFC

Relevance: 60.00%

Abstract:

Graduate Program in Information Science - FFC

Relevance: 60.00%

Abstract:

Graduate Program in Information Science - FFC

Relevance: 60.00%

Abstract:

Controlled vocabularies are information representation tools needed to standardize content description and the classification of information, making information systems consistent and minimizing the dispersion of information. One of the most critical aspects of controlled vocabularies is the need for constant updating, both of the terminology and of the computer system. The purpose of this paper is to share the experience of the Sistema Integrado de Bibliotecas da Universidade de São Paulo (SIBiUSP) in planning and developing an innovation plan for its Controlled Vocabulary, reporting its goals and actions. These actions are at different stages of progress, so the results are provisional. The article also describes the steps taken and the difficulties encountered, as a contribution of knowledge for professionals who work with and research controlled vocabularies.

Relevance: 60.00%

Abstract:

Web-based management systems built on metadata allow efficient maintenance of large quantities of information. A controlled vocabulary such as the one used by the Sistema Integrado de Bibliotecas da USP (SIBi/USP) requires continuous updating, carried out through a collaborative network with the participation of indexing librarians from all areas of knowledge. This paper presents the results obtained with the management system developed by the Management Group for the maintenance of the SIBi/USP Controlled Vocabulary. The workflow of this system consists of validation filters applied by the members of the Vocabulary Management Group. In addition to this system, the Vocabulary's management methodology includes a governance policy. The results obtained in the six years since the management system was activated through the Suggestions Database (Base de Sugestões) were: 1192 descriptor additions, 240 changes, and 61 deletions, totaling 1493 operations. Management and quality control of the Vocabulary have improved the processing and retrieval of information in USP's Bibliographic Database, DEDALUS.

Relevance: 60.00%

Abstract:

This essay presents the construction of a research object based on the theory of the semiotics of culture. It reflects on the modeling systems involved in the cycle of scientific communication within a university research group, from information seeking to the publication of study results. Natural languages (human languages) and artificial languages (computer search languages and controlled vocabularies) are identified. From this, the object takes shape as the set of texts of the culture and the semiosphere itself, represented by the dialogues of the subjects of the culture and the communication process involved. Some challenges arise, such as the need to deepen knowledge of the theory of the semiotics of culture, the participation of the researcher also as a subject of the research, and working with interdisciplinarity to study an object from the perspectives of information science, biomedicine, semiotics and other related disciplines.

Relevance: 60.00%

Abstract:

Library of Congress Subject Headings (LCSH), the standard subject language used in library catalogues, are often criticized for their lack of currency, biased language, and atypical syndetic structure. Conversely, folksonomies (or tags), which rely on the natural language of their users, offer a flexibility often lacking in controlled vocabularies and may offer a means of augmenting more rigid controlled vocabularies such as LCSH. Content analysis studies have demonstrated the potential for folksonomies to be used as a means of enhancing subject access to materials, and libraries are beginning to integrate tagging systems into their catalogues. This study examines the utility of tags as a means of enhancing subject access to materials in library online public access catalogues (OPACs) through usability testing with the LibraryThing for Libraries catalogue enhancements. Findings indicate that while they cannot replace LCSH, tags do show promise for aiding information seeking in OPACs. In the context of information systems design, the study revealed that while folksonomies have the potential to enhance subject access to materials, that potential is severely limited by the current inability of catalogue interfaces to support tag-based searches alongside standard catalogue searches.

Relevance: 60.00%

Abstract:

The Gene Expression Database (GXD) is a community resource of gene expression information for the laboratory mouse. By combining the different types of expression data, GXD aims to provide increasingly complete information about the expression profiles of genes in different mouse strains and mutants, thus enabling valuable insights into the molecular networks that underlie normal development and disease. GXD is integrated with the Mouse Genome Database (MGD). Extensive interconnections with sequence databases and with databases from other species, and the development and use of shared controlled vocabularies extend GXD’s utility for the analysis of gene expression information. GXD is accessible through the Mouse Genome Informatics web site at http://www.informatics.jax.org/ or directly at http://www.informatics.jax.org/menus/expression_menu.shtml.

Relevance: 60.00%

Abstract:

Manual curation has long been held to be the gold standard for functional annotation of DNA sequence. Our experience with the annotation of more than 20,000 full-length cDNA sequences revealed problems with this approach, including inaccurate and inconsistent assignment of gene names, as well as many good assignments that were difficult to reproduce using only computational methods. For the FANTOM2 annotation of more than 60,000 cDNA clones, we developed a number of methods and tools to circumvent some of these problems, including an automated annotation pipeline that provides high-quality preliminary annotation for each sequence by introducing an uninformative filter that eliminates uninformative annotations, controlled vocabularies to accurately reflect both the functional assignments and the evidence supporting them, and a highly refined, Web-based manual annotation tool that allows users to view a wide array of sequence analyses and to assign gene names and putative functions using a consistent nomenclature. The ultimate utility of our approach is reflected in the low rate of reassignment of automated assignments by manual curation. Based on these results, we propose a new standard for large-scale annotation, in which the initial automated annotations are manually investigated and then computational methods are iteratively modified and improved based on the results of manual curation.
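The notion of a filter that eliminates uninformative annotations before a name is assigned can be sketched as below. This is an assumption-laden toy, not the FANTOM2 pipeline: the patterns, the function names, and the rule of taking the first surviving candidate are all hypothetical choices made for illustration.

```python
import re

# Hypothetical patterns marking a description as uninformative.
# These are illustrative guesses, NOT the rules used by FANTOM2.
UNINFORMATIVE_PATTERNS = [
    re.compile(r"\bhypothetical protein\b"),
    re.compile(r"\bunknown\b"),
    re.compile(r"\bunnamed protein product\b"),
]


def is_informative(description):
    """Return False for empty descriptions or ones matching any pattern."""
    text = description.strip().lower()
    if not text:
        return False
    return not any(p.search(text) for p in UNINFORMATIVE_PATTERNS)


def first_informative(candidates):
    """Return the first informative description from a ranked list, or None."""
    for description in candidates:
        if is_informative(description):
            return description
    return None


# Candidate descriptions for one sequence, assumed ranked by similarity score.
hits = ["hypothetical protein", "Unknown (protein for MGC:1234)", "cyclin D1"]
print(first_informative(hits))
```

When no candidate survives the filter, the function returns `None`, leaving the sequence for manual curation instead of propagating a meaningless name.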