950 resultados para Information Discovery Paradigm,
Resumo:
The domain of Knowledge Discovery (KD) and Data Mining (DM) is of growing importance in a time where more and more data is produced and knowledge is one of the most precious assets. Having explored both the existing underlying theory, the results of the ongoing research in academia and the industry practices in the domain of KD and DM, we have found that this is a domain that still lacks some systematization. We also found that this systematization exists to a greater degree in the Software Engineering and Requirements Engineering domains, probably due to being more mature areas. We believe that it is possible to improve and facilitate the participation of enterprise stakeholders in the requirements engineering for KD projects by systematizing requirements engineering process for such projects. This will, in turn, result in more projects that end successfully, that is, with satisfied stakeholders, including in terms of time and budget constraints. With this in mind and based on all information found in the state-of-the art, we propose SysPRE - Systematized Process for Requirements Engineering in KD projects. We begin by proposing an encompassing generic description of the KD process, where the main focus is on the Requirements Engineering activities. This description is then used as a base for the application of the Design and Engineering Methodology for Organizations (DEMO) so that we can specify a formal ontology for this process. The resulting SysPRE ontology can serve as a base that can be used not only to make enterprises become aware of their own KD process and requirements engineering process in the KD projects, but also to improve such processes in reality, namely in terms of success rate.
Resumo:
Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
The quest for new control strategies for ticks can profit from high throughput genomics. In order to identify genes that are involved in oogenesis and development, in defense, and in hematophagy, the transcriptomes of ovaries, hemocytes, and salivary glands from rapidly ingurgitating females, and of salivary glands from males of Boophilus microplus were PCR amplified, and the expressed sequence tags (EST) of random clones were mass sequenced. So far, more than 1,344 EST have been generated for these tissues, with approximately 30% novelty, depending on the the tissue studied. To date approximately 760 nucleotide sequences from B. microplus are deposited in the NCBI database. Mass sequencing of partial cDNAs of parasite genes can build up this scant database and rapidly generate a large quantity of useful information about potential targets for immunobiological or chemical control.
Resumo:
This paper describes a data mining environment for knowledge discovery in bioinformatics applications. The system has a generic kernel that implements the mining functions to be applied to input primary databases, with a warehouse architecture, of biomedical information. Both supervised and unsupervised classification can be implemented within the kernel and applied to data extracted from the primary database, with the results being suitably stored in a complex object database for knowledge discovery. The kernel also includes a specific high-performance library that allows designing and applying the mining functions in parallel machines. The experimental results obtained by the application of the kernel functions are reported. © 2003 Elsevier Ltd. All rights reserved.
Resumo:
Includes bibliography
Resumo:
Includes bibliography
Resumo:
The uses of Information and Communication Technologies (ICT) and Web environments for creation, treatment and availability of information have supported the emergence of new social-cultural patterns represented by convergences in textual, image and audio languages. This paper describes and analyzes the National Archives Experience Digital Vaults as a digital publishing web environment and as a cultural heritage. It is a complex system - synthesizer of information design options at information setting, provides new aesthetic aspects, but specially enlarges the cognition of the subjects who interact with the environment. It also enlarges the institutional spaces that guard the collective memory beyond its role of keeping the physical patrimony collected there. Digital Vaults lies as a mix of guide and interactive catalogue to be dealt in a ludic way. The publishing design of the information held on the Archives is meant to facilitate access to knowledge. The documents are organized in a dynamic and not chronological way. They are not divided in fonds or distinct categories, but in controlled interaction of documents previously indexed and linked by the software. The software creates information design and view of documental content that can be considered a new paradigm in Information Science and are part of post-custodial regime, independent from physical spaces and institutions. Information professionals must be prepared to understand and work with the paradigmatic changes described and represented by the new hybrid digital environments; hence the importance of this paper. Cyberspace interactivity between user and the content provided by the environment design provide cooperation, collaboration and sharing knowledge actions, all features of networks, transforming culture globally. © 2011 - IOS Press and the authors. All rights reserved.
Resumo:
The uses of Information and Communication Technologies (ICT) and Web environments for creation, treatment and availability of information have supported the emergence of new social-cultural patterns represented by convergences in textual, image and audio languages. This paper describes and analyzes the National Archives Experience Digital Vaults as a digital publishing web environment and as a cultural heritage. It is a complex system - synthesizer of information design options at information setting, provides new aesthetic aspects, but specially enlarges the cognition of the subjects who interact with the environment. It also enlarges the institutional spaces that guard the collective memory beyond its role of keeping the physical patrimony collected there. Digital Vaults lies as a mix of guide and interactive catalogue to be dealt in a ludic way. The publishing design of the information held on the Archives is meant to facilitate access to knowledge. The documents are organized in a dynamic and not chronological way. They are not divided in fonds or distinct categories, but in controlled interaction of documents previously indexed and linked by the software. The software creates information design and view of documental content that can be considered a new paradigm in Information Science and are part of post-custodial regime, independent from physical spaces and institutions. Information professionals must be prepared to understand and work with the paradigmatic changes described and represented by the new hybrid digital environments; hence the importance of this paper. Cyberspace interactivity between user and the content provided by the environment design provide cooperation, collaboration and sharing knowledge actions, all features of networks, transforming culture globally.
Resumo:
The uses of Information and Communication Technologies (ICT) and Web environments for creation, treatment and availability of information have supported the emergence of new social-cultural patterns represented by convergences in textual, image and audio languages. This paper describes and analyzes the National Archives Experience Digital Vaults as a digital publishing web environment and as a cultural heritage. It is a complex system - synthesizer of information design options at information setting, provides new aesthetic aspects, but specially enlarges the cognition of the subjects who interact with the environment. It also enlarges the institutional spaces that guard the collective memory beyond its role of keeping the physical patrimony collected there. Digital Vaults lies as a mix of guide and interactive catalogue to be dealt in a ludic way. The publishing design of the information held on the Archives is meant to facilitate access to knowledge. The documents are organized in a dynamic and not chronological way. They are not divided in fonds or distinct categories, but in controlled interaction of documents previously indexed and linked by the software. The software creates information design and view of documental content that can be considered a new paradigm in Information Science and are part of post-custodial regime, independent from physical spaces and institutions. Information professionals must be prepared to understand and work with the paradigmatic changes described and represented by the new hybrid digital environments; hence the importance of this paper. Cyberspace interactivity between user and the content provided by the environment design provide cooperation, collaboration and sharing knowledge actions, all features of networks, transforming culture globally.
Resumo:
Effective communication is the soul of any institution. New opportunities are presented by Information and Communication Technology (ICTS). Library managers must adopt a more critical and communication plan that will enhance their skills and allow for further training, as well as allowing stakeholders to have input into decision-making.
Resumo:
The `Critically Endangered` Cone-billed Tanager Conothraupis mesoleuca was described in 71 93 9, based on a single specimen collected in the state of Mato Grosso, western Brazil. Not seen again in the wild until 2003, this poorly-known species was rediscovered in Emas National Park, in the Brazilian state of Goias. We describe here the discovery of a new population of Cone-billed Tanager in Chapada dos Parecis, along the upper Juruena River basin, in the state of Mato Grosso. The birds were always detected in (or near) flooded habitats along rivers. At least 40 individuals were found, but the population may be larger since areas of potential habitat are available in the upper Juruena basin and these have not yet been surveyed. We also provide here the first information on the biology and behaviour of the species based on observations in Juruena and Emas, as well as a first description of the female. Historical documents and our records support our suggestion that ""Juruena"", i.e. the type locality of the Cone-billed Tanager, refers to the Juruena telegraph station (12 degrees 50`S, 58 degrees 55`W). Considering that the range of the species is being settled, research on different aspects of its biology are urgent.
Resumo:
Background: Genome-wide association studies (GWAS) require large sample sizes to obtain adequate statistical power, but it may be possible to increase the power by incorporating complementary data. In this study we investigated the feasibility of automatically retrieving information from the medical literature and leveraging this information in GWAS. Methods: We developed a method that searches through PubMed abstracts for pre-assigned keywords and key concepts, and uses this information to assign prior probabilities of association for each single nucleotide polymorphism (SNP) with the phenotype of interest - the Adjusting Association Priors with Text (AdAPT) method. Association results from a GWAS can subsequently be ranked in the context of these priors using the Bayes False Discovery Probability (BFDP) framework. We initially tested AdAPT by comparing rankings of known susceptibility alleles in a previous lung cancer GWAS, and subsequently applied it in a two-phase GWAS of oral cancer. Results: Known lung cancer susceptibility SNPs were consistently ranked higher by AdAPT BFDPs than by p-values. In the oral cancer GWAS, we sought to replicate the top five SNPs as ranked by AdAPT BFDPs, of which rs991316, located in the ADH gene region of 4q23, displayed a statistically significant association with oral cancer risk in the replication phase (per-rare-allele log additive p-value [p(trend)] = 2.5 x 10(-3)). The combined OR for having one additional rare allele was 0.83 (95% CI: 0.76-0.90), and this association was independent of previously identified susceptibility SNPs that are associated with overall UADT cancer in this gene region. We also investigated if rs991316 was associated with other cancers of the upper aerodigestive tract (UADT), but no additional association signal was found. Conclusion: This study highlights the potential utility of systematically incorporating prior knowledge from the medical literature in genome-wide analyses using the AdAPT methodology. AdAPT is available online (url: http://services.gate.ac.uk/lld/gwas/service/config).
Resumo:
We extend our earlier results delineating the supersymmetry reach of the CERN Large Hadron Collider operating at a center-of-mass energy root s = 7 TeV to integrated luminosities in the range 5-30 fb(-1). Our results are presented within the paradigm minimal supergravity model or constrained minimal supersymmetric standard model. Using a six-dimensional grid of cuts for the optimization of signal to background ratio-including missing E-T-we find for m((g) over tilde) similar to m((q) over tilde) an LHC 5 sigma supersymmetry discovery reach of m((g) over tilde) similar to 1:3, 1.4, 1.5, and 1.6 TeV for 5, 10, 20, and 30 fb(-1), respectively. For m((q) over tilde) >> m((g) over tilde), the corresponding reach is instead m((g) over tilde) similar to 0: 8, 0.9, 1.0, and 1.05 TeV, for the same integrated luminosities.