1 resultado para Data Mining, Big Data, Consumi energetici, Weka Data Cleaning
em Illinois Digital Environment for Access to Learning and Scholarship Repository
Filtro por publicador
- JISC Information Environment Repository (1)
- Repository Napier (1)
- Aberdeen University (4)
- Abertay Research Collections - Abertay University’s repository (1)
- Aberystwyth University Repository - Reino Unido (3)
- Academic Archive On-line (Jönköping University; Sweden) (1)
- Academic Archive On-line (Mid Sweden University; Sweden) (1)
- Acceda, el repositorio institucional de la Universidad de Las Palmas de Gran Canaria. España (2)
- AMS Tesi di Dottorato - Alm@DL - Università di Bologna (8)
- AMS Tesi di Laurea - Alm@DL - Università di Bologna (76)
- Aquatic Commons (2)
- ArchiMeD - Elektronische Publikationen der Universität Mainz - Alemanha (2)
- Archivo Digital para la Docencia y la Investigación - Repositorio Institucional de la Universidad del País Vasco (2)
- Aston University Research Archive (16)
- Avian Conservation and Ecology - Eletronic Cientific Hournal - Écologie et conservation des oiseaux: (1)
- Biblioteca de Teses e Dissertações da USP (2)
- Biblioteca Digital da Produção Intelectual da Universidade de São Paulo (5)
- Biblioteca Digital da Produção Intelectual da Universidade de São Paulo (BDPI/USP) (2)
- BORIS: Bern Open Repository and Information System - Berna - Suiça (21)
- Bulgarian Digital Mathematics Library at IMI-BAS (14)
- CaltechTHESIS (1)
- Cambridge University Engineering Department Publications Database (5)
- CentAUR: Central Archive University of Reading - UK (52)
- Chinese Academy of Sciences Institutional Repositories Grid Portal (5)
- Cochin University of Science & Technology (CUSAT), India (8)
- Comissão Econômica para a América Latina e o Caribe (CEPAL) (4)
- CORA - Cork Open Research Archive - University College Cork - Ireland (4)
- Corvinus Research Archive - The institutional repository for the Corvinus University of Budapest (1)
- Dalarna University College Electronic Archive (7)
- Department of Computer Science E-Repository - King's College London, Strand, London (1)
- Digital Commons @ Winthrop University (1)
- Digital Commons at Florida International University (17)
- Digital Peer Publishing (4)
- DigitalCommons@The Texas Medical Center (2)
- Doria (National Library of Finland DSpace Services) - National Library of Finland, Finland (4)
- DRUM (Digital Repository at the University of Maryland) (2)
- Duke University (6)
- FUNDAJ - Fundação Joaquim Nabuco (1)
- Greenwich Academic Literature Archive - UK (1)
- Helda - Digital Repository of University of Helsinki (6)
- Helvia: Repositorio Institucional de la Universidad de Córdoba (1)
- Illinois Digital Environment for Access to Learning and Scholarship Repository (1)
- Indian Institute of Science - Bangalore - Índia (26)
- Instituto Politécnico de Castelo Branco - Portugal (1)
- Instituto Politécnico do Porto, Portugal (16)
- Lume - Repositório Digital da Universidade Federal do Rio Grande do Sul (2)
- Martin Luther Universitat Halle Wittenberg, Germany (1)
- Massachusetts Institute of Technology (1)
- Memorial University Research Repository (2)
- National Center for Biotechnology Information - NCBI (1)
- Plymouth Marine Science Electronic Archive (PlyMSEA) (2)
- Publishing Network for Geoscientific & Environmental Data (15)
- QSpace: Queen's University - Canada (3)
- QUB Research Portal - Research Directory and Institutional Repository for Queen's University Belfast (19)
- Queensland University of Technology - ePrints Archive (221)
- RCAAP - Repositório Científico de Acesso Aberto de Portugal (2)
- RDBU - Repositório Digital da Biblioteca da Unisinos (1)
- ReCiL - Repositório Científico Lusófona - Grupo Lusófona, Portugal (1)
- Repositório Alice (Acesso Livre à Informação Científica da Embrapa / Repository Open Access to Scientific Information from Embrapa) (3)
- Repositório Científico da Universidade de Évora - Portugal (1)
- Repositório Científico do Instituto Politécnico de Lisboa - Portugal (2)
- Repositorio de la Universidad de Cuenca (1)
- Repositório digital da Fundação Getúlio Vargas - FGV (10)
- Repositorio Institucional da UFLA (RIUFLA) (1)
- Repositório Institucional da Universidade de Aveiro - Portugal (1)
- Repositório Institucional da Universidade Estadual de São Paulo - UNESP (3)
- Repositorio Institucional de la Universidad de La Laguna (1)
- Repositorio Institucional de la Universidad de Málaga (2)
- Repositório Institucional UNESP - Universidade Estadual Paulista "Julio de Mesquita Filho" (16)
- Repositorio Institucional Universidad Católica de Colombia (1)
- Research Open Access Repository of the University of East London. (1)
- RUN (Repositório da Universidade Nova de Lisboa) - FCT (Faculdade de Cienecias e Technologia), Universidade Nova de Lisboa (UNL), Portugal (5)
- SAPIENTIA - Universidade do Algarve - Portugal (1)
- School of Medicine, Washington University, United States (1)
- The Scholarly Commons | School of Hotel Administration; Cornell University Research (1)
- Universidad de Alicante (8)
- Universidad Politécnica de Madrid (30)
- Universidade Complutense de Madrid (3)
- Universidade Metodista de São Paulo (1)
- Universita di Parma (2)
- Universitätsbibliothek Kassel, Universität Kassel, Germany (5)
- Université de Lausanne, Switzerland (1)
- University of Canberra Research Repository - Australia (3)
- University of Michigan (2)
- University of Queensland eSpace - Australia (21)
- University of Southampton, United Kingdom (14)
- University of Washington (4)
- WestminsterResearch - UK (8)
Resumo:
Discovery Driven Analysis (DDA) is a common feature of OLAP technology to analyze structured data. In essence, DDA helps analysts to discover anomalous data by highlighting 'unexpected' values in the OLAP cube. By giving indications to the analyst on what dimensions to explore, DDA speeds up the process of discovering anomalies and their causes. However, Discovery Driven Analysis (and OLAP in general) is only applicable on structured data, such as records in databases. We propose a system to extend DDA technology to semi-structured text documents, that is, text documents with a few structured data. Our system pipeline consists of two stages: first, the text part of each document is structured around user specified dimensions, using semi-PLSA algorithm; then, we adapt DDA to these fully structured documents, thus enabling DDA on text documents. We present some applications of this system in OLAP analysis and show how scalability issues are solved. Results show that our system can handle reasonable datasets of documents, in real time, without any need for pre-computation.