1 resultado para Impala, Hadoop, Big Data, HDFS, Social Business Intelligence, SBI, cloudera
em Digital Commons - Michigan Tech
Filtro por publicador
- JISC Information Environment Repository (1)
- Repository Napier (3)
- Aberdeen University (2)
- Abertay Research Collections - Abertay University’s repository (3)
- Academic Archive On-line (Jönköping University; Sweden) (1)
- Acceda, el repositorio institucional de la Universidad de Las Palmas de Gran Canaria. España (3)
- AMS Tesi di Dottorato - Alm@DL - Università di Bologna (2)
- AMS Tesi di Laurea - Alm@DL - Università di Bologna (52)
- Andina Digital - Repositorio UASB-Digital - Universidade Andina Simón Bolívar (2)
- Archive of European Integration (2)
- Archivo Digital para la Docencia y la Investigación - Repositorio Institucional de la Universidad del País Vasco (2)
- Aston University Research Archive (17)
- B-Digital - Universidade Fernando Pessoa - Portugal (1)
- Biblioteca de Teses e Dissertações da USP (1)
- Biblioteca Digital da Produção Intelectual da Universidade de São Paulo (5)
- Biblioteca Digital de Teses e Dissertações Eletrônicas da UERJ (1)
- BORIS: Bern Open Repository and Information System - Berna - Suiça (17)
- Brock University, Canada (1)
- Bucknell University Digital Commons - Pensilvania - USA (1)
- Bulgarian Digital Mathematics Library at IMI-BAS (9)
- CaltechTHESIS (1)
- Cambridge University Engineering Department Publications Database (3)
- CentAUR: Central Archive University of Reading - UK (41)
- Chinese Academy of Sciences Institutional Repositories Grid Portal (6)
- CiencIPCA - Instituto Politécnico do Cávado e do Ave, Portugal (1)
- Cochin University of Science & Technology (CUSAT), India (1)
- Coffee Science - Universidade Federal de Lavras (1)
- Comissão Econômica para a América Latina e o Caribe (CEPAL) (13)
- CORA - Cork Open Research Archive - University College Cork - Ireland (7)
- Corvinus Research Archive - The institutional repository for the Corvinus University of Budapest (1)
- CUNY Academic Works (2)
- Dalarna University College Electronic Archive (6)
- Department of Computer Science E-Repository - King's College London, Strand, London (1)
- Digital Commons - Michigan Tech (1)
- Digital Commons @ Winthrop University (1)
- Digital Commons at Florida International University (14)
- Digital Peer Publishing (9)
- DigitalCommons@The Texas Medical Center (1)
- Doria (National Library of Finland DSpace Services) - National Library of Finland, Finland (6)
- DRUM (Digital Repository at the University of Maryland) (6)
- Duke University (5)
- FUNDAJ - Fundação Joaquim Nabuco (1)
- Glasgow Theses Service (1)
- Helda - Digital Repository of University of Helsinki (2)
- Helvia: Repositorio Institucional de la Universidad de Córdoba (1)
- Indian Institute of Science - Bangalore - Índia (6)
- Instituto Politécnico de Castelo Branco - Portugal (1)
- Instituto Politécnico de Leiria (1)
- Instituto Politécnico do Porto, Portugal (8)
- Lume - Repositório Digital da Universidade Federal do Rio Grande do Sul (2)
- Martin Luther Universitat Halle Wittenberg, Germany (1)
- Memorial University Research Repository (1)
- Plymouth Marine Science Electronic Archive (PlyMSEA) (2)
- Portal de Periódicos Eletrônicos da UFPB (1)
- Portal de Revistas Científicas Complutenses - Espanha (4)
- QSpace: Queen's University - Canada (3)
- QUB Research Portal - Research Directory and Institutional Repository for Queen's University Belfast (29)
- Queensland University of Technology - ePrints Archive (129)
- RCAAP - Repositório Científico de Acesso Aberto de Portugal (1)
- RDBU - Repositório Digital da Biblioteca da Unisinos (11)
- ReCiL - Repositório Científico Lusófona - Grupo Lusófona, Portugal (4)
- Repositório Aberto da Universidade Aberta de Portugal (1)
- Repositório Científico da Universidade de Évora - Portugal (6)
- Repositorio de la Universidad de Cuenca (1)
- Repositório digital da Fundação Getúlio Vargas - FGV (25)
- Repositório Institucional da Universidade de Aveiro - Portugal (6)
- Repositório Institucional da Universidade de Brasília (4)
- Repositório Institucional da Universidade Estadual de São Paulo - UNESP (4)
- Repositório Institucional da Universidade Federal do Rio Grande do Norte (1)
- Repositório Institucional da Universidade Tecnológica Federal do Paraná (RIUT) (1)
- Repositorio Institucional de la Universidad de El Salvador (1)
- Repositorio Institucional de la Universidad de La Laguna (1)
- Repositorio Institucional de la Universidad de Málaga (4)
- Repositório Institucional UNESP - Universidade Estadual Paulista "Julio de Mesquita Filho" (17)
- Repositorio Institucional Universidad Católica de Colombia (1)
- Repositorio Institucional Universidad de Medellín (1)
- RUN (Repositório da Universidade Nova de Lisboa) - FCT (Faculdade de Cienecias e Technologia), Universidade Nova de Lisboa (UNL), Portugal (14)
- SAPIENTIA - Universidade do Algarve - Portugal (2)
- SerWisS - Server für Wissenschaftliche Schriften der Fachhochschule Hannover (3)
- South Carolina State Documents Depository (1)
- The Scholarly Commons | School of Hotel Administration; Cornell University Research (1)
- Universidad de Alicante (14)
- Universidad del Rosario, Colombia (10)
- Universidad Politécnica de Madrid (27)
- Universidade Complutense de Madrid (10)
- Universidade Federal do Pará (1)
- Universidade Federal do Rio Grande do Norte (UFRN) (8)
- Universidade Metodista de São Paulo (3)
- Universidade Técnica de Lisboa (1)
- Universitat de Girona, Spain (4)
- Universitätsbibliothek Kassel, Universität Kassel, Germany (1)
- Université de Lausanne, Switzerland (2)
- Université de Montréal (3)
- Université de Montréal, Canada (10)
- University of Canberra Research Repository - Australia (4)
- University of Michigan (5)
- University of Queensland eSpace - Australia (2)
- University of Southampton, United Kingdom (13)
- University of Washington (4)
- WestminsterResearch - UK (7)
Resumo:
Virtually every sector of business and industry that uses computing, including financial analysis, search engines, and electronic commerce, incorporate Big Data analysis into their business model. Sophisticated clustering algorithms are popular for deducing the nature of data by assigning labels to unlabeled data. We address two main challenges in Big Data. First, by definition, the volume of Big Data is too large to be loaded into a computer’s memory (this volume changes based on the computer used or available, but there is always a data set that is too large for any computer). Second, in real-time applications, the velocity of new incoming data prevents historical data from being stored and future data from being accessed. Therefore, we propose our Streaming Kernel Fuzzy c-Means (stKFCM) algorithm, which reduces both computational complexity and space complexity significantly. The proposed stKFCM only requires O(n2) memory where n is the (predetermined) size of a data subset (or data chunk) at each time step, which makes this algorithm truly scalable (as n can be chosen based on the available memory). Furthermore, only 2n2 elements of the full N × N (where N >> n) kernel matrix need to be calculated at each time-step, thus reducing both the computation time in producing the kernel elements and also the complexity of the FCM algorithm. Empirical results show that stKFCM, even with relatively very small n, can provide clustering performance as accurately as kernel fuzzy c-means run on the entire data set while achieving a significant speedup.