1 resultado para big data storage
em Digital Commons - Michigan Tech
Filtro por publicador
- JISC Information Environment Repository (3)
- Repository Napier (2)
- Aberdeen University (4)
- Abertay Research Collections - Abertay University’s repository (3)
- Academic Archive On-line (Jönköping University; Sweden) (1)
- Academic Archive On-line (Stockholm University; Sweden) (1)
- Acceda, el repositorio institucional de la Universidad de Las Palmas de Gran Canaria. España (2)
- AMS Tesi di Dottorato - Alm@DL - Università di Bologna (4)
- AMS Tesi di Laurea - Alm@DL - Università di Bologna (36)
- Andina Digital - Repositorio UASB-Digital - Universidade Andina Simón Bolívar (1)
- Aquatic Commons (9)
- ArchiMeD - Elektronische Publikationen der Universität Mainz - Alemanha (4)
- Archivo Digital para la Docencia y la Investigación - Repositorio Institucional de la Universidad del País Vasco (5)
- Aston University Research Archive (21)
- Avian Conservation and Ecology - Eletronic Cientific Hournal - Écologie et conservation des oiseaux: (1)
- B-Digital - Universidade Fernando Pessoa - Portugal (1)
- Biblioteca de Teses e Dissertações da USP (2)
- Biblioteca Digital da Produção Intelectual da Universidade de São Paulo (8)
- Biblioteca Digital da Produção Intelectual da Universidade de São Paulo (BDPI/USP) (6)
- Biblioteca Digital de la Universidad Católica Argentina (1)
- Biblioteca Digital de Teses e Dissertações Eletrônicas da UERJ (1)
- BORIS: Bern Open Repository and Information System - Berna - Suiça (14)
- Boston University Digital Common (3)
- Bulgarian Digital Mathematics Library at IMI-BAS (6)
- CaltechTHESIS (4)
- Cambridge University Engineering Department Publications Database (6)
- CentAUR: Central Archive University of Reading - UK (49)
- Chinese Academy of Sciences Institutional Repositories Grid Portal (49)
- Cochin University of Science & Technology (CUSAT), India (9)
- Coffee Science - Universidade Federal de Lavras (1)
- Comissão Econômica para a América Latina e o Caribe (CEPAL) (11)
- CORA - Cork Open Research Archive - University College Cork - Ireland (8)
- Corvinus Research Archive - The institutional repository for the Corvinus University of Budapest (1)
- CUNY Academic Works (1)
- Dalarna University College Electronic Archive (8)
- DI-fusion - The institutional repository of Université Libre de Bruxelles (1)
- Digital Commons - Michigan Tech (1)
- Digital Commons @ Winthrop University (1)
- Digital Commons at Florida International University (9)
- Digital Peer Publishing (8)
- DigitalCommons@The Texas Medical Center (1)
- Doria (National Library of Finland DSpace Services) - National Library of Finland, Finland (5)
- DRUM (Digital Repository at the University of Maryland) (6)
- Duke University (6)
- eResearch Archive - Queensland Department of Agriculture; Fisheries and Forestry (3)
- Glasgow Theses Service (1)
- Greenwich Academic Literature Archive - UK (4)
- Helda - Digital Repository of University of Helsinki (8)
- Helvia: Repositorio Institucional de la Universidad de Córdoba (1)
- Indian Institute of Science - Bangalore - Índia (46)
- Instituto Politécnico de Castelo Branco - Portugal (1)
- Instituto Politécnico de Leiria (1)
- Instituto Politécnico do Porto, Portugal (2)
- Lume - Repositório Digital da Universidade Federal do Rio Grande do Sul (1)
- Martin Luther Universitat Halle Wittenberg, Germany (2)
- Massachusetts Institute of Technology (5)
- Memoria Académica - FaHCE, UNLP - Argentina (3)
- Memorial University Research Repository (1)
- Open University Netherlands (1)
- Plymouth Marine Science Electronic Archive (PlyMSEA) (5)
- Portal de Periódicos Eletrônicos da UFPB (1)
- Portal de Revistas Científicas Complutenses - Espanha (4)
- Publishing Network for Geoscientific & Environmental Data (4)
- QSpace: Queen's University - Canada (3)
- QUB Research Portal - Research Directory and Institutional Repository for Queen's University Belfast (41)
- Queensland University of Technology - ePrints Archive (124)
- RCAAP - Repositório Científico de Acesso Aberto de Portugal (1)
- RDBU - Repositório Digital da Biblioteca da Unisinos (2)
- Repositório Aberto da Universidade Aberta de Portugal (1)
- Repositório Científico da Universidade de Évora - Portugal (4)
- Repositório Científico do Instituto Politécnico de Lisboa - Portugal (1)
- Repositório digital da Fundação Getúlio Vargas - FGV (14)
- Repositório Digital da UNIVERSIDADE DA MADEIRA - Portugal (1)
- Repositório Institucional da Universidade de Aveiro - Portugal (5)
- Repositório Institucional da Universidade Estadual de São Paulo - UNESP (5)
- Repositorio Institucional de la Universidad de La Laguna (1)
- Repositorio Institucional de la Universidad de Málaga (4)
- Repositório Institucional UNESP - Universidade Estadual Paulista "Julio de Mesquita Filho" (24)
- Repositorio Institucional Universidad Católica de Colombia (1)
- RUN (Repositório da Universidade Nova de Lisboa) - FCT (Faculdade de Cienecias e Technologia), Universidade Nova de Lisboa (UNL), Portugal (6)
- SAPIENTIA - Universidade do Algarve - Portugal (3)
- The Scholarly Commons | School of Hotel Administration; Cornell University Research (1)
- Universidad de Alicante (9)
- Universidad del Rosario, Colombia (2)
- Universidad Politécnica de Madrid (31)
- Universidade Complutense de Madrid (10)
- Universidade Federal do Pará (1)
- Universidade Federal do Rio Grande do Norte (UFRN) (4)
- Universidade Metodista de São Paulo (2)
- Universitat de Girona, Spain (1)
- Universitätsbibliothek Kassel, Universität Kassel, Germany (5)
- Université de Lausanne, Switzerland (3)
- Université de Montréal (2)
- Université de Montréal, Canada (5)
- Université Laval Mémoires et thèses électroniques (1)
- University of Canberra Research Repository - Australia (3)
- University of Connecticut - USA (1)
- University of Michigan (6)
- University of Queensland eSpace - Australia (1)
- University of Southampton, United Kingdom (17)
- University of Washington (8)
- WestminsterResearch - UK (9)
Resumo:
Virtually every sector of business and industry that uses computing, including financial analysis, search engines, and electronic commerce, incorporate Big Data analysis into their business model. Sophisticated clustering algorithms are popular for deducing the nature of data by assigning labels to unlabeled data. We address two main challenges in Big Data. First, by definition, the volume of Big Data is too large to be loaded into a computer’s memory (this volume changes based on the computer used or available, but there is always a data set that is too large for any computer). Second, in real-time applications, the velocity of new incoming data prevents historical data from being stored and future data from being accessed. Therefore, we propose our Streaming Kernel Fuzzy c-Means (stKFCM) algorithm, which reduces both computational complexity and space complexity significantly. The proposed stKFCM only requires O(n2) memory where n is the (predetermined) size of a data subset (or data chunk) at each time step, which makes this algorithm truly scalable (as n can be chosen based on the available memory). Furthermore, only 2n2 elements of the full N × N (where N >> n) kernel matrix need to be calculated at each time-step, thus reducing both the computation time in producing the kernel elements and also the complexity of the FCM algorithm. Empirical results show that stKFCM, even with relatively very small n, can provide clustering performance as accurately as kernel fuzzy c-means run on the entire data set while achieving a significant speedup.