1 result for Text categorization
in University of Queensland eSpace - Australia
Abstract:
Document classification is a supervised machine learning process in which predefined category labels are assigned to documents based on a hypothesis derived from a training set of labelled documents. Documents cannot be directly interpreted by a computer system unless they have been modelled as a collection of computable features. Rogati and Yang [M. Rogati and Y. Yang, Resource selection for domain-specific cross-lingual IR, in SIGIR 2004: Proceedings of the 27th annual international conference on Research and Development in Information Retrieval, ACM Press, Sheffield, United Kingdom, pp. 154-161.] pointed out that the effectiveness of a document classification system may vary across domains. This implies that the quality of the document model contributes to the effectiveness of document classification. Conventionally, model evaluation is accomplished by comparing the effectiveness scores of classifiers on model candidates. However, this kind of evaluation may run into under-fitting or over-fitting problems, because the effectiveness scores are restricted by the learning capacities of the classifiers. We propose a model fitness evaluation method to determine whether a model is sufficient to distinguish positive and negative instances while remaining able to provide satisfactory effectiveness with a small feature subset. Our experiments demonstrate how the fitness of models is assessed. The results of our work contribute to research on feature selection, dimensionality reduction and document classification.
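
The conventional evaluation mentioned in the abstract, comparing classifier effectiveness scores across candidate document models, might look roughly like the sketch below. It is not the fitness method the authors propose, whose details the abstract does not give. The two feature models, the nearest-centroid classifier, the choice of F1 as the effectiveness score, and the toy documents are all assumptions made for illustration.

```python
from collections import Counter


def unigram_model(text):
    """Hypothetical candidate model A: bag of lower-cased word counts."""
    return Counter(text.lower().split())


def char_trigram_model(text):
    """Hypothetical candidate model B: bag of character trigram counts."""
    s = text.lower()
    return Counter(s[i:i + 3] for i in range(len(s) - 2))


def centroid(vectors):
    """Sum the feature counts of several documents into one vector."""
    total = Counter()
    for v in vectors:
        total.update(v)
    return total


def dot(a, b):
    return sum(a[k] * b.get(k, 0) for k in a)


def cosine(a, b):
    na, nb = dot(a, a) ** 0.5, dot(b, b) ** 0.5
    return dot(a, b) / (na * nb) if na and nb else 0.0


def f1_score(y_true, y_pred):
    """F1 for the positive class, used here as the effectiveness score."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t and p)
    fp = sum(1 for t, p in pairs if not t and p)
    fn = sum(1 for t, p in pairs if t and not p)
    if tp == 0:
        return 0.0
    precision, recall = tp / (tp + fp), tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def evaluate_model(featurize, train, test):
    """Train a nearest-centroid classifier on one document model and score it."""
    pos = centroid([featurize(t) for t, y in train if y])
    neg = centroid([featurize(t) for t, y in train if not y])
    y_true = [y for _, y in test]
    y_pred = [cosine(featurize(t), pos) > cosine(featurize(t), neg)
              for t, _ in test]
    return f1_score(y_true, y_pred)


# Invented toy data: True marks the positive category.
TRAIN = [
    ("feature selection for text classifiers", True),
    ("dimensionality reduction of document features", True),
    ("recipe for slow cooked lamb stew", False),
    ("weekend hiking trails near the coast", False),
]
TEST = [
    ("selecting features for document classification", True),
    ("coastal hiking and a lamb recipe", False),
]

CANDIDATES = {"unigram": unigram_model, "char_trigram": char_trigram_model}

# One effectiveness score per candidate document model.
scores = {name: evaluate_model(fn, TRAIN, TEST)
          for name, fn in CANDIDATES.items()}
print(scores)
```

Each score here reflects the classifier as much as the document model: a weak or overly flexible learner would shift every number. That is the limitation the abstract attributes to this style of evaluation.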