2 resultados para text mining
em University of Queensland eSpace - Australia
Filtro por publicador
- JISC Information Environment Repository (9)
- Aberdeen University (3)
- Aberystwyth University Repository - Reino Unido (11)
- Academic Archive On-line (Jönköping University; Sweden) (1)
- Academic Archive On-line (Mid Sweden University; Sweden) (1)
- AMS Tesi di Dottorato - Alm@DL - Università di Bologna (1)
- AMS Tesi di Laurea - Alm@DL - Università di Bologna (12)
- Aquatic Commons (3)
- Archimer: Archive de l'Institut francais de recherche pour l'exploitation de la mer (1)
- Archivo Digital para la Docencia y la Investigación - Repositorio Institucional de la Universidad del País Vasco (6)
- Argos - Repositorio Institucional de la Secretaría de Investigación y Postgrado de la Facultad de Humanidades y Ciencias Sociales de la Universidad Nacional de Misiones (2)
- Aston University Research Archive (11)
- Biblioteca Digital da Produção Intelectual da Universidade de São Paulo (2)
- Biblioteca Digital da Produção Intelectual da Universidade de São Paulo (BDPI/USP) (2)
- Biblioteca Digital de Teses e Dissertações Eletrônicas da UERJ (1)
- Biodiversity Heritage Library, United States (1)
- BORIS: Bern Open Repository and Information System - Berna - Suiça (5)
- Boston University Digital Common (3)
- Bucknell University Digital Commons - Pensilvania - USA (3)
- Bulgarian Digital Mathematics Library at IMI-BAS (2)
- Cambridge University Engineering Department Publications Database (33)
- CentAUR: Central Archive University of Reading - UK (25)
- Chinese Academy of Sciences Institutional Repositories Grid Portal (24)
- CORA - Cork Open Research Archive - University College Cork - Ireland (1)
- Department of Computer Science E-Repository - King's College London, Strand, London (1)
- DI-fusion - The institutional repository of Université Libre de Bruxelles (1)
- Digital Commons - Michigan Tech (12)
- Digital Commons - Montana Tech (9)
- Digital Commons @ DU | University of Denver Research (1)
- Digital Commons @ Winthrop University (1)
- Digital Commons at Florida International University (12)
- Digital Peer Publishing (2)
- DigitalCommons@The Texas Medical Center (1)
- Doria (National Library of Finland DSpace Services) - National Library of Finland, Finland (1)
- DRUM (Digital Repository at the University of Maryland) (2)
- Duke University (9)
- eResearch Archive - Queensland Department of Agriculture; Fisheries and Forestry (3)
- Gallica, Bibliotheque Numerique - Bibliothèque nationale de France (French National Library) (BnF), France (1)
- Glasgow Theses Service (1)
- Greenwich Academic Literature Archive - UK (2)
- Harvard University (1)
- Helda - Digital Repository of University of Helsinki (29)
- Illinois Digital Environment for Access to Learning and Scholarship Repository (3)
- Indian Institute of Science - Bangalore - Índia (44)
- Infoteca EMBRAPA (1)
- Massachusetts Institute of Technology (1)
- National Center for Biotechnology Information - NCBI (2)
- Open University Netherlands (1)
- Plymouth Marine Science Electronic Archive (PlyMSEA) (3)
- Portal de Revistas Científicas Complutenses - Espanha (1)
- QUB Research Portal - Research Directory and Institutional Repository for Queen's University Belfast (36)
- Queensland University of Technology - ePrints Archive (355)
- RDBU - Repositório Digital da Biblioteca da Unisinos (1)
- Repositório digital da Fundação Getúlio Vargas - FGV (2)
- Repositório Digital da UNIVERSIDADE DA MADEIRA - Portugal (1)
- Repositório Institucional da Universidade de Aveiro - Portugal (1)
- Repositório Institucional da Universidade de Brasília (2)
- Repositorio Institucional de la Universidad de Málaga (2)
- Repositório Institucional UNESP - Universidade Estadual Paulista "Julio de Mesquita Filho" (3)
- Research Open Access Repository of the University of East London. (1)
- RUN (Repositório da Universidade Nova de Lisboa) - FCT (Faculdade de Cienecias e Technologia), Universidade Nova de Lisboa (UNL), Portugal (2)
- SAPIENTIA - Universidade do Algarve - Portugal (1)
- Savoirs UdeS : plateforme de diffusion de la production intellectuelle de l’Université de Sherbrooke - Canada (2)
- School of Medicine, Washington University, United States (2)
- Scielo Uruguai (1)
- SerWisS - Server für Wissenschaftliche Schriften der Fachhochschule Hannover (2)
- Universidad de Alicante (7)
- Universidad del Rosario, Colombia (1)
- Universidad Politécnica de Madrid (4)
- Universidade Federal de Uberlândia (1)
- Universidade Metodista de São Paulo (3)
- Universitat de Girona, Spain (4)
- Université de Montréal, Canada (4)
- University of Michigan (173)
- University of Queensland eSpace - Australia (2)
- University of Southampton, United Kingdom (2)
- University of Washington (2)
Resumo:
In this paper we explore the use of text-mining methods for the identification of the author of a text. We apply the support vector machine (SVM) to this problem, as it is able to cope with half a million of inputs it requires no feature selection and can process the frequency vector of all words of a text. We performed a number of experiments with texts from a German newspaper. With nearly perfect reliability the SVM was able to reject other authors and detected the target author in 60–80% of the cases. In a second experiment, we ignored nouns, verbs and adjectives and replaced them by grammatical tags and bigrams. This resulted in slightly reduced performance. Author detection with SVMs on full word forms was remarkably robust even if the author wrote about different topics.