1 resultado para Binary Classification
em SerWisS - Server für Wissenschaftliche Schriften der Fachhochschule Hannover
Filtro por publicador
- Acceda, el repositorio institucional de la Universidad de Las Palmas de Gran Canaria. España (1)
- AMS Tesi di Laurea - Alm@DL - Università di Bologna (1)
- Applied Math and Science Education Repository - Washington - USA (2)
- Archive of European Integration (5)
- Aston University Research Archive (5)
- Biblioteca Digital da Produção Intelectual da Universidade de São Paulo (5)
- Biblioteca Digital da Produção Intelectual da Universidade de São Paulo (BDPI/USP) (85)
- Biblioteca Virtual del Sistema Sanitario Público de Andalucía (BV-SSPA), Junta de Andalucía. Consejería de Salud y Bienestar Social, Spain (3)
- Biodiversity Heritage Library, United States (31)
- BORIS: Bern Open Repository and Information System - Berna - Suiça (4)
- Brock University, Canada (6)
- Bulgarian Digital Mathematics Library at IMI-BAS (6)
- CentAUR: Central Archive University of Reading - UK (126)
- Cochin University of Science & Technology (CUSAT), India (24)
- Collection Of Biostatistics Research Archive (1)
- Consorci de Serveis Universitaris de Catalunya (CSUC), Spain (77)
- CUNY Academic Works (1)
- Dalarna University College Electronic Archive (9)
- Department of Computer Science E-Repository - King's College London, Strand, London (9)
- Digital Commons @ DU | University of Denver Research (1)
- Diposit Digital de la UB - Universidade de Barcelona (6)
- Doria (National Library of Finland DSpace Services) - National Library of Finland, Finland (25)
- DRUM (Digital Repository at the University of Maryland) (1)
- eResearch Archive - Queensland Department of Agriculture; Fisheries and Forestry (1)
- Gallica, Bibliotheque Numerique - Bibliothèque nationale de France (French National Library) (BnF), France (24)
- Illinois Digital Environment for Access to Learning and Scholarship Repository (1)
- Institute of Public Health in Ireland, Ireland (2)
- Instituto Politécnico do Porto, Portugal (12)
- Iowa Publications Online (IPO) - State Library, State of Iowa (Iowa), United States (10)
- Martin Luther Universitat Halle Wittenberg, Germany (4)
- Massachusetts Institute of Technology (7)
- Ministerio de Cultura, Spain (2)
- National Center for Biotechnology Information - NCBI (1)
- QUB Research Portal - Research Directory and Institutional Repository for Queen's University Belfast (1)
- Repositório Aberto da Universidade Aberta de Portugal (1)
- Repositório Científico da Universidade de Évora - Portugal (1)
- Repositório Científico do Instituto Politécnico de Lisboa - Portugal (27)
- Repositório da Produção Científica e Intelectual da Unicamp (10)
- Repositório do Centro Hospitalar de Lisboa Central, EPE - Centro Hospitalar de Lisboa Central, EPE, Portugal (3)
- Repositório Institucional UNESP - Universidade Estadual Paulista "Julio de Mesquita Filho" (3)
- RUN (Repositório da Universidade Nova de Lisboa) - FCT (Faculdade de Cienecias e Technologia), Universidade Nova de Lisboa (UNL), Portugal (18)
- School of Medicine, Washington University, United States (2)
- Scielo Saúde Pública - SP (72)
- Scottish Institute for Research in Economics (SIRE) (SIRE), United Kingdom (1)
- SerWisS - Server für Wissenschaftliche Schriften der Fachhochschule Hannover (1)
- Universidad Autónoma de Nuevo León, Mexico (1)
- Universidad de Alicante (1)
- Universidad del Rosario, Colombia (1)
- Universidad Politécnica de Madrid (4)
- Universidade do Minho (9)
- Universidade dos Açores - Portugal (1)
- Universidade Federal do Rio Grande do Norte (UFRN) (1)
- Universitat de Girona, Spain (10)
- Universitätsbibliothek Kassel, Universität Kassel, Germany (3)
- Université de Lausanne, Switzerland (141)
- Université de Montréal, Canada (25)
- Université Laval Mémoires et thèses électroniques (1)
- University of Queensland eSpace - Australia (38)
- University of Southampton, United Kingdom (1)
Resumo:
The dependency of word similarity in vector space models on the frequency of words has been noted in a few studies, but has received very little attention. We study the influence of word frequency in a set of 10 000 randomly selected word pairs for a number of different combinations of feature weighting schemes and similarity measures. We find that the similarity of word pairs for all methods, except for the one using singular value decomposition to reduce the dimensionality of the feature space, is determined to a large extent by the frequency of the words. In a binary classification task of pairs of synonyms and unrelated words we find that for all similarity measures the results can be improved when we correct for the frequency bias.