1 resultado para classification task
em SerWisS - Server für Wissenschaftliche Schriften der Fachhochschule Hannover
Filtro por publicador
- JISC Information Environment Repository (1)
- Aberdeen University (1)
- Aberystwyth University Repository - Reino Unido (11)
- AMS Tesi di Laurea - Alm@DL - Università di Bologna (1)
- Aquatic Commons (24)
- ArchiMeD - Elektronische Publikationen der Universität Mainz - Alemanha (1)
- Archivo Digital para la Docencia y la Investigación - Repositorio Institucional de la Universidad del País Vasco (16)
- Aston University Research Archive (8)
- Biblioteca de Teses e Dissertações da USP (1)
- Biblioteca Digital | Sistema Integrado de Documentación | UNCuyo - UNCUYO. UNIVERSIDAD NACIONAL DE CUYO. (1)
- Biblioteca Digital da Produção Intelectual da Universidade de São Paulo (5)
- Biblioteca Digital da Produção Intelectual da Universidade de São Paulo (BDPI/USP) (2)
- Biblioteca Digital de Teses e Dissertações Eletrônicas da UERJ (1)
- BORIS: Bern Open Repository and Information System - Berna - Suiça (5)
- Boston University Digital Common (21)
- Brock University, Canada (1)
- Bulgarian Digital Mathematics Library at IMI-BAS (1)
- CaltechTHESIS (4)
- Cambridge University Engineering Department Publications Database (131)
- CentAUR: Central Archive University of Reading - UK (8)
- Chinese Academy of Sciences Institutional Repositories Grid Portal (74)
- Cochin University of Science & Technology (CUSAT), India (3)
- CORA - Cork Open Research Archive - University College Cork - Ireland (4)
- Corvinus Research Archive - The institutional repository for the Corvinus University of Budapest (1)
- Dalarna University College Electronic Archive (1)
- Deakin Research Online - Australia (21)
- DI-fusion - The institutional repository of Université Libre de Bruxelles (6)
- Digital Commons at Florida International University (1)
- Digital Peer Publishing (1)
- DRUM (Digital Repository at the University of Maryland) (1)
- Duke University (12)
- eResearch Archive - Queensland Department of Agriculture; Fisheries and Forestry (5)
- FAUBA DIGITAL: Repositorio institucional científico y académico de la Facultad de Agronomia de la Universidad de Buenos Aires (2)
- Greenwich Academic Literature Archive - UK (6)
- Helda - Digital Repository of University of Helsinki (18)
- Indian Institute of Science - Bangalore - Índia (115)
- Massachusetts Institute of Technology (8)
- Ministerio de Cultura, Spain (1)
- National Center for Biotechnology Information - NCBI (1)
- Nottingham eTheses (3)
- Open University Netherlands (1)
- Plymouth Marine Science Electronic Archive (PlyMSEA) (14)
- Portal de Revistas Científicas Complutenses - Espanha (3)
- Publishing Network for Geoscientific & Environmental Data (1)
- QSpace: Queen's University - Canada (1)
- QUB Research Portal - Research Directory and Institutional Repository for Queen's University Belfast (111)
- Queensland University of Technology - ePrints Archive (285)
- Repositório Científico da Universidade de Évora - Portugal (1)
- Repositorio Institucional de la Universidad Pública de Navarra - Espanha (2)
- Repositório Institucional UNESP - Universidade Estadual Paulista "Julio de Mesquita Filho" (9)
- Savoirs UdeS : plateforme de diffusion de la production intellectuelle de l’Université de Sherbrooke - Canada (1)
- SerWisS - Server für Wissenschaftliche Schriften der Fachhochschule Hannover (1)
- Universidad de Alicante (1)
- Universidad Politécnica de Madrid (9)
- Universidade Federal do Pará (1)
- Universidade Federal do Rio Grande do Norte (UFRN) (2)
- Universitat de Girona, Spain (4)
- Université de Montréal, Canada (1)
- Université Laval Mémoires et thèses électroniques (1)
- University of Queensland eSpace - Australia (2)
Resumo:
The dependency of word similarity in vector space models on the frequency of words has been noted in a few studies, but has received very little attention. We study the influence of word frequency in a set of 10 000 randomly selected word pairs for a number of different combinations of feature weighting schemes and similarity measures. We find that the similarity of word pairs for all methods, except for the one using singular value decomposition to reduce the dimensionality of the feature space, is determined to a large extent by the frequency of the words. In a binary classification task of pairs of synonyms and unrelated words we find that for all similarity measures the results can be improved when we correct for the frequency bias.