1 result for CONTENT WORDS
in SerWisS - Server für Wissenschaftliche Schriften der Fachhochschule Hannover
Abstract:
Distributional semantics tries to characterize the meaning of words by the contexts in which they occur. Hence, the similarity of words can be derived from the similarity of their contexts. The context of a word is usually represented as a vector of the words appearing near that word in a corpus. Previous research observed that similarity measures for the context vectors of two words depend on the frequency of these words. In the present paper we investigate this dependency in more detail for one similarity measure, the Jensen-Shannon divergence. We give an empirical model of this dependency and propose, as an alternative similarity measure, the deviation of the observed Jensen-Shannon divergence from the divergence expected on the basis of the frequencies of the words. We show that this new similarity measure is superior to both the Jensen-Shannon divergence and the cosine similarity in a task in which pairs of words, taken from WordNet, have to be classified as being synonyms or not.
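The two baseline measures named in the abstract can be illustrated with a minimal sketch. It assumes that a context vector is a plain count of the words within a fixed window around each occurrence of the target word; the window size, the function names, and the use of base-2 logarithms are illustrative assumptions, not details taken from the paper. The paper's empirical model of the expected divergence is not given in the abstract, so the frequency-corrected measure is not reproduced here.

```python
import math
from collections import Counter


def context_vector(tokens, target, window=2):
    """Count the words within `window` positions of each occurrence of `target`.

    The window size is an assumption for illustration only.
    """
    counts = Counter()
    for i, tok in enumerate(tokens):
        if tok == target:
            lo = max(0, i - window)
            hi = min(len(tokens), i + window + 1)
            # Skip position i itself, i.e. the target occurrence.
            counts.update(t for j, t in enumerate(tokens[lo:hi], start=lo) if j != i)
    return counts


def jensen_shannon_divergence(p_counts, q_counts):
    """Jensen-Shannon divergence (in bits) between two count vectors.

    Counts are normalized to probability distributions first.
    The result lies in [0, 1]: 0 for identical distributions,
    1 for distributions with no shared context words.
    """
    p_total = sum(p_counts.values())
    q_total = sum(q_counts.values())
    if p_total == 0 or q_total == 0:
        raise ValueError("both context vectors must be non-empty")
    jsd = 0.0
    for w in set(p_counts) | set(q_counts):
        p = p_counts.get(w, 0) / p_total
        q = q_counts.get(w, 0) / q_total
        m = 0.5 * (p + q)  # mixture distribution
        if p > 0:
            jsd += 0.5 * p * math.log2(p / m)
        if q > 0:
            jsd += 0.5 * q * math.log2(q / m)
    return jsd


def cosine_similarity(p_counts, q_counts):
    """Cosine of the angle between two count vectors (0.0 if either is empty)."""
    dot = sum(p_counts[w] * q_counts[w] for w in set(p_counts) & set(q_counts))
    norm_p = math.sqrt(sum(v * v for v in p_counts.values()))
    norm_q = math.sqrt(sum(v * v for v in q_counts.values()))
    if norm_p == 0 or norm_q == 0:
        return 0.0
    return dot / (norm_p * norm_q)
```

Note that the divergence is a dissimilarity (lower means more similar), whereas the cosine is a similarity (higher means more similar). A frequency-corrected score in the spirit of the abstract would compare the observed divergence with the value predicted from the two word frequencies, which requires the fitted model from the paper itself.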