1 result for Characteristic frequencies
in SerWisS - Server für Wissenschaftliche Schriften der Fachhochschule Hannover
Abstract:
Distributional semantics tries to characterize the meaning of words by the contexts in which they occur. The similarity of two words can hence be derived from the similarity of their contexts. The context of a word is usually represented as a vector of the words appearing near that word in a corpus. Previous research observed that similarity measures for the context vectors of two words depend on the frequency of these words. In the present paper we investigate this dependency in more detail for one similarity measure, the Jensen-Shannon divergence. We give an empirical model of this dependency and propose, as an alternative similarity measure, the deviation of the observed Jensen-Shannon divergence from the divergence expected on the basis of the word frequencies. We show that this new similarity measure is superior to both the Jensen-Shannon divergence and the cosine similarity in a task in which pairs of words taken from WordNet have to be classified as synonyms or not.
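
To make the two baseline measures named in the abstract concrete, here is a minimal sketch of how context vectors, the Jensen-Shannon divergence, and the cosine similarity could be computed. It is an illustration under stated assumptions, not the paper's implementation: the function names, the symmetric window of two words, the raw co-occurrence counts, and the base-2 logarithm are all choices made here. The frequency-dependent expected divergence proposed in the paper is not reproduced, because the abstract does not give the empirical model.

```python
import math
from collections import Counter


def context_vector(tokens, target, window=2):
    """Count the words within `window` positions of each occurrence of `target`.

    The window size and the use of raw counts are assumptions for this sketch.
    """
    counts = Counter()
    for i, tok in enumerate(tokens):
        if tok != target:
            continue
        lo = max(0, i - window)
        hi = min(len(tokens), i + window + 1)
        for j in range(lo, hi):
            if j != i:
                counts[tokens[j]] += 1
    return counts


def jensen_shannon_divergence(p_counts, q_counts):
    """Jensen-Shannon divergence (base 2) between two count vectors.

    Counts are normalised to probability distributions first.
    The result lies in [0, 1]: 0 for identical distributions,
    1 for distributions with disjoint support.
    """
    p_total = sum(p_counts.values())
    q_total = sum(q_counts.values())
    if p_total == 0 or q_total == 0:
        raise ValueError("both context vectors must be non-empty")
    jsd = 0.0
    for word in set(p_counts) | set(q_counts):
        p = p_counts.get(word, 0) / p_total
        q = q_counts.get(word, 0) / q_total
        m = 0.5 * (p + q)  # mixture distribution
        if p > 0:
            jsd += 0.5 * p * math.log2(p / m)
        if q > 0:
            jsd += 0.5 * q * math.log2(q / m)
    return jsd


def cosine_similarity(p_counts, q_counts):
    """Cosine of the angle between two count vectors."""
    shared = set(p_counts) & set(q_counts)
    dot = sum(p_counts[w] * q_counts[w] for w in shared)
    norm_p = math.sqrt(sum(v * v for v in p_counts.values()))
    norm_q = math.sqrt(sum(v * v for v in q_counts.values()))
    if norm_p == 0 or norm_q == 0:
        raise ValueError("both context vectors must be non-empty")
    return dot / (norm_p * norm_q)


if __name__ == "__main__":
    # Toy corpus, invented for illustration only.
    corpus = "the cat sat on the mat while the dog sat on the rug".split()
    cat = context_vector(corpus, "cat")
    dog = context_vector(corpus, "dog")
    print("JSD:", jensen_shannon_divergence(cat, dog))
    print("cosine:", cosine_similarity(cat, dog))
```

Note that the divergence is a dissimilarity (lower means more similar), whereas the cosine is a similarity (higher means more similar), so the two scores point in opposite directions when used to rank word pairs.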