1 resultado para Text summarization
em Bulgarian Digital Mathematics Library at IMI-BAS
Filtro por publicador
- JISC Information Environment Repository (5)
- Aberystwyth University Repository - Reino Unido (1)
- Acceda, el repositorio institucional de la Universidad de Las Palmas de Gran Canaria. España (6)
- AMS Campus - Alm@DL - Università di Bologna (1)
- AMS Tesi di Dottorato - Alm@DL - Università di Bologna (2)
- AMS Tesi di Laurea - Alm@DL - Università di Bologna (10)
- Aquatic Commons (1)
- ArchiMeD - Elektronische Publikationen der Universität Mainz - Alemanha (2)
- Archive of European Integration (32)
- Archivo Digital para la Docencia y la Investigación - Repositorio Institucional de la Universidad del País Vasco (3)
- Aston University Research Archive (1)
- Biblioteca Digital da Produção Intelectual da Universidade de São Paulo (5)
- Biblioteca Digital da Produção Intelectual da Universidade de São Paulo (BDPI/USP) (1)
- Biodiversity Heritage Library, United States (11)
- BORIS: Bern Open Repository and Information System - Berna - Suiça (104)
- Boston University Digital Common (1)
- Brock University, Canada (7)
- Bucknell University Digital Commons - Pensilvania - USA (6)
- Bulgarian Digital Mathematics Library at IMI-BAS (1)
- Cambridge University Engineering Department Publications Database (27)
- CentAUR: Central Archive University of Reading - UK (47)
- Central European University - Research Support Scheme (1)
- Chinese Academy of Sciences Institutional Repositories Grid Portal (8)
- Cochin University of Science & Technology (CUSAT), India (3)
- Comissão Econômica para a América Latina e o Caribe (CEPAL) (8)
- Dalarna University College Electronic Archive (6)
- Deakin Research Online - Australia (94)
- Department of Computer Science E-Repository - King's College London, Strand, London (3)
- DI-fusion - The institutional repository of Université Libre de Bruxelles (1)
- Digital Archives@Colby (2)
- Digital Commons at Florida International University (4)
- Digital Peer Publishing (6)
- DigitalCommons@The Texas Medical Center (5)
- DigitalCommons@University of Nebraska - Lincoln (2)
- Digitale Sammlungen - Goethe-Universität Frankfurt am Main (62)
- DRUM (Digital Repository at the University of Maryland) (1)
- Duke University (3)
- Greenwich Academic Literature Archive - UK (1)
- Harvard University (3)
- Helda - Digital Repository of University of Helsinki (14)
- Indian Institute of Science - Bangalore - Índia (19)
- Instituto Politécnico do Porto, Portugal (1)
- Massachusetts Institute of Technology (4)
- Memoria Académica - FaHCE, UNLP - Argentina (6)
- Ministerio de Cultura, Spain (87)
- National Center for Biotechnology Information - NCBI (1)
- Publishing Network for Geoscientific & Environmental Data (2)
- QUB Research Portal - Research Directory and Institutional Repository for Queen's University Belfast (62)
- Queensland University of Technology - ePrints Archive (95)
- ReCiL - Repositório Científico Lusófona - Grupo Lusófona, Portugal (1)
- Repositório digital da Fundação Getúlio Vargas - FGV (1)
- Repositório Digital da UNIVERSIDADE DA MADEIRA - Portugal (1)
- Repositório Institucional da Universidade de Aveiro - Portugal (1)
- Repositório Institucional UNESP - Universidade Estadual Paulista "Julio de Mesquita Filho" (6)
- Research Open Access Repository of the University of East London. (1)
- RUN (Repositório da Universidade Nova de Lisboa) - FCT (Faculdade de Cienecias e Technologia), Universidade Nova de Lisboa (UNL), Portugal (1)
- School of Medicine, Washington University, United States (1)
- Universidad Autónoma de Nuevo León, Mexico (6)
- Universidad de Alicante (17)
- Universidad Politécnica de Madrid (13)
- Universidade de Lisboa - Repositório Aberto (1)
- Universidade Federal do Pará (1)
- Universitat de Girona, Spain (1)
- Universitätsbibliothek Kassel, Universität Kassel, Germany (4)
- Université de Lausanne, Switzerland (3)
- Université de Montréal, Canada (3)
- University of Connecticut - USA (2)
- University of Michigan (139)
- University of Southampton, United Kingdom (10)
- University of Washington (1)
- WestminsterResearch - UK (2)
Resumo:
In this paper, we present an innovative topic segmentation system based on a new informative similarity measure that takes into account word co-occurrence in order to avoid the accessibility to existing linguistic resources such as electronic dictionaries or lexico-semantic databases such as thesauri or ontology. Topic segmentation is the task of breaking documents into topically coherent multi-paragraph subparts. Topic segmentation has extensively been used in information retrieval and text summarization. In particular, our architecture proposes a language-independent topic segmentation system that solves three main problems evidenced by previous research: systems based uniquely on lexical repetition that show reliability problems, systems based on lexical cohesion using existing linguistic resources that are usually available only for dominating languages and as a consequence do not apply to less favored languages and finally systems that need previously existing harvesting training data. For that purpose, we only use statistics on words and sequences of words based on a set of texts. This solution provides a flexible solution that may narrow the gap between dominating languages and less favored languages thus allowing equivalent access to information.