1 resultado para Speech synthesis Data processing
em Massachusetts Institute of Technology
Filtro por publicador
- Abertay Research Collections - Abertay University’s repository (1)
- AMS Tesi di Dottorato - Alm@DL - Università di Bologna (9)
- AMS Tesi di Laurea - Alm@DL - Università di Bologna (1)
- ArchiMeD - Elektronische Publikationen der Universität Mainz - Alemanha (3)
- Archimer: Archive de l'Institut francais de recherche pour l'exploitation de la mer (2)
- Archive of European Integration (21)
- Aston University Research Archive (35)
- Biblioteca de Teses e Dissertações da USP (1)
- Biblioteca Digital da Produção Intelectual da Universidade de São Paulo (BDPI/USP) (11)
- Biblioteca Virtual del Sistema Sanitario Público de Andalucía (BV-SSPA), Junta de Andalucía. Consejería de Salud y Bienestar Social, Spain (1)
- Biodiversity Heritage Library, United States (2)
- BORIS: Bern Open Repository and Information System - Berna - Suiça (7)
- Brock University, Canada (5)
- Bucknell University Digital Commons - Pensilvania - USA (1)
- Bulgarian Digital Mathematics Library at IMI-BAS (6)
- CentAUR: Central Archive University of Reading - UK (10)
- Cochin University of Science & Technology (CUSAT), India (3)
- Comissão Econômica para a América Latina e o Caribe (CEPAL) (4)
- Consorci de Serveis Universitaris de Catalunya (CSUC), Spain (377)
- Cor-Ciencia - Acuerdo de Bibliotecas Universitarias de Córdoba (ABUC), Argentina (1)
- CORA - Cork Open Research Archive - University College Cork - Ireland (1)
- CUNY Academic Works (2)
- Dalarna University College Electronic Archive (2)
- Digital Commons - Michigan Tech (1)
- Digital Commons at Florida International University (7)
- Digital Peer Publishing (1)
- DigitalCommons - The University of Maine Research (1)
- DigitalCommons@The Texas Medical Center (2)
- Doria (National Library of Finland DSpace Services) - National Library of Finland, Finland (22)
- DRUM (Digital Repository at the University of Maryland) (1)
- Glasgow Theses Service (1)
- Instituto Politécnico de Bragança (1)
- Instituto Politécnico de Leiria (1)
- Instituto Politécnico do Porto, Portugal (26)
- Iowa Publications Online (IPO) - State Library, State of Iowa (Iowa), United States (5)
- Martin Luther Universitat Halle Wittenberg, Germany (1)
- Massachusetts Institute of Technology (1)
- National Center for Biotechnology Information - NCBI (14)
- Portal do Conhecimento - Ministerio do Ensino Superior Ciencia e Inovacao, Cape Verde (1)
- Publishing Network for Geoscientific & Environmental Data (11)
- QUB Research Portal - Research Directory and Institutional Repository for Queen's University Belfast (3)
- Repositório Científico do Instituto Politécnico de Lisboa - Portugal (17)
- Repositório da Produção Científica e Intelectual da Unicamp (3)
- Repositório do Centro Hospitalar de Lisboa Central, EPE - Centro Hospitalar de Lisboa Central, EPE, Portugal (2)
- Repositório Institucional UNESP - Universidade Estadual Paulista "Julio de Mesquita Filho" (13)
- Repositorio Institucional Universidad EAFIT - Medelin - Colombia (1)
- RUN (Repositório da Universidade Nova de Lisboa) - FCT (Faculdade de Cienecias e Technologia), Universidade Nova de Lisboa (UNL), Portugal (8)
- Savoirs UdeS : plateforme de diffusion de la production intellectuelle de l’Université de Sherbrooke - Canada (1)
- Scielo Saúde Pública - SP (4)
- Universidad de Alicante (5)
- Universidad Politécnica de Madrid (23)
- Universidade do Minho (6)
- Universidade dos Açores - Portugal (2)
- Universidade Federal do Pará (4)
- Universidade Metodista de São Paulo (3)
- Universitätsbibliothek Kassel, Universität Kassel, Germany (162)
- Université de Lausanne, Switzerland (26)
- Université de Montréal, Canada (4)
- University of Michigan (61)
- University of Queensland eSpace - Australia (11)
- University of Southampton, United Kingdom (1)
- University of Washington (3)
- WestminsterResearch - UK (1)
- Worcester Research and Publications - Worcester Research and Publications - UK (1)
Resumo:
We present MikeTalk, a text-to-audiovisual speech synthesizer which converts input text into an audiovisual speech stream. MikeTalk is built using visemes, which are a small set of images spanning a large range of mouth shapes. The visemes are acquired from a recorded visual corpus of a human subject which is specifically designed to elicit one instantiation of each viseme. Using optical flow methods, correspondence from every viseme to every other viseme is computed automatically. By morphing along this correspondence, a smooth transition between viseme images may be generated. A complete visual utterance is constructed by concatenating viseme transitions. Finally, phoneme and timing information extracted from a text-to-speech synthesizer is exploited to determine which viseme transitions to use, and the rate at which the morphing process should occur. In this manner, we are able to synchronize the visual speech stream with the audio speech stream, and hence give the impression of a photorealistic talking face.