946 results for Audio-visual materials


Relevance:

90.00%

Publisher:

Abstract:

The performance of visual speech recognition (VSR) systems is significantly influenced by the accuracy of the visual front-end. Current state-of-the-art VSR systems use off-the-shelf face detectors such as Viola-Jones (VJ), which have limited reliability under changes in illumination and head pose. For a VSR system to perform well under these conditions, an accurate visual front-end is required. This is an important problem to solve in many practical implementations of audio-visual speech recognition systems, for example in automotive environments for an efficient human-vehicle computer interface. In this paper, we re-examine the current state of the art in VSR by comparing off-the-shelf face detectors with the recently developed Fourier Lucas-Kanade (FLK) image alignment technique. A variety of image alignment and visual speech recognition experiments are performed on a clean dataset as well as on a challenging automotive audio-visual speech dataset. Our results indicate that the FLK image alignment technique can significantly outperform off-the-shelf face detectors, but requires frequent fine-tuning.

Relevance:

90.00%

Publisher:

Abstract:

Spoken term detection (STD) is the task of looking up a spoken term in a large volume of speech segments. To provide fast search, speech segments are first indexed into an intermediate representation using speech recognition engines, which provide multiple hypotheses for each speech segment. Approximate matching techniques are usually applied at the search stage to compensate for the poor performance of automatic speech recognition engines during indexing. Recently, using visual information in addition to audio information has been shown to improve phone recognition performance, particularly in noisy environments. In this paper, we make use of visual information, in the form of the speaker's lip movements, in the indexing stage and investigate its effect on STD performance. In particular, we investigate whether gains in phone recognition accuracy carry through the approximate matching stage to provide similar gains in the final audio-visual STD system over a traditional audio-only approach. We also investigate the effect of using visual information on STD performance in different noise environments.

Relevance:

90.00%

Publisher:

Abstract:

MACHAL, the acronym for “Mitnadvei Hutz LaAretz” ("Volunteers from Abroad"), consisted of about 3500 men and women from over 40 countries and from a variety of social and religious backgrounds who volunteered to fight for the establishment of Israel. This collection is unique in that it deals specifically with the experience of MACHAL and Aliyah Bet volunteers from Canada and the United States, and of others living in the United States. The collection consists of files on 500 volunteers, over 2000 original and reproduction photographs, numerous audio-visual materials, books, manuscripts, and memoirs.

Relevance:

90.00%

Publisher:

Abstract:

This paper gives an introduction to "Interculture TV", an educational videocast project initiated by the Department of "Intercultural Studies and Business Communications" at the Friedrich Schiller University, Jena. The project provides open access to audio-visual teaching and learning materials produced by intercultural student work groups and offers opportunities for cooperation. Starting from a definition of the term "educast", the article analyses the videocast episodes on Interculture TV and discusses their potential for intercultural instruction and learning.

Relevance:

90.00%

Publisher:

Abstract:

Musical improvisation combines technical proficiency and musical intuition. Due to its interactive nature, improvisation provides an avenue of communication among all art forms. This dissertation project explores the collaborative aspects of improvisation involving a musician, a visual artist, a small group of dancers, and a videographer. Video footage from two separate recording sessions provided hours of visual material, which was studied and edited. The first session was a live performance recorded in front of a studio audience. The second session was a two-day collaboration between musician and dancers in a studio space. The process of editing and compiling images with audio, an important element in this project, presented many unforeseen challenges and lessons. This recorded dissertation comprises seven music videos that demonstrate my ability as an artist in collaboration with visual artist-professor Richard Klank, dancers David Yates, Jamie Garcia, Raha Behnam, Rachel Wolfe and Adrian Galvin, and video artist Nguyen Nguyen. Each video represents an individual creative process involving musical performance, studio lighting, sound recording, and video editing.

Relevance:

90.00%

Publisher:

Abstract:

The aim of this work is to show the type of media coverage that the newspapers La Razón, El País and Público gave to the 15-M social movement during the encampment at Sol, and specifically how the characterization of the “indignados” (outraged) was constructed. Based on our previous descriptive observations, we undertook a visual analysis of the photographs published in the print editions of those mainstream media from May 15 to June 12, 2011. We started from a total sample of 379 items, developing: 1) a content analysis of La Razón, El País, and Público, covering the most frequent words in each outlet and a classification of articles according to the evaluations found in them (expository, positive evaluation, negative evaluation); 2) an analysis of the 408 images obtained from the total sample, which establishes a clear evolution of the “indignados” profile and shows how differently each outlet treated the movement as such, that is, the point at which they stopped naming them “indignados” and recognized its nature as a social movement by calling it “Movimiento 15-M”...

Relevance:

90.00%

Publisher:

Abstract:

Visual literacy is essential for 21st-century learners. Across the higher education curriculum, students are being asked to use and produce images and visual media in their academic work, and they must be prepared to do so. The Association of College and Research Libraries has published the Visual Literacy Competency Standards for Higher Education, which, for the first time, outline specific visual literacy learning outcomes. These Standards present new opportunities for libraries to expand their role in student learning through standards-based teaching and assessment, and to contribute to campus-wide collaborative efforts to develop students’ skills and critical thinking with regard to visual materials.

Relevance:

90.00%

Publisher:

Abstract:

To create an audio-visual material. To improve the quality of teaching. To study the application of audio-visual programmes in the classroom. To seek a methodology suited to the didactic use of audio-visual media. To check the differences that may exist between different audio-visual media (slides versus video). The sample consists of the pupils of three second-year BUP classes at the Colegio Escoles Pies de Sarrià (Barcelona): 102 subjects in total, all of whom studied first-year BUP at the same school. The theoretical framework is presented. The variables are described (audio-visual media, school performance, prior school performance, methodology, intelligence, social class, teacher and age). The sample is described. The sample is divided into three classes (no audio-visual medium, video, slides). The audio-visual material is produced. The relevant sessions are held in each class. The objective test is administered. The data are analysed. Conclusions and alternatives are offered. Objective achievement test. Test d'aptituds diferencials (differential aptitude test). Scale of previous scores. Difference of means, descriptive statistics, analysis of variance and the Scheffé test, to establish whether there are differences between the groups that worked with an audio-visual medium, with a visual medium and without an audio-visual medium. The experimental methodology applied did not produce the expected results; there are reasons to believe that uncontrolled factors external to the experiment intervened. The pupils showed great interest in the use of video as a motivating element. The importance of working in this field by creating suitable active methodologies and valid programme series is noted. Intensive research into the possibilities and effects of such methodologies is needed.

Relevance:

90.00%

Publisher:

Abstract:

This paper studies the auditory, visual and combined audio-visual recognition of vowels by severely and profoundly hearing-impaired children.

Relevance:

90.00%

Publisher:

Abstract:

Recent interest in material objects - the things of everyday interaction - has led to articulations of their role in the literature on organizational knowledge and learning. What is missing is a sense of how the use of these 'things' is patterned across both industrial settings and time. This research addresses this gap with a particular emphasis on visual materials. Practices are analysed in two contrasting design settings: a capital goods manufacturer and an architectural firm. Materials are observed to be treated both as frozen, and hence unavailable for change; and as fluid, open and dynamic. In each setting temporal patterns of unfreezing and refreezing are associated with the different types of materials used. The research suggests that these differing patterns or rhythms of visual practice are important in the evolution of knowledge and in structuring social relations for delivery. Hence, to improve their performance practitioners should not only consider the types of media they use, but also reflect on the pace and style of their interactions.

Relevance:

90.00%

Publisher:

Abstract:

This bachelor's thesis examines the film sound in the war films Apocalypse Now and Saving Private Ryan. The aim is to contribute to a better understanding of the uses and functions of film sound, primarily for the films in question but also for war films in general. Film sound in this context comprises all sound in a film, excluding all non-diegetic music. Both films were examined by means of an audio-visual analysis. Such an analysis is carried out by scrutinising each film's sound and picture content separately, and finally examining the same film sequence as a whole once sound and picture have been put back together. The audio-visual analysis method used in the thesis is Michel Chion's method, Masking. The 30 minutes of film analysed were then placed in different film-sound zones, where the sound content of each zone showed, among other things, the main functions that the film sound had in these films. These functions are to maintain the viewer's focus and interest, to create closeness to the characters, and to add a strong sense of realism and presence. The intention behind the film sound appeared to be to move the viewer into the film's reality and to let the viewer become one with the film. Conveying this sense of realism, presence, focus and interest also proved to be the intention already present in the pre-production stages of both films. This shows that the filmmakers achieved what they set out to do. Whether film sound has been used in the same way, or has the same functions, in war films in general cannot, however, be determined.

Relevance:

90.00%

Publisher:

Abstract:

User-centred system design and evaluation has become an increasingly important tool for fostering better usability in the field of virtual environments (VEs). In recent years, although it is still the norm for designers and developers to concentrate on technological advancement and to strive for impressive multimodal, multisensory interfaces, development teams have become increasingly aware that, in order to produce usable and useful interfaces, it is essential to keep users in mind during design and to validate a new design from the users' perspective. In this paper, we describe a user study carried out to validate a newly developed haptically enabled virtual training system. Taking into consideration the complexity of individual differences in human performance, adoption and acceptance of haptic and audio-visual I/O devices, we address how well users learn, perform, adapt to and perceive object assembly training. We also explore user experience and interaction with the system, and discuss how multisensory feedback affects user performance, perception and acceptance. Finally, we discuss how to better design VEs that enhance users' perception, interaction and motor activity.

Relevance:

90.00%

Publisher:

Abstract:

This paper addresses the coordinated use of video and audio cues to capture and index surveillance events with multimodal labels. The focus of this paper is the development of a joint-sensor calibration technique that uses audio-visual observations to improve the calibration process. One significant feature of this approach is the ability to continuously check and update the calibration status of the sensor suite, making it resilient to independent drift in the individual sensors. We present scenarios in which this system is used to enhance surveillance.

Relevance:

90.00%

Publisher:

Abstract:

This article continues the results presented in Camargo and Nardi (Revista Brasileira de Ensino de Física 29, 117 (2007)). It is part of a study that seeks to understand the main barriers to the inclusion of students with visual impairment in the context of physics teaching. Focusing on optics classes, it analyses the communication difficulties between pre-service teachers and students with visual impairment. To that end, it emphasises the empirical and semantic-sensory structures of the languages used, indicating the factors that generate accessibility difficulties in the information conveyed. It also recommends alternatives intended to enable the effective participation of students with visual impairment in the communicative process, notably: identifying the semantic-sensory structure of the meanings conveyed, knowing the student's visual history, dismantling the interdependent audio-visual empirical structure, and exploring the communicative potential of languages built on empirical structures that can be accessed independently of vision. It concludes that communication is the main barrier to the effective participation of students with visual impairment in optics classes and emphasises the importance of creating adequate communication channels as a basic condition for the inclusion of these students.