887 resultados para Audio-visual library service.
Resumo:
For the first time in this paper the authors present results showing the effect of out of plane speaker head pose variation on a lip biometric based speaker verification system. Using appearance DCT based features, they adopt a Mutual Information analysis technique to highlight the class discriminant DCT components most robust to changes in out of plane pose. Experiments are conducted using the initial phase of a new multi view Audio-Visual database designed for research and development of pose-invariant speech and speaker recognition. They show that verification performance can be improved by substituting higher order horizontal DCT components for vertical, particularly in the case of a train/test pose angle mismatch.
Resumo:
A software system, recently developed by the authors for the efficient capturing, editing, and delivery of audio-visual web lectures, was used to create a series of lectures for a first-year undergraduate course in Dynamics. These web lectures were developed to serve as an extra study resource for students attending lectures and not as a replacement. A questionnaire was produced to obtain feedback from students. The overall response was very favorable and numerous requests were made for other lecturers to adopt this technology. Despite the students' approval of this added resource, there was no significant improvement in overall examination performance
Resumo:
For the first time in this paper we present results showing the effect of speaker head pose angle on automatic lip-reading performance over a wide range of closely spaced angles. We analyse the effect head pose has upon the features themselves and show that by selecting coefficients with minimum variance w.r.t. pose angle, recognition performance can be improved when train-test pose angles differ. Experiments are conducted using the initial phase of a unique multi view Audio-Visual database designed specifically for research and development of pose-invariant lip-reading systems. We firstly show that it is the higher order horizontal spatial frequency components that become most detrimental as the pose deviates. Secondly we assess the performance of different feature selection masks across a range of pose angles including a new mask based on Minimum Cross-Pose Variance coefficients. We report a relative improvement of 50% in Word Error Rate when using our selection mask over a common energy based selection during profile view lip-reading.
Resumo:
This paper presents a novel method of audio-visual feature-level fusion for person identification where both the speech and facial modalities may be corrupted, and there is a lack of prior knowledge about the corruption. Furthermore, we assume there are limited amount of training data for each modality (e.g., a short training speech segment and a single training facial image for each person). A new multimodal feature representation and a modified cosine similarity are introduced to combine and compare bimodal features with limited training data, as well as vastly differing data rates and feature sizes. Optimal feature selection and multicondition training are used to reduce the mismatch between training and testing, thereby making the system robust to unknown bimodal corruption. Experiments have been carried out on a bimodal dataset created from the SPIDRE speaker recognition database and AR face recognition database with variable noise corruption of speech and occlusion in the face images. The system's speaker identification performance on the SPIDRE database, and facial identification performance on the AR database, is comparable with the literature. Combining both modalities using the new method of multimodal fusion leads to significantly improved accuracy over the unimodal systems, even when both modalities have been corrupted. The new method also shows improved identification accuracy compared with the bimodal systems based on multicondition model training or missing-feature decoding alone.
Resumo:
The Routledge Guide to Interviewing sets out a well-tested and practical approach and methodology: what works, difficulties and dangers to avoid and key questions which must be answered before you set out. Background methodological issues and arguments are considered and drawn upon but the focus is on what is ethical, legally acceptable and productive:
-Rationale (why, what for, where, how)
-Ethics and Legalities (informed consent, data protection, risks, embargoes)
-Resources (organisational, technical, intellectual)
-Preparation (selecting and approaching interviewees, background and biographical research, establishing credentials, identifying topics)
-Technique (developing expertise and confidence)
-Audio-visual interviews
-Analysis (modes, methods, difficulties)
-Storage (archiving and long-term preservation)
-Sharing Resources (dissemination and development)
From death row to the mansion of a head of state, small kitchens and front parlours, to legislatures and presbyteries, Anna Bryson and Seán McConville’s wide interviewing experience has been condensed into this book. The material set out here has been acquired by trial, error and reflection over a period of more than four decades. The interviewees have ranged from the delightfully straightforward to the painfully difficult to the near impossible – with a sprinkling of those that were impossible.
Successful interviewing draws on the survival skills of everyday life. This guide will help you to adapt, develop and apply these innate skills. Including a range of useful information such as sample waivers, internet resources, useful hints and checklists, it provides sound and plain-speaking support for the oral historian, social scientist and investigator.
Resumo:
Existing referencing systems frequently prove inadequate for the citation of moving image and sound media such as vidcasts, streaming television, sound files, un-catalogued archive footage, amateur content hosted online or
non-broadcast radio recordings. Back in 2009 and 2010 a British working group funded by Higher Education Funding Council for England (HEFCE) and co-ordinated by the British Universities Film and Video Council investigated this problem. This report documents the early stages of the project.
Resumo:
This paper presents a novel method of audio-visual fusion for person identification where both the speech and facial modalities may be corrupted, and there is a lack of prior knowledge about the corruption. Furthermore, we assume there is a limited amount of training data for each modality (e.g., a short training speech segment and a single training facial image for each person). A new representation and a modified cosine similarity are introduced for combining and comparing bimodal features with limited training data as well as vastly differing data rates and feature sizes. Optimal feature selection and multicondition training are used to reduce the mismatch between training and testing, thereby making the system robust to unknown bimodal corruption. Experiments have been carried out on a bimodal data set created from the SPIDRE and AR databases with variable noise corruption of speech and occlusion in the face images. The new method has demonstrated improved recognition accuracy.
Resumo:
Through the concept of sonic resonance, the project Cidade Museu – Museum City explores five derelict or transitional spaces in the city of Viseu. The activation and capture of these spaces develops an audio- visual memory that reflects architectures, stories and experiences, while creating a sense of place through sounds and images.
The project brings together musicians with a background in contemporary music, electroacoustic music and improvisation and a visual artist focusing on photography and video.
Each member of the collective explores the selected spaces in order to activate them with the help of their respective instruments and through sound projection in an iterative process in which the source of activation gradually gives way to the characteristics of each space, their resonances and acoustic characteristics. The museum city (a nickname for the city of Viseu), in this performance, exposes the contrast between the grandeur and multi-faceted architecture of Viseu’s Cathedral with spaces that spread throughout the city waiting for a new future.
The performance in the Cathedral (Sé) is characterised by a trio ensemble, an eight channel sound system and video projecting audio recordings and images made in each of the five spaces. The audience is invited to explore the relations between the various buildings and their stories while being immersed in their resonances and visual projections.
The performance explores the following spaces in Viseu: the old Orfeão (music hall), an old wine cellar, a mansion home to the national road services, a house with its grounds in Rua Silva Gaio and an old slaughterhouse.
Resumo:
Neste estudo, reflectimos sobre os critérios de recolha e de classificação do acervo de cancioneiro tradicional reunido por nós no concelho de Baião (distrito do Porto). Etapas fundamentais na constituição de um cancioneiro, são frequentemente sujeitas a erros que desvirtuam o produto final. Comentámos por isso os processos seguidos por alguns investigadores do folclore literário português, que, ao adoptarem metodologias desadequadas, deturparam a objectividade dos seus trabalhos. Com efeito, para além de incorrecções no sistema de classificação, vários autores alteraram a genuinidade de alguns originais, prejudicando assim a cientificidade da sua obra. Em relação à recolha, que comporta os registos escrito e electrónico (gravação sonora e audio-visual), a nossa experiência mostrou-nos que o próprio comportamento do intérprete e dos ouvintes, os comentários de agrado ou desaprovação e as correcções consideradas oportunas constituem valiosas informações para a compreensão do fenómeno poético oral.
Resumo:
Relatório da Prática de Ensino Supervisionada, Mestrado em Ensino de História e Geografia no 3º Ciclo do Ensino Básico e no Ensino Secundário, Universidade de Lisboa, 2014
Resumo:
Relatório da Prática de Ensino Supervisionada, Mestrado em Ensino de Português e de Alemão no 3º ciclo do Ensino Básico e no Ensino Secundário, Universidade de Lisboa, 2015
Resumo:
Energy-using products (EuPs), such as domestic appliances, audio-visual and ICT equipment contribute significantly to CO2 emissions, both in the domestic and non-domestic sectors. Policies that encourage the use of more energy efficient products can therefore generate significant reductions in overall energy consumption and hence, CO2 emissions. To the extent that these policies cause an increase the average production cost of EuPs, they may impose economic costs on producers, or on consumers, or on both. In this theoretical paper, an adaptation of a simple vertical product differentiation model – in which products are characterised in terms of their quality and their energy consumption – is used to analyse the impact of the different EuP polices on product innovation and to assess the resultant economic impacts on producers and consumers. It is shown that whereas the imposition of a binding product standard for energy efficiency unambiguously reduces aggregate profit and increases the average market price in the absence of any learning effects, the introduction or strengthening of demand-side measures (such as energy labelling) may reduce, or increase, aggregate profit. Even in the case where the overall impact is unambiguously negative, the effects of product innovation and learning can be in either direction.
Resumo:
Dissertação apresentada à Escola Superior de Comunicação Social como parte dos requisitos para obtenção de grau de mestre em Audiovisual e Multimédia.
Resumo:
Dissertação apresentada para obtenção do grau de Mestre em Educação Matemática na Educação Pré-Escolar e nos 1º e 2º Ciclos do Ensino Básico na especialidade de Didática da Matemática