873 results for Audio-visual Speech Recognition, Visual Feature Extraction, Free-parts, Monolithic, ROI
Abstract:
Accompanies: "Epidemias na escola? Só em filmes: possibilidades de contaminação na aprendizagem significativa" (Epidemics at school? Only in films: possibilities of contamination in meaningful learning)
Abstract:
The main aim of this report is to develop intercultural competence in the area of non-verbal language among Portuguese pupils in the 3rd cycle of basic education who are taking Spanish. Non-verbal language plays a major part in communication and, when one sets out to get to know another culture, it is essential to interpret the different systems in which each individual is embedded and interacts, because communicating effectively with the other requires knowledge of the symbolic structures and cultural codes intrinsic not only to the culture of a specific other but also to one's own sociocultural, historical-cultural and economic-cultural context. This work advocates intercultural teaching that promotes dialogue between cultures, in the knowledge that there are representations that must be deconstructed, as well as a specific non-verbal language that can interfere with the pragmatics of interculturality. It is an action-research project marked by two distinct stages: a first study aimed at making pupils aware that non-verbal communication is a competence that is taught and learned, and a second study devoted to culturally differentiating aspects of non-verbal language between Spain and Portugal, with a focus on cultural gestures and the treatment of time. The data to be analysed are the transcript of a recorded lesson, in which various audiovisual and written resources consistent with the syllabus units were used, and the answers to a questionnaire given to the intervention class and to a class of Spanish nationality that collaborated with it. Implementing these teaching activities and strategies led to the conclusion that, on the one hand, pupils interpret the different non-verbal codes from a universal perspective and, on the other, there is a strong influence of inherited stereotypes filtered through different historical periods. 
This study of the non-verbal also proved to be a very effective foundation for motivating learning in general and for enriching knowledge of the other's culture and of one's own, through the acquisition of communicative-functional non-verbal codes.
Abstract:
Though the trend rarely receives attention, since the 1970s many American filmmakers have been taking sound and music tropes from children’s films, television shows, and other forms of media and incorporating those sounds into films intended for adult audiences. Initially, these references might seem like regressive attempts at targeting some nostalgic desire to relive childhood. However, this dissertation asserts that these children’s sounds are instead designed to reconnect audience members with the multi-faceted fantasies and coping mechanisms that once, through children’s media, helped these audience members manage life’s anxieties. Because sound is the sense that Western audiences most associate with emotion and memory, it offers audiences immediate connection with these barely conscious longings. The first chapter turns to children’s media itself and analyzes Disney’s 1950s forays into television. The chapter argues that by selectively repurposing the gentlest sonic devices from the studio’s films, television shows like Disneyland created the studio’s signature sentimental “Disney sound.” As a result, a generation of baby boomers like Steven Spielberg comes of age and longs to recreate that comforting sound world. The second chapter thus focuses on Spielberg, who incorporates Disney music in films like Close Encounters of the Third Kind (1977). Rather than recreate Disney’s sound world, Spielberg uses this music as a springboard into a new realm I refer to as “sublime refuge” - an acoustic haven that combines overpowering sublimity and soothing comfort into one fantastical experience. The second half of the dissertation pivots into more experimental children’s cartoons like Gerald McBoing-Boing (1951) - cartoons that embrace audio-visual dissonance in ways that soothe even as they create tension through a phenomenon I call “comfortable discord.” In the final chapter, director Wes Anderson reveals that these sonic tensions have just as much appeal to adults. 
In films like The Royal Tenenbaums (2001), Anderson demonstrates that comfortable discord can simultaneously provide a balm for anxiety and create an open-ended space that makes empathetic connections between characters possible. The dissertation closes with a call to rethink nostalgia, not as a romanticization of the past, but rather as a reconnection with forgotten affective channels.
Abstract:
Forensic speaker comparison exams are complex and demand lengthy manual analysis. A method for automatic vowel recognition that provides feature extraction for acoustic analysis is proposed as a support tool for these exams. The proposal is based on formant measurements by LPC (Linear Predictive Coding), taken selectively according to fundamental frequency detection, zero-crossing rate, bandwidth and continuity, with clustering performed by the k-means method. Experiments using samples from three different databases have shown promising results: the regions corresponding to five of the Brazilian Portuguese vowels were successfully located, providing visualization of a speaker’s vocal tract behavior as well as detection of segments corresponding to target vowels.
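The clustering step mentioned in this abstract can be sketched as follows. This is a minimal illustration of plain k-means on (F1, F2) formant pairs, not the authors' implementation: the formant values, the use of three clusters instead of five, and the function name `kmeans` are all assumptions made for the example.

```python
from math import dist


def kmeans(points, centroids, iterations=20):
    """Plain Lloyd's k-means; `centroids` holds the initial guesses."""
    centroids = [tuple(c) for c in centroids]
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: each point goes to its nearest centroid.
        labels = [
            min(range(len(centroids)), key=lambda k: dist(p, centroids[k]))
            for p in points
        ]
        # Update step: each centroid becomes the mean of its members.
        new_centroids = []
        for k, old in enumerate(centroids):
            members = [p for p, lab in zip(points, labels) if lab == k]
            if members:
                new_centroids.append(
                    tuple(sum(axis) / len(members) for axis in zip(*members))
                )
            else:
                new_centroids.append(old)  # keep an empty cluster in place
        if new_centroids == centroids:
            break
        centroids = new_centroids
    return centroids, labels


# Hypothetical (F1, F2) measurements in Hz for three vowel-like regions.
formants = [
    (290, 2310), (310, 2250), (300, 2290),
    (740, 1320), (760, 1280), (750, 1300),
    (330, 790), (310, 820), (320, 800),
]
centroids, labels = kmeans(formants, [formants[0], formants[3], formants[6]])
print(labels)
```

Seeding the centroids by hand keeps the example deterministic; a real system would more likely use several random restarts or a seeding heuristic.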
Abstract:
Automatic analysis of human behaviour in large collections of videos is gaining interest, even more so with the advent of file sharing sites such as YouTube. However, challenges still exist owing to several factors such as inter- and intra-class variations, cluttered backgrounds, occlusion, camera motion, scale, view and illumination changes. This research focuses on modelling human behaviour for action recognition in videos. The developed techniques are validated on large-scale benchmark datasets and applied to real-world scenarios such as soccer videos. Three major contributions are made. The first concerns the choice of an appropriate feature representation for videos. This involved a study of state-of-the-art techniques for action recognition, feature extraction and dimensionality reduction, so as to yield the best performance with optimal computational requirements. Secondly, temporal modelling of human behaviour is performed. This involved frequency analysis and temporal integration of local information in the video frames to yield a temporal feature vector. Current practices mostly average the frame information over an entire video and neglect the temporal order. Lastly, the proposed framework is applied and further adapted to a real-world scenario, soccer videos. To this end, a dataset of video sequences depicting players falling is created from actual match data and used to evaluate the proposed framework experimentally.
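The contrast drawn above between averaging frames and keeping temporal information can be illustrated with a small sketch. It is a stand-in, not the thesis's method: the descriptor here is simply the magnitude of the first few discrete Fourier coefficients of each feature dimension over time, and the names `mean_pool` and `temporal_descriptor` are hypothetical.

```python
import cmath


def mean_pool(frames):
    """Average each feature dimension over all frames (order is lost)."""
    n = len(frames)
    return [sum(f[d] for f in frames) / n for d in range(len(frames[0]))]


def temporal_descriptor(frames, n_coeffs=3):
    """Magnitudes of the first DFT coefficients of each feature over time."""
    n = len(frames)
    desc = []
    for d in range(len(frames[0])):
        series = [f[d] for f in frames]
        for k in range(n_coeffs):
            coeff = sum(
                x * cmath.exp(-2j * cmath.pi * k * t / n)
                for t, x in enumerate(series)
            )
            desc.append(abs(coeff) / n)
    return desc


# Two toy "videos" with one feature per frame: same values, different order.
slow = [[0.0], [0.0], [1.0], [1.0]]
fast = [[0.0], [1.0], [0.0], [1.0]]

print(mean_pool(slow), mean_pool(fast))
print(temporal_descriptor(slow))
print(temporal_descriptor(fast))
```

Both sequences have the same average, so mean pooling cannot tell them apart, while the frequency-based descriptor separates the slow change from the fast alternation. Note that spectral magnitudes ignore circular shifts, so this particular descriptor captures rhythm rather than absolute timing.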
Abstract:
Training - Teachers
Abstract:
369 p.
Abstract:
The aim of the research is to determine the quantitative and qualitative contribution of audiovisual documentation to the news that television broadcasts daily. The fieldwork covers the years 1993 and 1994 and the channels broadcasting in the Spanish state. The study starts from a theoretical approach to journalistic documentation, audiovisual documentation and mass communication studies, and carries out fieldwork in three areas: 1) Analysis of daily news programmes from six television channels (ETB, TVE, Canal Sur, TV3, Antena 3 and Canal+), using three independent samples. 2) Analysis of the requests for audiovisual documentation made by news programme newsrooms to the documentation services. 3) A study of the functions, tasks, structure and organisation of television documentation services, based on surveys, visits and interviews. An annex provides the detailed analysis of 620 news items, as well as information on the documentation centres. The research concludes that audiovisual documentation is one of the constituent elements of current-affairs news, both because of its quantitative presence (more than 40% of the news items broadcast use it) and because of its qualitative contribution and its widespread use across all news sections. The conclusions indicate that the importance of a news item has a positive effect on the use of audiovisual documentation, and they summarise the functions of this documentation and the specific characteristics of its use. They confirm the feedback character of news documentation in television. They point to the use of this documentation as purely visual documentation. 
They also state that audiovisual documentation, besides contributing to production, adds to the quality of news programmes insofar as it makes it easier to offer more complete and contextualised information.
Abstract:
In this thesis we propose to explore the practical and pedagogical possibilities of an autopoietic approach to sound creation in cinema. Our main concern is to grasp the modalities of the ascesis specific to artists who engage in such an activity, understood as a "learning of the self by the self" (Foucault) undertaken in order to become the one who can make the work (a process of subjectivation), and the descriptive and operative role of this exercise, as an effort to think critically about one's own know-how, in the making of the work and the invention of possibilities in cinematic audio-visual writing. To this end, on the one hand, we study, on the basis of autopoietic testimonies, the reflexive relationship of three sound creators to their practice and their effort to think through (and put in place) the conditions for a practice and an aesthetics of film sound as a form of sound art in an audio-visual context, while working within a normalising framework: Randy Thom, Walter Murch and Franck Warner. On the other hand, we draw on various theoretical considerations (the theory of art in Deleuze and Guattari, "surécoute" (over-hearing) in Szendy, the history of poietics from Valéry onwards, etc.) and practical ones (musical research in Schaeffer, the master-apprentice relationship, the relations between automatism and thought in modern cinema in Artaud and Godard, etc.) in order to contextualise and analyse these experiences of creation, with the aim of problematising the figure of the artist-poietician on an ethical plane, in the wake of Foucault's theory of techniques of the self.
Abstract:
In this thesis, an image enhancement application is developed to help low-vision patients view images and watch videos on iPhones. The thesis makes two contributions. The first is a new image enhancement algorithm that incorporates human visual features. It is adapted from a wavelet-transform-based image enhancement algorithm developed by Dr. Jinshan Tang; unlike the original, the new algorithm incorporates human visual features, which makes it more effective. Experimental simulation results show that the proposed algorithm gives better visual results than the algorithm without visual features. The second contribution is a mobile image enhancement application, with which users with low vision can see clearer images on an iPhone running the application I have developed.
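The general idea of wavelet-based enhancement can be sketched on a single row of pixels. Everything specific here is an assumption for illustration: the abstract does not state which wavelet, how many decomposition levels, or what gain the algorithm uses, and the human-vision component is not modelled at all. The sketch uses a one-level Haar transform and a fixed gain on the detail coefficients.

```python
def haar_forward(signal):
    """One-level Haar transform: pairwise averages and half-differences."""
    pairs = list(zip(signal[0::2], signal[1::2]))
    approx = [(a + b) / 2 for a, b in pairs]
    detail = [(a - b) / 2 for a, b in pairs]
    return approx, detail


def haar_inverse(approx, detail):
    """Rebuild the signal from averages and half-differences."""
    out = []
    for a, d in zip(approx, detail):
        out.extend([a + d, a - d])
    return out


def enhance_row(row, gain=2.0):
    """Boost local contrast by scaling the detail coefficients.

    `row` must have an even number of samples; output is clipped to 0..255.
    """
    approx, detail = haar_forward(row)
    boosted = [gain * d for d in detail]
    return [min(255.0, max(0.0, v)) for v in haar_inverse(approx, boosted)]


row = [100, 120, 120, 100]
print(enhance_row(row, gain=1.0))  # unchanged when gain is 1
print(enhance_row(row, gain=2.0))  # neighbouring differences are doubled
```

A two-dimensional version would apply the same transform along rows and then columns; a perceptually motivated method would presumably choose the gain per coefficient rather than globally.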
Abstract:
Analyzing the characteristics of complex systems and predicting their dynamic behavior over time calls for comprehensive research into systems that can intelligently adapt to evolving conditions and infer new knowledge with algorithms that are not predesigned. This dissertation studies the integration of techniques and methodologies from the fields of pattern recognition, intelligent agents, artificial immune systems, and distributed computing platforms, to create technologies that can more accurately describe and control the dynamics of real-world complex systems. The need for such technologies is emerging in manufacturing, transportation, hazard mitigation, weather and climate prediction, homeland security, and emergency response. Motivated by the ability of mobile agents to dynamically incorporate additional computational and control algorithms into executing applications, this research employs mobile agent technology for adaptive sensing and monitoring in a wireless sensor network. Mobile agents are software components that can travel from one computing platform to another in a network, carrying the programs and data states needed to perform their assigned tasks. To support the generation, migration, communication, and management of mobile monitoring agents, an embeddable mobile agent system (Mobile-C) is integrated with sensor nodes. Mobile monitoring agents visit distributed sensor nodes, read real-time sensor data, and perform anomaly detection using the pattern recognition algorithms they carry. Optimal control of the agents is achieved by mimicking the adaptive immune response and applying multi-objective optimization algorithms. The mobile agent approach has the potential to reduce the communication load and energy consumption in monitoring networks. 
The major research work of this dissertation project includes: (1) studying effective feature extraction methods for time series measurement data; (2) investigating the impact of the feature extraction methods and dissimilarity measures on the performance of pattern recognition; (3) researching the effects of environmental factors on the performance of pattern recognition; (4) integrating an embeddable mobile agent system with wireless sensor nodes; (5) optimizing agent generation and distribution using artificial immune system concepts and multi-objective algorithms; (6) applying mobile agent technology and pattern recognition algorithms to adaptive structural health monitoring and driving cycle pattern recognition; (7) developing a web-based monitoring network to enable remote visualization and analysis of real-time sensor data. Techniques and algorithms developed in this dissertation project will contribute to research advances in networked distributed systems operating under changing environments.
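Items (1) and (2) above, feature extraction from time series and a dissimilarity measure, can be illustrated with a small anomaly check of the kind a monitoring agent might run on a sensor node. This is only a sketch under assumptions: the two features (mean and standard deviation), the Euclidean dissimilarity, the threshold value and the function names are all chosen for the example and are not taken from the dissertation.

```python
from math import dist, sqrt


def features(window):
    """Summarise a window of readings as (mean, standard deviation)."""
    mean = sum(window) / len(window)
    std = sqrt(sum((x - mean) ** 2 for x in window) / len(window))
    return (mean, std)


def is_anomalous(window, baseline_features, threshold):
    """Flag a window whose features lie far from the baseline features."""
    return dist(features(window), baseline_features) > threshold


# Hypothetical readings: a quiet baseline, a normal window, a disturbed one.
baseline = features([1.0, 1.0, 1.0, 1.0])
normal_window = [1.0, 1.1, 0.9, 1.0]
disturbed_window = [1.0, 3.0, -1.0, 1.0]

print(is_anomalous(normal_window, baseline, threshold=0.5))
print(is_anomalous(disturbed_window, baseline, threshold=0.5))
```

Because only the small feature tuple and a boolean need to leave the node, a scheme of this shape is consistent with the reduced communication load the abstract mentions.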
Abstract:
This paper describes a novel algorithm for tracking the motion of the urethra from trans-perineal ultrasound. Our work is based on the structure-from-motion paradigm and therefore copes well with structures that have ill-defined and partially missing boundaries. The proposed approach is particularly well-suited to video sequences of low resolution and variable levels of blurriness introduced by anatomical motion of variable speed. Our tracking method identifies feature points on a frame-by-frame basis using the SURF detector/descriptor. Inter-frame correspondence is achieved using nearest-neighbor matching in the feature space. The motion is estimated using a non-linear bi-quadratic model, which adequately describes the deformable motion of the urethra. Experimental results are promising and show that our algorithm performs well when compared to manual tracking.
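The inter-frame correspondence step can be sketched as a brute-force nearest-neighbor search over descriptors. The sketch assumes Euclidean distance and uses made-up three-dimensional vectors in place of real SURF descriptors; the function name `match_features` is hypothetical, and the paper's detector and bi-quadratic motion model are not reproduced here.

```python
from math import dist


def match_features(desc_a, desc_b):
    """For each descriptor in frame A, return the index of the closest
    descriptor in frame B (Euclidean distance in feature space)."""
    return [
        min(range(len(desc_b)), key=lambda j: dist(d, desc_b[j]))
        for d in desc_a
    ]


# Toy descriptors for two consecutive frames (values are illustrative).
frame_a = [(0.10, 0.90, 0.00), (0.80, 0.10, 0.10)]
frame_b = [(0.82, 0.12, 0.08), (0.09, 0.88, 0.02)]

print(match_features(frame_a, frame_b))
```

In practice a matcher usually also rejects ambiguous pairs, for example by comparing the best and second-best distances, before the matches are passed to motion estimation.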
Abstract:
The purpose of this work-in-progress study was to test the concept of recognising plants using images acquired by image sensors in a controlled, noise-free environment. The presence of vegetation on railway trackbeds and embankments presents potential problems. Woody plants (e.g. Scots pine, Norway spruce and birch) often establish themselves on railway trackbeds. This may cause problems because legal herbicides are not effective in controlling them; this is particularly the case for conifers. Thus, if maintenance administrators knew the spatial position of plants along the railway system, it might be feasible to harvest them mechanically. Primary data were collected outdoors, comprising around 700 leaves and conifer seedlings from 11 species. These were then photographed in a laboratory environment. In order to classify the species in the acquired image set, a machine learning approach known as Bag-of-Features (BoF) was chosen. Irrespective of the chosen type of feature extraction and classifier, the ability to classify a previously unseen plant correctly was greater than 85%. The maintenance planning of vegetation control could be improved if plants were recognised and localised: woody plants in particular could be harvested mechanically, and listed endangered species growing on the trackbeds could be avoided. Both cases are likely to reduce the amount of herbicide used, which is often in line with public opinion. Bearing in mind that natural objects like plants are often more heterogeneous within their own class than outside it, the results do indeed present a stable classification performance, which is a sound prerequisite for later taking the next step of including a natural background. Where relevant, species can also be listed under the Endangered Species Act.
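The core of a Bag-of-Features representation can be sketched in a few lines: local descriptors are assigned to their nearest "visual word" and the image is summarised as a normalised histogram of word counts. The codebook and descriptors below are toy two-dimensional values chosen for illustration, and `bof_histogram` is a hypothetical name; the study's actual feature extractors and classifiers are not specified in the abstract.

```python
from math import dist


def bof_histogram(descriptors, codebook):
    """Normalised histogram of nearest-codeword assignments."""
    counts = [0] * len(codebook)
    for d in descriptors:
        nearest = min(range(len(codebook)), key=lambda k: dist(d, codebook[k]))
        counts[nearest] += 1
    total = sum(counts)
    if total == 0:
        return [0.0] * len(codebook)
    return [c / total for c in counts]


# A toy codebook with two visual words and four local descriptors.
codebook = [(0.0, 0.0), (1.0, 1.0)]
descriptors = [(0.1, 0.0), (0.9, 1.0), (1.1, 0.9), (1.0, 1.2)]

print(bof_histogram(descriptors, codebook))
```

The codebook itself is typically learned by clustering descriptors from training images, and the resulting fixed-length histograms are what a classifier is trained on.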