9 resultados para audio-visual automatic speech recognition

em Acceda, el repositorio institucional de la Universidad de Las Palmas de Gran Canaria. España


Relevância:

100.00% 100.00%

Publicador:

Resumo:

[EN]Perceptual User Interfaces (PUIs) aim at facilitating human-computer interaction with the aid of human-like capacities (computer vision, speech recognition, etc.). In PUIs, the human face is a central element, since it conveys not only identity but also other important information, particularly with respect to the user’s mood or emotional state. This paper describes both a face detector and a smile detector for PUIs. Both are suitable for real-time interaction.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Automatic face recognition has been mainly tackled by matching a new image to a set of previously computed identity models. The literature describes approximations where those identity models are based on a single sample or a set of them. However, face representation keeps being a topic of great debate in the psychology literature, with some results suggesting the use of an average image. In this paper, instead of restricting our system to a fixed and precomputed classifier, the system learns iteratively based on the experience extracted from each meeting.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

[ES] Los erizos de mar han servido como modelo prototípico de organismo en el desarrollo de la Biología. La irrupción de este animal como especie invasora en los fondos canarios, combinada con el éxito reproductivo que ha tenido en nuestras aguas, ha creado un problema medioambiental importante que se ha intentado atajar con la puesta en marcha de proyectos e iniciativas orientados a su erradicación (matanzas masivas) o su contención con intentos de estimular su explotación comercial para uso gastronómico. En el transcurso de este trabajo se pretende explorar la robustez con la que se pueden clasificar visualmente diferentes tipos de erizos (principalmente Diadema antillarumy y Erizos autóctonos) a partir tanto de imágenes estáticas como de secuencias de vídeo para evaluar si, mediante el empleo de técnicas de visión por computador, es posible resolver estas tareas mediante la inspección automática de vídeos e imágenes.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

[EN]An accurate estimation of the number of people entering / leaving a controlled area is an interesting capability for automatic surveil- lance systems. Potential applications where this technology can be ap- plied include those related to security, safety, energy saving or fraud control. In this paper we present a novel con guration of a multi-sensor system combining both visual and range data specially suited for trou- blesome scenarios such as public transportation. The approach applies probabilistic estimation lters on raw sensor data to create intermediate level hypothesis that are later fused using a certainty-based integration stage. Promising results have been obtained in several tests performed on a realistic test bed scenario under variable lightning conditions.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The automatic extraction of biometric descriptors of anonymous people is a challenging scenario in camera networks. This task is typically accomplished making use of visual information. Calibrated RGBD sensors make possible the extraction of point cloud information. We present a novel approach for people semantic description and re-identification using the individual point cloud information. The proposal combines the use of simple geometric features with point cloud features based on surface normals.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

[EN]During the last decade, researchers have verified that clothing can provide information for gender recognition. However, before extracting features, it is necessary to segment the clothing region. We introduce a new clothes segmentation method based on the application of the GrabCut technique over a trixel mesh, obtaining very promising results for a close to real time system. Finally, the clothing features are combined with facial and head context information to outperform previous results in gender recognition with a public database.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

[EN]Different researches suggest that inner facial features are not the only discriminative features for tasks such as person identification or gender classification. Indeed, they have shown an influence of features which are part of the local face context, such as hair, on these tasks. However, object-centered approaches which ignore local context dominate the research in computational vision based facial analysis. In this paper, we performed an analysis to study which areas and which resolutions are diagnostic for the gender classification problem. We first demonstrate the importance of contextual features in human observers for gender classification using a psychophysical ”bubbles” technique.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

[EN]In this paper a system for face recognition from a tabula rasa (i.e. blank slate) perspective is described. A priori, the system has the only ability to detect automatically faces and represent them in a space of reduced dimension. Later, the system is exposed to over 400 different identities, observing its recognition performance evolution. The preliminary results achieved indicate on the one side that the system is able to reject most of unknown individuals after an initialization stage.