12 resultados para Video Processing
em Universitat de Girona, Spain
Resumo:
In the context of the round table the following topics related to image colour processing will be discussed: historical point of view. Studies of Aguilonius, Gerritsen, Newton and Maxwell. CIE standard (Commission International de lpsilaEclaraige). Colour models. RGB, HIS, etc. Colour segmentation based on HSI model. Industrial applications. Summary and discussion. At the end, video images showing the robustness of colour in front of B/W images will be presented
Resumo:
A visual SLAM system has been implemented and optimised for real-time deployment on an AUV equipped with calibrated stereo cameras. The system incorporates a novel approach to landmark description in which landmarks are local sub maps that consist of a cloud of 3D points and their associated SIFT/SURF descriptors. Landmarks are also sparsely distributed which simplifies and accelerates data association and map updates. In addition to landmark-based localisation the system utilises visual odometry to estimate the pose of the vehicle in 6 degrees of freedom by identifying temporal matches between consecutive local sub maps and computing the motion. Both the extended Kalman filter and unscented Kalman filter have been considered for filtering the observations. The output of the filter is also smoothed using the Rauch-Tung-Striebel (RTS) method to obtain a better alignment of the sequence of local sub maps and to deliver a large-scale 3D acquisition of the surveyed area. Synthetic experiments have been performed using a simulation environment in which ray tracing is used to generate synthetic images for the stereo system
Resumo:
L’objectiu d’aquest PFC és el desenvolupament d’una eina pel modelatge procedural d’edificis i altres estructures arquitectòniques. El modelatge d’edificis és, per si sol, un bon tema on aplicar‐hi la programació procedural. Un edifici normal compte sempre amb elements que es repeteixen en altura i amplada. El fet de “repetir” una tasca suggereix sempre l’aplicació d’algun tipus de procediment per tal de simplificar i reduir la feina de l’usuari a l’hora de desenvolupar aquesta feina
Resumo:
We investigate whether dimensionality reduction using a latent generative model is beneficial for the task of weakly supervised scene classification. In detail, we are given a set of labeled images of scenes (for example, coast, forest, city, river, etc.), and our objective is to classify a new image into one of these categories. Our approach consists of first discovering latent ";topics"; using probabilistic Latent Semantic Analysis (pLSA), a generative model from the statistical text literature here applied to a bag of visual words representation for each image, and subsequently, training a multiway classifier on the topic distribution vector for each image. We compare this approach to that of representing each image by a bag of visual words vector directly and training a multiway classifier on these vectors. To this end, we introduce a novel vocabulary using dense color SIFT descriptors and then investigate the classification performance under changes in the size of the visual vocabulary, the number of latent topics learned, and the type of discriminative classifier used (k-nearest neighbor or SVM). We achieve superior classification performance to recent publications that have used a bag of visual word representation, in all cases, using the authors' own data sets and testing protocols. We also investigate the gain in adding spatial information. We show applications to image retrieval with relevance feedback and to scene classification in videos
Resumo:
A common problem in video surveys in very shallow waters is the presence of strong light fluctuations, due to sun light refraction. Refracted sunlight casts fast moving patterns, which can significantly degrade the quality of the acquired data. Motivated by the growing need to improve the quality of shallow water imagery, we propose a method to remove sunlight patterns in video sequences. The method exploits the fact that video sequences allow several observations of the same area of the sea floor, over time. It is based on computing the image difference between a given reference frame and the temporal median of a registered set of neighboring images. A key observation is that this difference will have two components with separable spectral content. One is related to the illumination field (lower spatial frequencies) and the other to the registration error (higher frequencies). The illumination field, recovered by lowpass filtering, is used to correct the reference image. In addition to removing the sunflickering patterns, an important advantage of the approach is the ability to preserve the sharpness in corrected image, even in the presence of registration inaccuracies. The effectiveness of the method is illustrated in image sets acquired under strong camera motion containing non-rigid benthic structures. The results testify the good performance and generality of the approach
Resumo:
This paper presents a complete solution for creating accurate 3D textured models from monocular video sequences. The methods are developed within the framework of sequential structure from motion, where a 3D model of the environment is maintained and updated as new visual information becomes available. The camera position is recovered by directly associating the 3D scene model with local image observations. Compared to standard structure from motion techniques, this approach decreases the error accumulation while increasing the robustness to scene occlusions and feature association failures. The obtained 3D information is used to generate high quality, composite visual maps of the scene (mosaics). The visual maps are used to create texture-mapped, realistic views of the scene
Resumo:
Photo-mosaicing techniques have become popular for seafloor mapping in various marine science applications. However, the common methods cannot accurately map regions with high relief and topographical variations. Ortho-mosaicing borrowed from photogrammetry is an alternative technique that enables taking into account the 3-D shape of the terrain. A serious bottleneck is the volume of elevation information that needs to be estimated from the video data, fused, and processed for the generation of a composite ortho-photo that covers a relatively large seafloor area. We present a framework that combines the advantages of dense depth-map and 3-D feature estimation techniques based on visual motion cues. The main goal is to identify and reconstruct certain key terrain feature points that adequately represent the surface with minimal complexity in the form of piecewise planar patches. The proposed implementation utilizes local depth maps for feature selection, while tracking over several views enables 3-D reconstruction by bundle adjustment. Experimental results with synthetic and real data validate the effectiveness of the proposed approach
Resumo:
En el caso que nos ocupa, el trabajo a desarrollar con los videos estuvo enfocado principalmente a reconocer las formas de interacción que se dan entre la docente en formación y los niños, las habilidades para utilizar el cuerpo como una herramienta de expresión y comunicación, el uso de la voz como una herramienta que genera motivación, integración a la tarea, aceptación o rechazo, agrado, desagrado y construcción simbólica por mencionar las más importantes y en un menor nivel de importancia, pero también como una posibilidad, el reconocer aspectos didácticos de la práctica como son: pertinencia de la actividad, respuesta de los niños, situaciones que hayan alterado la práctica e incidentes críticos en general
Resumo:
El presente artículo describe tres estudios sobre la producción del verbo y la estructura argumental en niños con Trastorno Específico del Lenguaje (TEL) usando diferentes metodologías. El primero es un estudio observacional que usa una muestra de habla espontánea. El segundo usa una tarea experimental de denominación de oraciones como resultado de la observación de videos de acciones. El tercero comprende la tarea de denominación de oraciones con imágenes estáticas en eventos con diferente complejidad argumental. Aunque los datos concretos varían en función de la metodología usada, hay una clara evidencia de que los niños de habla catalana y española con TEL presentan especiales dificultades en la producción de verbos con una alta complejidad en relación a la estructura argumental y cometen errores en la especificación de los argumentos obligatorios. Se concluye que tanto limitaciones en el procesamiento como déficits en la representación semántica de los verbos pueden estar implicados en estas dificultades
Resumo:
Diffusion Tensor Imaging (DTI) is a new magnetic resonance imaging modality capable of producing quantitative maps of microscopic natural displacements of water molecules that occur in brain tissues as part of the physical diffusion process. This technique has become a powerful tool in the investigation of brain structure and function because it allows for in vivo measurements of white matter fiber orientation. The application of DTI in clinical practice requires specialized processing and visualization techniques to extract and represent acquired information in a comprehensible manner. Tracking techniques are used to infer patterns of continuity in the brain by following in a step-wise mode the path of a set of particles dropped into a vector field. In this way, white matter fiber maps can be obtained.
Resumo:
The main objective of this thesis was the integration of microstructure information in synoptic descriptors of turbulence, that reflects the mixing processes. Turbulent patches are intermittent in space and time, but they represent the dominant process for mixing. In this work, the properties of turbulent patches were considered the potential input for integrating the physical microscale measurements. The development of a method for integrating the properties of the turbulent patches required solving three main questions: a) how can we detect the turbulent patches from he microstructure measurements?; b) which are the most relevant properties of the turbulent patches?; and ) once an interval of time has been selected, what kind of synoptic parameters could better reflect the occurrence and properties of the turbulent patches? The answers to these questions were the final specific objectives of this thesis.
Resumo:
La sang és un subproducte amb un alt potencial de valorització que s'obté en quantitats importants en els escorxadors industrials. Actualment, la majoria de sistemes de recollida de la sang no segueixen unes mesures d'higiene estrictes, pel que esdevé un producte de baixa qualitat microbiològica. Conseqüentment, l'aprofitament de la sang és una sortida poc estimulant des del punt de vista econòmic, ja que acostuma a perdre les qualitats que permetrien l'obtenció de productes d'alt valor afegit. El capítol I del present treball s'inclou dins d'un projecte que proposa la inoculació de bacteris de l'àcid làctic (LAB) com un cultiu bioconservador de la sang, un sistema senzill i de baix cost que cerca l'estabilitat de la sang, tant microbiològica com fisicoquímica, durant el període del seu emmagatzematge. El capítol II s'emmarca dins d'un projecte que cerca la millora de l'aprofitament integral de la sang que, en el cas de la fracció plasmàtica, es centra en l'estudi de la funcionalitat dels seus principals constituents. Conèixer la contribució dels components majoritaris ha de permetre la millora de la funcionalitat dels ingredients alimentaris derivats. Els resultats presentats en aquesta tesi poden ajudar a la valorització de la sang porcina d'escorxadors industrials, mitjançant els coneixements adquirits pel que fa a la millora del seu sistema de recollida i del desenvolupament d'ingredients alimentaris amb interessants propietats funcionals.