954 results for video images
Abstract:
Vision-based object detection from a moving platform is particularly challenging in the field of advanced driver assistance systems (ADAS). In this context, onboard vision-based vehicle verification strategies become critical, facing challenges derived from the variability of vehicle appearance, illumination, and vehicle speed. In this paper, an optimized HOG configuration for onboard vehicle verification is proposed which considers not only spatial and orientation resolution, but also descriptor processing strategies and classification. An in-depth analysis of the optimal HOG settings for onboard vehicle verification is presented, in the context of SVM classification with different kernels. In contrast to many existing approaches, the evaluation is carried out on a public and heterogeneous database of vehicle and non-vehicle images in different areas of the road, yielding excellent verification rates that outperform other similar approaches in the literature.
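The spatial and orientation resolution mentioned in this abstract correspond to the cell size and the number of orientation bins of a HOG descriptor. The following is a minimal sketch of the histogram for a single cell, not the paper's optimized configuration: the 8x8 cell, the 9 unsigned bins, the hard (non-interpolated) binning and the function names are all assumptions made for illustration, and the SVM stage is left out.

```python
import math

def cell_histogram(cell, n_bins=9):
    """Magnitude-weighted histogram of unsigned gradient orientations for one cell."""
    height, width = len(cell), len(cell[0])
    hist = [0.0] * n_bins
    bin_width = 180.0 / n_bins
    for y in range(1, height - 1):
        for x in range(1, width - 1):
            # Central-difference gradients (border pixels are skipped).
            gx = cell[y][x + 1] - cell[y][x - 1]
            gy = cell[y + 1][x] - cell[y - 1][x]
            magnitude = math.hypot(gx, gy)
            # Unsigned orientation folded into [0, 180) degrees.
            angle = math.degrees(math.atan2(gy, gx)) % 180.0
            hist[int(angle // bin_width) % n_bins] += magnitude
    return hist

def l2_normalize(vec, eps=1e-6):
    """L2-normalize a descriptor; eps avoids division by zero on flat cells."""
    norm = math.sqrt(sum(v * v for v in vec) + eps * eps)
    return [v / norm for v in vec]

# Hypothetical 8x8 cell whose intensity grows from left to right.
cell = [[float(x) for x in range(8)] for _ in range(8)]
descriptor = l2_normalize(cell_histogram(cell))
```

In this toy cell every gradient points along the horizontal axis, so all the energy falls into the first orientation bin. A full descriptor would concatenate such histograms over overlapping blocks before passing them to the classifier.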
Abstract:
Vision-based systems for Sense-and-Avoid are becoming increasingly important as remotely piloted and autonomous UAVs become part of the non-segregated airspace. The development and evaluation of these systems demand flight scenario images, which are expensive and risky to obtain. Augmented Reality techniques now allow real flight scenario images to be composited with 3D aircraft models, producing useful, realistic images for system development and benchmarking at a much lower cost and risk. With the techniques presented in this paper, 3D aircraft models are first positioned in a simulated 3D scene with controlled illumination and rendering parameters. Realistic simulated images are then obtained using an image processing algorithm which fuses the images obtained from the 3D scene with images from real UAV flights, taking into account onboard camera vibrations. Since the intruder and camera poses are user-defined, ground truth data is available. These ground truth annotations make it possible to develop and quantitatively evaluate aircraft detection and tracking algorithms. This paper presents the software developed to create a public dataset of 24 videos together with their annotations and some tracking application results.
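The fusion step described in this abstract can be pictured as alpha compositing of a rendered layer over a real frame, with the layer shifted to mimic camera vibration. The sketch below is only an illustration under those assumptions: the paper's actual algorithm is not given in the abstract, and the function name, the per-pixel alpha map and the integer pixel offsets are hypothetical.

```python
def composite(background, render, alpha, dx=0, dy=0):
    """Alpha-blend a rendered layer over a background frame.

    The layer is shifted by (dx, dy) pixels, a stand-in for camera vibration.
    """
    height, width = len(background), len(background[0])
    out = [row[:] for row in background]
    for y in range(height):
        for x in range(width):
            sy, sx = y - dy, x - dx  # source pixel in the rendered layer
            if 0 <= sy < height and 0 <= sx < width:
                a = alpha[sy][sx]
                out[y][x] = a * render[sy][sx] + (1.0 - a) * background[y][x]
    return out

# Hypothetical 3x4 grayscale frame with a single opaque "aircraft" pixel.
background = [[0.2] * 4 for _ in range(3)]
render = [[0.0] * 4 for _ in range(3)]
alpha = [[0.0] * 4 for _ in range(3)]
render[1][1], alpha[1][1] = 1.0, 1.0

frame = composite(background, render, alpha, dx=1, dy=0)
# The offset is chosen by the user, so the aircraft position is known exactly.
ground_truth = (1 + 0, 1 + 1)  # (row, column) of the aircraft pixel in the output
```

Because the offset and the rendered position are both user-defined, the ground-truth location comes for free, which is the property the abstract relies on for quantitative evaluation.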
Abstract:
Embryogenesis is the process by which a single cell becomes a living being. Over successive stages of development, the cell population proliferates while the embryo takes shape and becomes organized. This is possible thanks to several genetic, biochemical and mechanical processes that interact and regulate one another, forming a complex system organized at different spatial and temporal scales. The process is robust and reproducible, yet shows a degree of variability that allows for diversity among individuals of the same species. The advent of fluorescence microscopy, made possible by fluorescent proteins that can be attached to the cells' expression chains, together with advances in the optical physics of microscopes, has made it possible to observe embryogenesis in vivo and to generate sequences of three-dimensional images with high spatio-temporal resolution. These images allow embryonic development to be studied with image and data analysis techniques, reconstructing the underlying processes to create the representation of a digital embryo. One of the most pressing current problems in the field is understanding the mechanical processes, both in isolation and in interaction with other factors such as gene expression, that allow the embryo to develop. Because of the complexity of these processes, these problems are addressed with different techniques and at specific scales where, through experiments, hypotheses can be posed and tested, yielding conclusions about how the studied mechanisms work. This doctoral thesis addresses this problem by seeking to improve state-of-the-art methodologies with a specific goal: to study the deformation patterns that emerge from the organized movement of cells during different stages of embryo development, either globally or in specific tissues. 
Studies have focused on mechanics in relation to signalling processes or interactions at the cell or tissue level. This work proposes a scheme to generalize the study of motion, and of the mechanical interactions that arise from it, to different spatial and temporal scales. This would allow not only local studies but also systematic studies of the scales of mechanical interaction within an embryo. The proposed scheme therefore sets aside the causes of motion (forces) and focuses on quantifying the kinematics (deformation and strains) from images in a non-invasive way. Today, experimental and methodological difficulties and the complexity of biological systems prevent a complete and systematic mechanical description. However, deformation patterns show the result of different mechanical factors interacting with other elements, giving rise to a mechanical organization, necessary for development, that can be quantified with the methodology proposed in this thesis. The methodology assumes a continuous medium with a Lagrangian description of the motion dynamics (in terms of the trajectories of material points moving through the system rather than fixed spatial points), estimated from the images by cell tracking methods or image registration techniques. With this scheme it is possible to describe the instantaneous deformation, and the deformation accumulated with respect to an initial state, for any domain of the embryo. Applying this methodology to 3D + t images of the zebrafish revealed mechanical structures that tend to stabilize over time in the embryo, that are organized at a scale similar to that of the cell differentiation map, and that show signs of correlation with gene expression patterns. 
The methodology was also applied to the study of the amnioserosa tissue of Drosophila (fruit fly) during dorsal closure, yielding evidence of a coupling between subcellular, cellular and supracellular scales that generates complex patterns in response to the force generated by the actomyosin skeletons. In short, this doctoral thesis proposes a novel strategy for analysing multi-scale cell dynamics that allows patterns to be quantified directly and that also offers a representation reconstructing the evolution of the processes as the cells experience them, rather than as they are observed from the microscope. This methodology therefore enables new ways of analysing and comparing embryos and tissues during embryogenesis from in-vivo images. ABSTRACT Embryogenesis is the process by which a single cell turns into a living organism. Through several stages of development, the cell population proliferates while the embryo takes shape and the organs develop and gain their functionality. This is possible through genetic, biochemical and mechanical factors that are involved in a complex interaction of processes organized at different levels and at different spatio-temporal scales. Despite this complexity, embryogenesis develops in a robust and reproducible way, while allowing the variability that makes the diversity of living specimens possible. Advances in the physics of microscopes and the appearance of fluorescent proteins that can be attached to expression chains, reporting on structural and functional elements of the cell, have enabled the in-vivo observation of embryogenesis. The imaging process results in sequences of high spatio-temporal resolution 3D+time data of embryogenesis, a digital representation of the embryos that can be further analyzed, provided new image processing and data analysis techniques are developed. 
One of the most relevant and challenging lines of research in the field is the quantification of the mechanical factors and processes involved in the shaping of the embryo and their interactions with other embryogenesis factors such as genetics. Due to the complexity of the processes, studies have focused on specific problems and scales controlled in the experiments, posing and testing hypotheses to gain new biological insight. However, methodologies are often difficult to transfer to the study of other biological phenomena or specimens. This PhD thesis is framed within this research paradigm and proposes a systematic methodology to quantify the deformation patterns that emerge from the motion estimated in in-vivo images of embryogenesis. With this strategy it would be possible not only to quantify local mechanisms but also to discover and characterize the scales of mechanical organization within the embryo. The framework focuses on quantifying the motion kinematics (deformation and strains) from images in a non-invasive way, neglecting the causes of the motion (forces). Experimental and methodological challenges hamper the quantification of exerted forces and of the mechanical properties of tissues. However, a descriptive framework of deformation patterns provides valuable insight into the organization and scales of the mechanical interactions throughout embryo development. Such a characterization would help to improve mechanical models and progressively understand the complexity of embryogenesis. This framework relies on a Lagrangian representation of the cell dynamics system based on the trajectories of points moving along the deformation. This approach enables the reconstruction of the mechanical patterning as experienced by the cells and tissues. Thus, we can build temporal profiles of deformation along stages of development, comprising both the instantaneous events and the cumulative deformation history. 
The application of this framework to 3D + time data of zebrafish embryogenesis allowed us to discover mechanical profiles that stabilized through time, forming structures that organize at a scale comparable to the map of cell differentiation (fate map) and that also suggest a correlation with genetic patterns. The framework was also applied to the analysis of the amnioserosa tissue in Drosophila dorsal closure, revealing that the oscillatory contraction triggered by the actomyosin network is organized in a complex way that couples different scales: local force generation foci, cellular morphology control mechanisms and tissue geometrical constraints. In summary, this PhD thesis proposes a theoretical framework for the analysis of multi-scale cell dynamics that enables mechanical patterns to be quantified automatically and also offers a new representation of the embryo dynamics as experienced by cells, rather than as captured instantaneously by the microscope. This framework therefore enables new strategies of quantitative analysis and comparison between embryos and tissues during embryogenesis from in-vivo images.
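As a rough illustration of what quantifying deformation from trajectories in a Lagrangian description can mean, the sketch below computes a deformation gradient from three tracked points and derives a strain tensor from it. It is a textbook continuum-mechanics construction, not the thesis's method: the 2D setting, the triangle of tracked points, the choice of the Green-Lagrange strain and all function names are assumptions.

```python
def matmul(a, b):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def transpose(a):
    return [[a[j][i] for j in range(2)] for i in range(2)]

def deformation_gradient(initial, current):
    """F such that current edge vectors = F * initial edge vectors.

    `initial` and `current` each hold three (x, y) positions of the same
    tracked material points at the reference time and at a later time.
    """
    (X0, X1, X2), (x0, x1, x2) = initial, current
    # Columns are the two edge vectors leaving the first point.
    D = [[X1[0] - X0[0], X2[0] - X0[0]], [X1[1] - X0[1], X2[1] - X0[1]]]
    d = [[x1[0] - x0[0], x2[0] - x0[0]], [x1[1] - x0[1], x2[1] - x0[1]]]
    det = D[0][0] * D[1][1] - D[0][1] * D[1][0]
    D_inv = [[D[1][1] / det, -D[0][1] / det], [-D[1][0] / det, D[0][0] / det]]
    return matmul(d, D_inv)

def green_lagrange_strain(F):
    """E = (F^T F - I) / 2, the deformation accumulated since the reference state."""
    C = matmul(transpose(F), F)
    return [[0.5 * (C[i][j] - (1.0 if i == j else 0.0)) for j in range(2)]
            for i in range(2)]

# Hypothetical tracked points: the tissue doubles in length along x.
initial = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]
current = [(0.0, 0.0), (2.0, 0.0), (0.0, 1.0)]
F = deformation_gradient(initial, current)
E = green_lagrange_strain(F)
```

Evaluating F between consecutive time points would give an instantaneous measure, while evaluating it against the initial state gives the cumulative one; both views are named in the abstract, although its exact descriptors are not specified there.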
Abstract:
Transects of a Remotely Operated Vehicle (ROV) providing sea-bed videos and photographs were carried out during POLARSTERN expedition ANT-XV/3 focussing on the ecology of benthic assemblages on the Antarctic shelf in the South-Eastern Weddell Sea. The ROV-system sprint 103 was equipped with two video cameras and one still camera, lights, flash-lights, compass, and parallel lasers providing a scale in the images, a tether-management system (TMS), a winch, and the board units. All cameras used the same main lens and could be tilted. Videos were recorded in Betacam format and (film) slides were taken at the discretion of the scientific pilot. The slides were taken mainly to improve the identification of organisms depicted in the videos, because the still photographs have a much higher optical resolution than the videos. Species larger than 3 mm are recognisable and countable in the photographs, and species larger than 1 cm in the videos. Under optimum conditions the transects were straight; the speed and direction of the ROV were determined by the drift of the ship in the coastal current, since both the ship and the ROV were used as a drifting system; the option to operate the vehicle actively was only used to avoid obstacles and to reach, at best, a distance of only approximately 30 cm to the sea-floor. As a consequence the width of the photographs in the foreground is approximately 50 cm. Deviations from this strategy resulted mainly from difficult ice and weather conditions but also from high current velocity and local up-welling close to the sea-bed. The sea-bed images provide insights into the general composition of key species, higher systematic groups and ecological guilds. Within interdisciplinary approaches, distributions of assemblages can be attributed to environmental conditions such as bathymetry, sediment characteristics, water masses and current regimes. The images also contain valuable information on how benthic species are associated with each other. 
Along the transects, small- to intermediate-scale disturbances, e.g. by grounding icebergs, were analysed, and the further impact on the entire benthic system through local succession of recolonisation was studied. This information can be used for models predicting the impact of climate change on benthic life in the Southern Ocean. All these approaches contribute to a better understanding of the functioning of the benthic system and related components of the entire Antarctic marine ecosystem. Besides their scientific value, the imaging methods meet concerns about the protection of sensitive Antarctic benthic systems, since they are non-invasive, and they also provide valuable material for education and outreach purposes.
Abstract:
Background: Flexible video bronchoscopes, in particular the Olympus BF Type 3C160, are commonly used in pediatric respiratory medicine. There are no data on the magnification and distortion effects of these bronchoscopes, yet important clinical decisions are made from the images. The aim of this study was to systematically describe the magnification and distortion of flexible bronchoscope images taken at various distances from the object. Methods: Using images of known objects and processing these by digital video and computer programs, both magnification and distortion scales were derived. Results: Magnification changes as a linear function between 100 mm (×1) and 10 mm (×9.55) and then as an exponential function between 10 mm and 3 mm (×40) from the object. Magnification depends on the axis of orientation of the object to the optic axis or geometrical axis of the bronchoscope. Magnification also varies across the field of view, with the central magnification being 39% greater than at the periphery of the field of view at 15 mm from the object. However, in the paediatric situation the diameter of the orifices is usually less than 10 mm, and this limits the exposure to these peripheral limits of magnification reduction. Intraclass correlations for measurements and repeatability studies between instruments are very high, r = 0.96. Distortion occurs as both barrel and geometric types, but both types are heterogeneous across the field of view. Distortion of geometric type ranges up to 30% at 3 mm from the object but may be as low as 5% depending on the position of the object in relation to the optic axis. Conclusion: We conclude that the optimal working distance range is between 40 and 10 mm from the object. However, the clinician should be cognisant of variations in both magnification and distortion when making clinical judgements.
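The piecewise behaviour reported in the Results can be turned into a small on-axis magnification model. The sketch below is a reconstruction under a strong assumption: the abstract gives only the endpoint values (×1 at 100 mm, ×9.55 at 10 mm, ×40 at 3 mm), so the linear and exponential segments here are simply forced through those points and are not the study's fitted curves. The function names are hypothetical, and off-axis variation and distortion are ignored.

```python
import math

def magnification(distance_mm):
    """Approximate on-axis magnification at a given working distance."""
    if not 3.0 <= distance_mm <= 100.0:
        raise ValueError("model only covers 3 mm to 100 mm")
    if distance_mm >= 10.0:
        # Linear segment through (100 mm, x1) and (10 mm, x9.55).
        return 1.0 + (9.55 - 1.0) * (100.0 - distance_mm) / 90.0
    # Exponential segment through (10 mm, x9.55) and (3 mm, x40).
    rate = math.log(40.0 / 9.55) / 7.0
    return 9.55 * math.exp(rate * (10.0 - distance_mm))

def true_size_mm(apparent_size_mm, distance_mm):
    """Undo the magnification to estimate the real size of an object."""
    return apparent_size_mm / magnification(distance_mm)

m_far = magnification(100.0)
m_mid = magnification(10.0)
m_near = magnification(3.0)
```

The steep growth below 10 mm shows why small errors in the assumed working distance translate into large errors in estimated size, which fits the abstract's recommendation to work between 40 and 10 mm.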
Abstract:
Digital still cameras capable of filming short video clips are readily available, but the quality of these recordings for telemedicine has not been reported. We performed a blinded study using four commonly available digital cameras. A simulated patient with a hemiplegic gait pattern was filmed by the same videographer in an identical, brightly lit indoor setting. Six neurologists viewed the blinded video clips on their PC and comparisons were made between cameras, between video clips recorded with and without a tripod, and between video clips filmed on high- or low-quality settings. Use of a tripod had a smaller effect than expected, while images taken on a high-quality setting were strongly preferred to those taken on a low-quality setting. Although there was some variability in video quality between selected cameras, all were of sufficient quality to identify physical signs such as gait and tremor. Adequate-quality video clips of movement disorders can be produced with low-cost cameras and transmitted by email for teleneurology purposes.
Abstract:
The oculomotor synergy as expressed by the CA/C and AC/A ratios was investigated to examine its influence on our previous observation that whereas convergence responses to stereoscopic images are generally stable, some individuals exhibit significant accommodative overshoot. Using a modified video refraction unit while viewing a stereoscopic LCD, accommodative and convergence responses to balanced and unbalanced vergence and focal stimuli (BVFS and UBVFS) were measured. Accommodative overshoot of at least 0.3 D was found in 3 out of 8 subjects for UBVFS. The accommodative response differential (RD) was taken to be the difference between the initial response and the subsequent mean static steady-state response. Without overshoot, RD was quantified by finding the initial response component. A mean RD of 0.11 +/- 0.27 D was found for the 1.0 D step UBVFS condition. The mean RD for the BVFS was 0.00 +/- 0.17 D. There was a significant positive correlation between CA/C ratio and RD (r = +0.75, n = 8, p <0.05) for only UBVFS. We propose that inter-subject variation in RD is influenced by the CA/C ratio as follows: an initial convergence response, induced by disparity of the image, generates convergence-driven accommodation commensurate with the CA/C ratio; the associated transient defocus subsequently decays to a balanced position between defocus-induced and convergence-induced accommodations.
Abstract:
A domain independent ICA-based approach to watermarking is presented. This approach can be used on images, music or video to embed either a robust or fragile watermark. In the case of robust watermarking, the method shows high information rate and robustness against malicious and non-malicious attacks, while keeping a low induced distortion. The fragile watermarking scheme, on the other hand, shows high sensitivity to tampering attempts while keeping the requirement for high information rate and low distortion. The improved performance is achieved by employing a set of statistically independent sources (the independent components) as the feature space and principled statistical decoding methods. The performance of the suggested method is compared to other state of the art approaches. The paper focuses on applying the method to digitized images although the same approach can be used for other media, such as music or video.
Abstract:
Using video refraction, accommodative and convergence dynamic responses were measured to stepped changes in convergence stimuli with unchanged accommodative stimuli (conflicting stereoscopic image) and compared with responses to non-conflicting target stimuli. Three targets were used that varied in their spatial frequency components. An accommodative transient overshoot was evident in four out of seven subjects for only conflicting stimuli. One subject showed accommodative and convergence oscillation, probably due to difficulty in fusing the stereoscopic target when it had a higher spatial component; however, this oscillation diminished when the target was spatially low-pass filtered. We hypothesise that transient responses to step stimuli are initiated by convergence-driven accommodation and subsequently followed by slower fine control of accommodation modulated by the amount of blur. Inter-subject differences in convergence-driven accommodation may also be a factor to consider. For stereoscopic stimuli, it is proposed that the increase in blur immediately after the onset of the accommodative response inhibits cessation of the response.
Abstract:
We investigate the problem of obtaining a dense reconstruction in real-time, from a live video stream. In recent years, multi-view stereo (MVS) has received considerable attention and a number of methods have been proposed. However, most methods operate under the assumption of a relatively sparse set of still images as input and unlimited computation time. Video based MVS has received less attention despite the fact that video sequences offer significant benefits in terms of usability of MVS systems. In this paper we propose a novel video based MVS algorithm that is suitable for real-time, interactive 3d modeling with a hand-held camera. The key idea is a per-pixel, probabilistic depth estimation scheme that updates posterior depth distributions with every new frame. The current implementation is capable of updating 15 million distributions/s. We evaluate the proposed method against the state-of-the-art real-time MVS method and show improvement in terms of accuracy. © 2011 Elsevier B.V. All rights reserved.
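The key idea in this abstract, a per-pixel posterior over depth that is refined with every new frame, can be illustrated with the simplest possible recursive Bayesian update. The sketch below assumes a Gaussian posterior and Gaussian measurement noise per pixel; the abstract does not state which distribution family the paper uses, and this version has no outlier handling. The function name and the sample values are hypothetical.

```python
def update_depth(mean, var, measurement, measurement_var):
    """Fuse a Gaussian depth prior with one Gaussian depth measurement."""
    total = var + measurement_var
    new_mean = (mean * measurement_var + measurement * var) / total
    new_var = var * measurement_var / total
    return new_mean, new_var

# Per-pixel (mean, variance) state for a hypothetical two-pixel image.
depth = [(2.0, 1.0), (5.0, 4.0)]

# One new frame: per-pixel (depth measurement, noise variance).
frame = [(4.0, 1.0), (5.0, 4.0)]

depth = [update_depth(m, v, z, zv) for (m, v), (z, zv) in zip(depth, frame)]
```

Each pixel is updated independently with a handful of arithmetic operations, which is the kind of structure that makes a throughput of millions of distribution updates per second plausible.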
Abstract:
Doctored images can cause people to believe in and remember experiences that never occurred, yet the underlying mechanism(s) responsible are not well understood. How does compelling false evidence distort autobiographical memory? Subjects were filmed observing and copying a Research Assistant performing simple actions, then they returned 2 days later for a memory test. Before taking the test, subjects viewed video-clips of simple actions, including actions that they neither observed nor performed earlier. We varied the format of the video-clips between-subjects to tap into the source-monitoring mechanisms responsible for the 'doctored-evidence effect.' The distribution of belief and memory distortions across conditions suggests that at least two mechanisms are involved: doctored images create an illusion of familiarity, and also enhance the perceived credibility of false suggestions. These findings offer insight into how external evidence influences source-monitoring. © 2009 Elsevier Inc. All rights reserved.
Abstract:
Transects of a Remotely Operated Vehicle (ROV) providing sea-bed videos and photographs were carried out during POLARSTERN expedition ANT-XIII/3 focussing on the ecology of benthic assemblages on the Antarctic shelf in the South-Eastern Weddell Sea. The ROV-system sprint 103 was equipped with two video cameras and one still camera, lights, flash-lights, compass, and parallel lasers providing a scale in the images, a tether-management system (TMS), a winch, and the board units. All cameras used the same main lens and could be tilted. Videos were recorded in Betacam format and (film) slides were taken at the discretion of the scientific pilot. The slides were taken mainly to improve the identification of organisms depicted in the videos, because the still photographs have a much higher optical resolution than the videos. Species larger than 3 mm are recognisable and countable in the photographs, and species larger than 1 cm in the videos. Under optimum conditions the transects were straight; the speed and direction of the ROV were determined by the drift of the ship in the coastal current, since both the ship and the ROV were used as a drifting system; the option to operate the vehicle actively was only used to avoid obstacles and to reach, at best, a distance of only approximately 30 cm to the sea-floor. As a consequence the width of the photographs in the foreground is approximately 50 cm. Deviations from this strategy resulted mainly from difficult ice and weather conditions but also from high current velocity and local up-welling close to the sea-bed. The sea-bed images provide insights into the general composition of key species, higher systematic groups and ecological guilds. Within interdisciplinary approaches, distributions of assemblages can be attributed to environmental conditions such as bathymetry, sediment characteristics, water masses and current regimes. The images also contain valuable information on how benthic species are associated with each other. 
Along the transects, small- to intermediate-scale disturbances, e.g. by grounding icebergs, were analysed, and the further impact on the entire benthic system through local succession of recolonisation was studied. This information can be used for models predicting the impact of climate change on benthic life in the Southern Ocean. All these approaches contribute to a better understanding of the functioning of the benthic system and related components of the entire Antarctic marine ecosystem. Besides their scientific value, the imaging methods meet concerns about the protection of sensitive Antarctic benthic systems, since they are non-invasive, and they also provide valuable material for education and outreach purposes.
Abstract:
Transects of a Remotely Operated Vehicle (ROV) providing sea-bed videos and photographs were carried out during POLARSTERN expedition ANT-XVII/3 focussing on the ecology of benthic assemblages on the Antarctic shelf in the South-Eastern Weddell Sea. The ROV-system sprint 103 was equipped with two video cameras and one still camera, lights, flash-lights, compass, and parallel lasers providing a scale in the images, a tether-management system (TMS), a winch, and the board units. All cameras used the same main lens and could be tilted. Videos were recorded in Betacam format and (film) slides were taken at the discretion of the scientific pilot. The slides were taken mainly to improve the identification of organisms depicted in the videos, because the still photographs have a much higher optical resolution than the videos. Species larger than 3 mm are recognisable and countable in the photographs, and species larger than 1 cm in the videos. Under optimum conditions the transects were straight; the speed and direction of the ROV were determined by the drift of the ship in the coastal current, since both the ship and the ROV were used as a drifting system; the option to operate the vehicle actively was only used to avoid obstacles and to reach, at best, a distance of only approximately 30 cm to the sea-floor. As a consequence the width of the photographs in the foreground is approximately 50 cm. Deviations from this strategy resulted mainly from difficult ice and weather conditions but also from high current velocity and local up-welling close to the sea-bed. The sea-bed images provide insights into the general composition of key species, higher systematic groups and ecological guilds. Within interdisciplinary approaches, distributions of assemblages can be attributed to environmental conditions such as bathymetry, sediment characteristics, water masses and current regimes. The images also contain valuable information on how benthic species are associated with each other. 
Along the transects, small- to intermediate-scale disturbances, e.g. by grounding icebergs, were analysed, and the further impact on the entire benthic system through local succession of recolonisation was studied. This information can be used for models predicting the impact of climate change on benthic life in the Southern Ocean. All these approaches contribute to a better understanding of the functioning of the benthic system and related components of the entire Antarctic marine ecosystem. Besides their scientific value, the imaging methods meet concerns about the protection of sensitive Antarctic benthic systems, since they are non-invasive, and they also provide valuable material for education and outreach purposes.
Abstract:
This dissertation presents a study and experimental research on asymmetric coding of stereoscopic video. A review of 3D technologies, video formats and coding is first presented, and then particular emphasis is given to asymmetric coding of 3D content and to performance evaluation methods, based on subjective measures, for methods using asymmetric coding. The research objective was defined as an extension of the current concept of asymmetric coding for stereo video. To achieve this objective, the first step consists in defining regions in the spatial dimension of the auxiliary view with different perceptual relevance within the stereo pair, which are identified by a binary mask. These regions are then encoded with better quality (lower quantisation) for the most relevant ones and worse quality (higher quantisation) for those with lower perceptual relevance. The actual estimation of the relevance of a given region is based on a measure of disparity according to the absolute difference between views. To allow encoding of a stereo sequence using this method, a reference H.264/MVC encoder (JM) has been modified to accept additional configuration parameters and inputs. The final encoder is still standard compliant. In order to show the viability of the method, subjective assessment tests were performed over a wide range of objective qualities of the auxiliary view. The results of these tests support three main findings. First, it is shown that the proposed method can be more efficient than traditional asymmetric coding when encoding stereo video at higher qualities/rates. Second, the method can be used to extend the threshold at which uniform asymmetric coding methods start to have an impact on the subjective quality perceived by the observers. Finally, the issue of eye dominance is addressed: results from stereo still images displayed over a short period of time showed that it has little or no impact on the proposed method.
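The region-based scheme in this abstract can be sketched as two steps: build a binary relevance mask from the absolute difference between the two views, then map the mask to two quantisation levels. Everything specific below is an assumption made for illustration: the per-pixel granularity (a real encoder would work on macroblocks), the threshold, the two QP values, the choice that a larger difference means higher relevance, and the function names. None of it reflects the actual modifications made to the JM encoder.

```python
def relevance_mask(main_view, aux_view, threshold):
    """Binary mask: 1 where the views differ by more than `threshold`."""
    return [[1 if abs(a - m) > threshold else 0 for m, a in zip(main_row, aux_row)]
            for main_row, aux_row in zip(main_view, aux_view)]

def qp_map(mask, qp_relevant=26, qp_other=38):
    """Lower QP (finer quantisation) for relevant regions, higher QP elsewhere."""
    return [[qp_relevant if bit else qp_other for bit in row] for row in mask]

# Hypothetical 2x2 luma samples from the main and auxiliary views.
main_view = [[10, 10], [10, 10]]
aux_view = [[10, 30], [12, 10]]

mask = relevance_mask(main_view, aux_view, threshold=5)
qp = qp_map(mask)
```

Uniform asymmetric coding corresponds to the special case where the whole auxiliary view gets a single, higher QP; the mask lets that penalty be relaxed only where it is assumed to matter perceptually.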