927 resultados para 3D object recognition


Relevância:

80.00% 80.00%

Publicador:

Resumo:

A novel method that combines shape-based object recognition and image segmentation is proposed for shape retrieval from images. Given a shape prior represented in a multi-scale curvature form, the proposed method identifies the target objects in images by grouping oversegmented image regions. The problem is formulated in a unified probabilistic framework and solved by a stochastic Markov Chain Monte Carlo (MCMC) mechanism. By this means, object segmentation and recognition are accomplished simultaneously. Within each sampling move during the simulation process,probabilistic region grouping operations are influenced by both the image information and the shape similarity constraint. The latter constraint is measured by a partial shape matching process. A generalized parallel algorithm by Barbu and Zhu,combined with a large sampling jump and other implementation improvements, greatly speeds up the overall stochastic process. The proposed method supports the segmentation and recognition of multiple occluded objects in images. Experimental results are provided for both synthetic and real images.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

CONFIGR (CONtour FIgure GRound) is a computational model based on principles of biological vision that completes sparse and noisy image figures. Within an integrated vision/recognition system, CONFIGR posits an initial recognition stage which identifies figure pixels from spatially local input information. The resulting, and typically incomplete, figure is fed back to the “early vision” stage for long-range completion via filling-in. The reconstructed image is then re-presented to the recognition system for global functions such as object recognition. In the CONFIGR algorithm, the smallest independent image unit is the visible pixel, whose size defines a computational spatial scale. Once pixel size is fixed, the entire algorithm is fully determined, with no additional parameter choices. Multi-scale simulations illustrate the vision/recognition system. Open-source CONFIGR code is available online, but all examples can be derived analytically, and the design principles applied at each step are transparent. The model balances filling-in as figure against complementary filling-in as ground, which blocks spurious figure completions. Lobe computations occur on a subpixel spatial scale. Originally designed to fill-in missing contours in an incomplete image such as a dashed line, the same CONFIGR system connects and segments sparse dots, and unifies occluded objects from pieces locally identified as figure in the initial recognition stage. The model self-scales its completion distances, filling-in across gaps of any length, where unimpeded, while limiting connections among dense image-figure pixel groups that already have intrinsic form. Long-range image completion promises to play an important role in adaptive processors that reconstruct images from highly compressed video and still camera images.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

This paper introduces ART-EMAP, a neural architecture that uses spatial and temporal evidence accumulation to extend the capabilities of fuzzy ARTMAP. ART-EMAP combines supervised and unsupervised learning and a medium-term memory process to accomplish stable pattern category recognition in a noisy input environment. The ART-EMAP system features (i) distributed pattern registration at a view category field; (ii) a decision criterion for mapping between view and object categories which can delay categorization of ambiguous objects and trigger an evidence accumulation process when faced with a low confidence prediction; (iii) a process that accumulates evidence at a medium-term memory (MTM) field; and (iv) an unsupervised learning algorithm to fine-tune performance after a limited initial period of supervised network training. ART-EMAP dynamics are illustrated with a benchmark simulation example. Applications include 3-D object recognition from a series of ambiguous 2-D views.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

A neural theory is proposed in which visual search is accomplished by perceptual grouping and segregation, which occurs simultaneous across the visual field, and object recognition, which is restricted to a selected region of the field. The theory offers an alternative hypothesis to recently developed variations on Feature Integration Theory (Treisman, and Sato, 1991) and Guided Search Model (Wolfe, Cave, and Franzel, 1989). A neural architecture and search algorithm is specified that quantitatively explains a wide range of psychophysical search data (Wolfe, Cave, and Franzel, 1989; Cohen, and lvry, 1991; Mordkoff, Yantis, and Egeth, 1990; Treisman, and Sato, 1991).

Relevância:

80.00% 80.00%

Publicador:

Resumo:

A neural model is proposed of how laminar interactions in the visual cortex may learn and recognize object texture and form boundaries. The model brings together five interacting processes: region-based texture classification, contour-based boundary grouping, surface filling-in, spatial attention, and object attention. The model shows how form boundaries can determine regions in which surface filling-in occurs; how surface filling-in interacts with spatial attention to generate a form-fitting distribution of spatial attention, or attentional shroud; how the strongest shroud can inhibit weaker shrouds; and how the winning shroud regulates learning of texture categories, and thus the allocation of object attention. The model can discriminate abutted textures with blurred boundaries and is sensitive to texture boundary attributes like discontinuities in orientation and texture flow curvature as well as to relative orientations of texture elements. The model quantitatively fits a large set of human psychophysical data on orientation-based textures. Object boundar output of the model is compared to computer vision algorithms using a set of human segmented photographic images. The model classifies textures and suppresses noise using a multiple scale oriented filterbank and a distributed Adaptive Resonance Theory (dART) classifier. The matched signal between the bottom-up texture inputs and top-down learned texture categories is utilized by oriented competitive and cooperative grouping processes to generate texture boundaries that control surface filling-in and spatial attention. Topdown modulatory attentional feedback from boundary and surface representations to early filtering stages results in enhanced texture boundaries and more efficient learning of texture within attended surface regions. Surface-based attention also provides a self-supervising training signal for learning new textures. Importance of the surface-based attentional feedback in texture learning and classification is tested using a set of textured images from the Brodatz micro-texture album. Benchmark studies vary from 95.1% to 98.6% with attention, and from 90.6% to 93.2% without attention.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

A neural network model of synchronized oscillations in visual cortex is presented to account for recent neurophysiological findings that such synchronization may reflect global properties of the stimulus. In these experiments, synchronization of oscillatory firing responses to moving bar stimuli occurred not only for nearby neurons, but also occurred between neurons separated by several cortical columns (several mm of cortex) when these neurons shared some receptive field preferences specific to the stimuli. These results were obtained for single bar stimuli and also across two disconnected, but colinear, bars moving in the same direction. Our model and computer simulations obtain these synchrony results across both single and double bar stimuli using different, but formally related, models of preattentive visual boundary segmentation and attentive visual object recognition, as well as nearest-neighbor and randomly coupled models.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

A neural network model of synchronized oscillator activity in visual cortex is presented in order to account for recent neurophysiological findings that such synchronization may reflect global properties of the stimulus. In these recent experiments, it was reported that synchronization of oscillatory firing responses to moving bar stimuli occurred not only for nearby neurons, but also occurred between neurons separated by several cortical columns (several mm of cortex) when these neurons shared some receptive field preferences specific to the stimuli. These results were obtained not only for single bar stimuli but also across two disconnected, but colinear, bars moving in the same direction. Our model and computer simulations obtain these synchrony results across both single and double bar stimuli. For the double bar case, synchronous oscillations are induced in the region between the bars, but no oscillations are induced in the regions beyond the stimuli. These results were achieved with cellular units that exhibit limit cycle oscillations for a robust range of input values, but which approach an equilibrium state when undriven. Single and double bar synchronization of these oscillators was achieved by different, but formally related, models of preattentive visual boundary segmentation and attentive visual object recognition, as well as nearest-neighbor and randomly coupled models. In preattentive visual segmentation, synchronous oscillations may reflect the binding of local feature detectors into a globally coherent grouping. In object recognition, synchronous oscillations may occur during an attentive resonant state that triggers new learning. These modelling results support earlier theoretical predictions of synchronous visual cortical oscillations and demonstrate the robustness of the mechanisms capable of generating synchrony.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

A neural network theory of :3-D vision, called FACADE Theory, is described. The theory proposes a solution of the classical figure-ground problem for biological vision. It does so by suggesting how boundary representations and surface representations are formed within a Boundary Contour System (BCS) and a Feature Contour System (FCS). The BCS and FCS interact reciprocally to form 3-D boundary and surface representations that arc mutually consistent. Their interactions generate 3-D percepts wherein occluding and occluded object completed, and grouped. The theory clarifies how preattentive processes of 3-D perception and figure-ground separation interact reciprocally with attentive processes of spatial localization, object recognition, and visual search. A new theory of stereopsis is proposed that predicts how cells sensitive to multiple spatial frequencies, disparities, and orientations are combined by context-sensitive filtering, competition, and cooperation to form coherent BCS boundary segmentations. Several factors contribute to figure-ground pop-out, including: boundary contrast between spatially contiguous boundaries, whether due to scenic differences in luminance, color, spatial frequency, or disparity; partially ordered interactions from larger spatial scales and disparities to smaller scales and disparities; and surface filling-in restricted to regions surrounded by a connected boundary. Phenomena such as 3-D pop-out from a 2-D picture, DaVinci stereopsis, a 3-D neon color spreading, completion of partially occluded objects, and figure-ground reversals are analysed. The BCS and FCS sub-systems model aspects of how the two parvocellular cortical processing streams that join the Lateral Geniculate Nucleus to prestriate cortical area V4 interact to generate a multiplexed representation of Form-And-Color-And-Depth, or FACADE, within area V4. Area V4 is suggested to support figure-ground separation and to interact. with cortical mechanisms of spatial attention, attentive objcect learning, and visual search. Adaptive Resonance Theory (ART) mechanisms model aspects of how prestriate visual cortex interacts reciprocally with a visual object recognition system in inferotemporal cortex (IT) for purposes of attentive object learning and categorization. Object attention mechanisms of the What cortical processing stream through IT cortex are distinguished from spatial attention mechanisms of the Where cortical processing stream through parietal cortex. Parvocellular BCS and FCS signals interact with the model What stream. Parvocellular FCS and magnocellular Motion BCS signals interact with the model Where stream. Reciprocal interactions between these visual, What, and Where mechanisms arc used to discuss data about visual search and saccadic eye movements, including fast search of conjunctive targets, search of 3-D surfaces, selective search of like-colored targets, attentive tracking of multi-element groupings, and recursive search of simultaneously presented targets.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

We describe an active millimeter-wave holographic imaging system that uses compressive measurements for three-dimensional (3D) tomographic object estimation. Our system records a two-dimensional (2D) digitized Gabor hologram by translating a single pixel incoherent receiver. Two approaches for compressive measurement are undertaken: nonlinear inversion of a 2D Gabor hologram for 3D object estimation and nonlinear inversion of a randomly subsampled Gabor hologram for 3D object estimation. The object estimation algorithm minimizes a convex quadratic problem using total variation (TV) regularization for 3D object estimation. We compare object reconstructions using linear backpropagation and TV minimization, and we present simulated and experimental reconstructions from both compressive measurement strategies. In contrast with backpropagation, which estimates the 3D electromagnetic field, TV minimization estimates the 3D object that produces the field. Despite undersampling, range resolution is consistent with the extent of the 3D object band volume.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

A Concise Intro to Image Processing using C++ presents state-of-the-art image processing methodology, including current industrial practices for image compression, image de-noising methods based on partial differential equations, and new image compression methods such as fractal image compression and wavelet compression. It includes elementary concepts of image processing and related fundamental tools with coding examples as well as exercises. With a particular emphasis on illustrating fractal and wavelet compression algorithms, the text covers image segmentation, object recognition, and morphology. An accompanying CD-ROM contains code for all algorithms.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

A new high performance, programmable image processing chip targeted at video and HDTV applications is described. This was initially developed for image small object recognition but has much broader functional application including 1D and 2D FIR filtering as well as neural network computation. The core of the circuit is made up of an array of twenty one multiplication-accumulation cells based on systolic architecture. Devices can be cascaded to increase the order of the filter both vertically and horizontally. The chip has been fabricated in a 0.6 µ, low power CMOS technology and operates on 10 bit input data at over 54 Megasamples per second. The introduction gives some background to the chip design and highlights that there are few other comparable devices. Section 2 gives a brief introduction to small object detection. The chip architecture and the chip design will be described in detail in the later sections.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

β-amyloid1-42 (Aβ1-42) is a major endogenous pathogen underlying the aetiology of Alzheimer's disease (AD). Recent evidence indicates that soluble Aβ oligomers, rather than plaques, are the major cause of synaptic dysfunction and neurodegeneration. Small molecules that suppress Aβ aggregation, reduce oligomer stability or promote off-pathway non-toxic oligomerization represent a promising alternative strategy for neuroprotection in AD. MRZ-99030 was recently identified as a dipeptide that modulates Aβ1-42 aggregation by triggering a non-amyloidogenic aggregation pathway, thereby reducing the amount of intermediate toxic soluble oligomeric Aβ species. The present study evaluated the relevance of these promising results with MRZ-99030 under pathophysiological conditions i.e. against the synaptotoxic effects of Aβ oligomers on hippocampal long term potentiation (LTP) and two different memory tasks. Aβ1-42 interferes with the glutamatergic system and with neuronal Ca2+ signalling and abolishes the induction of LTP. Here we demonstrate that MRZ-99030 (100–500 nM) at a 10:1 stoichiometric excess to Aβ clearly reversed the synaptotoxic effects of Aβ1-42 oligomers on CA1-LTP in murine hippocampal slices. Co-application of MRZ-99030 also prevented the two-fold increase in resting Ca2+ levels in pyramidal neuron dendrites and spines triggered by Aβ1-42 oligomers. In anaesthetized rats, pre-administration of MRZ-99030 (50 mg/kg s.c.) protected against deficits in hippocampal LTP following i.c.v. injection of oligomeric Aβ1-42. Furthermore, similar treatment significantly ameliorated cognitive deficits in an object recognition task and under an alternating lever cyclic ratio schedule after the i.c.v. application of Aβ1-42 and 7PA2 conditioned medium, respectively. Altogether, these results demonstrate the potential therapeutic benefit of MRZ-99030 in AD.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Actualmente, a formação inicial de professores do 1º ciclo tem-se centrado na flexibilidade dos processos de trabalho, nas vertentes científica e técnica e no desenvolvimento de competências (Comissão Europeia, 2001), colocando-se ainda, no entanto, a tónica no conhecimento científico. O professor deve ser capaz de se adaptar aos diferentes contextos e funções a desempenhar e de resolver situações de grande imprevisibilidade e de grande indefinição. Será que a formação inicial de professores os prepara para um futuro próximo? Que futuro? “Tentarmos descrever o futuro, a partir do agora, significa que o que fizermos hoje será criticamente importante”, porque no futuro a formação inicial de professores será construída a partir do conhecimento básico, das ideias abstractas e das descobertas cientificas que fizermos hoje. “A base do modo como hoje, no século XXI, se formam professores está no que foi descoberto e legado nos anos 60, 70, 80, 90 do século XX”. Que fazemos hoje, agora mesmo, para contribuir para esse legado? Estamos convictos que muito de nada ou muito de pouco. O que alterar? Há quem pense que os professores do 1º ciclo não são analíticos. Talvez intuitivos, mas analíticos não. Ao aceitarmos esta dicotomia estamos a “atrapalhar” o futuro. Não somos apenas analíticos. Não somos apenas intuitivos. Na prática quotidiana, nas salas de aula, não usamos apenas as ferramentas diárias, usamos também a intuição e a análise. Uma análise baseada na teoria, enquanto manifestação do nosso esforço de expressar e partilhar, ou entender a nossa experiência, para influenciar o que nos é externo. Como “ensina” a formação inicial os futuros professores a trabalhar com os outros, para os outros? Todos temos um passado, um presente e um futuro em que nos formamos e que partilhamos uns com os outros, seja pela prática, seja pela teoria. Conceitos teóricos e práticos, como identidade, profissão, socialização profissional, práticas pedagógicas, formação inicial, instituição de formação, supervisão, relações pessoais e institucionais, representações sociais são conceitos construídos individual e socialmente, sempre em relação com os outros. Que percepção têm os professores cooperantes, detentores de uma turma de crianças do 1º ciclo, que “emprestam” aos futuros professores para desenvolverem a prática pedagógica da sua formação inicial, desta formação dada na instituição de formação? Foi o que pretendemos indagar com o presente trabalho. Do ponto de vista metodológico, o estudo foi desenvolvido segundo uma metodologia de natureza qualitativa, quantitativa e interpretativa que cruzou a informação recolhida através de diferentes instrumentos de recolha de dados, como as evocações livres e hierarquizadas, em contexto normal, e evocações hierarquizadas em contexto de substituição, um Teste de Reconhecimento do Objecto e um Questionário de Caracterização do Objecto. O tratamento dos dados foi feito com os programas SPSS, Excel, EVOC 2003 e SIMI. Participaram neste estudo 93 professores cooperantes. As conclusões mais genéricas apontam no sentido de confirmar os pressupostos adiantados no enquadramento teórico, quanto à hipótese da existência de um núcleo central e um sistema periférico, numa abordagem estruturalista das representações sociais, que parecem influenciar o modo como é percepcionada a formação inicial, pelos professores cooperantes.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Tese de dout., Engenharia Electrónica e de Computadores, Faculdade de Ciência e Tecnologia, Universidade do Algarve, 2007

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The goal of the project "SmartVision: active vision for the blind" is to develop a small and portable but intelligent and reliable system for assisting the blind and visually impaired while navigating autonomously, both outdoor and indoor. In this paper we present an overview of the prototype, design issues, and its different modules which integrate a GIS with GPS, Wi-Fi, RFID tags and computer vision. The prototype addresses global navigation by following known landmarks, local navigation with path tracking and obstacle avoidance, and object recognition. The system does not replace the white cane, but extends it beyond its reach. The user-friendly interface consists of a 4-button hand-held box, a vibration actuator in the handle of the cane, and speech synthesis. A future version may also employ active RFID tags for marking navigation landmarks, and speech recognition may complement speech synthesis.