Digital Library

991 results for 280208 Computer Vision

Visual feedback control of a robot in an unknown environment (learning control using neural networks)

Relevance:

80.00%

Publisher:

Abstract:

In this paper, a visual feedback control approach based on neural networks is presented for a robot with a camera installed on its end-effector to trace an object in an unknown environment. First, the one-to-one mapping relations between the image feature domain of the object and the joint angle domain of the robot are derived. Second, a method is proposed to generate a desired trajectory of the robot by measuring the image feature parameters of the object. Third, a multilayer neural network is used for off-line learning of the mapping relations so as to produce on-line the reference inputs for the robot. Fourth, a learning controller based on a multilayer neural network is designed for realizing the visual feedback control of the robot. Last, the effectiveness of the present approach is verified by tracing a curved line using a 6-degrees-of-freedom robot with a CCD camera installed on its end-effector. The present approach requires neither the tedious calibration of the CCD camera nor complicated coordinate transformations.

Automatic age estimation based on facial aging patterns

Relevance:

80.00%

Publisher:

Abstract:

While recognition of most facial variations, such as identity, expression, and gender, has been extensively studied, automatic age estimation has rarely been explored. In contrast to other facial variations, aging variation presents several unique characteristics which make age estimation a challenging task. This paper proposes an automatic age estimation method named AGES (AGing pattErn Subspace). The basic idea is to model the aging pattern, which is defined as the sequence of a particular individual's face images sorted in time order, by constructing a representative subspace. The proper aging pattern for a previously unseen face image is determined by the projection in the subspace that can reconstruct the face image with minimum reconstruction error, while the position of the face image in that aging pattern will then indicate its age. In the experiments, AGES and its variants are compared with the limited existing age estimation methods (WAS and AAS) and some well-established classification methods (kNN, BP, C4.5, and SVM). Moreover, a comparison with human perception ability on age is conducted. It is interesting to note that the performance of AGES is not only significantly better than that of all the other algorithms, but also comparable to that of the human observers.

Simultaneous estimation of optical flow and object state: A modified approach to optical flow calculation

Relevance:

80.00%

Publisher:

Abstract:

Differential optical flow methods are widely used within the computer vision community. They are classified as being either local, as in the Lucas-Kanade method, or global, as in the Horn-Schunck technique. Because the physical dynamics of an object are inherently coupled to the behavior of its image in the video stream, in this paper we use such dynamic parameter information in calculating optical flow when tracking a moving object in a video stream. Specifically, we use a modified error function in the minimization that contains physical parameter information. Further, the refined estimates of optical flow are used for better estimation of the physical parameters of the object in the simultaneous estimation of optical flow and object state (SEOS).
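As a point of reference for the local (Lucas-Kanade) class of methods named in this abstract, the following is a minimal sketch of the standard, unmodified least-squares step for a single window. It is not the SEOS method: the modified error function with physical parameter information is not given in the abstract and is not reproduced here. The function name `lucas_kanade_window` and its inputs are assumptions made for illustration.

```python
def lucas_kanade_window(ix, iy, it):
    """Estimate one (u, v) flow vector for a window from its gradients.

    ix, iy, it are equal-length sequences holding the horizontal, vertical
    and temporal image derivatives, one value per pixel of the window.
    (Hypothetical helper; plain Lucas-Kanade, not the SEOS variant.)
    """
    # Normal equations of the least-squares problem  ix*u + iy*v = -it
    a11 = sum(x * x for x in ix)
    a12 = sum(x * y for x, y in zip(ix, iy))
    a22 = sum(y * y for y in iy)
    b1 = -sum(x * t for x, t in zip(ix, it))
    b2 = -sum(y * t for y, t in zip(iy, it))

    # A singular 2x2 system means the window has no usable texture
    # in one direction (the aperture problem).
    det = a11 * a22 - a12 * a12
    if abs(det) < 1e-12:
        raise ValueError("gradient matrix is singular for this window")

    # Closed-form inverse of the symmetric 2x2 matrix
    u = (a22 * b1 - a12 * b2) / det
    v = (a11 * b2 - a12 * b1) / det
    return u, v
```

Given gradients that are exactly consistent with a single motion, the function recovers that motion; with real image data the result is the least-squares best fit over the window.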

Distance-driven fusion of gait and face for human identification in video

Relevance:

80.00%

Publisher:

Abstract:

Gait and face are two important biometrics for human identification. The complementary properties of these two biometrics suggest fusing them. The relationship between gait and face in the fusion is affected by the subject-to-camera distance. On the one hand, gait is a suitable biometric trait for human recognition at a distance. On the other hand, face recognition is more reliable when the subject is close to the camera. This paper proposes an adaptive fusion method called distance-driven fusion to combine gait and face for human identification in video. Rather than using predefined fixed fusion rules, distance-driven fusion dynamically adjusts its rule according to the subject-to-camera distance in real time. Experimental results show that distance-driven fusion performs better not only than either single biometric but also than the conventional static fusion rules, including MEAN, PRODUCT, MIN, and MAX.
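The contrast between static rules and a distance-dependent rule can be sketched as follows. The four static rules are the ones named in the abstract; the linear weight schedule in `distance_driven_fusion`, together with the `near_m` and `far_m` thresholds, is purely an assumption for illustration, since the abstract does not state how the rule is adjusted.

```python
def static_fusion(gait_score, face_score, rule):
    """Combine two match scores with one of the fixed rules."""
    rules = {
        "MEAN": lambda g, f: (g + f) / 2.0,
        "PRODUCT": lambda g, f: g * f,
        "MIN": min,
        "MAX": max,
    }
    return rules[rule](gait_score, face_score)


def distance_driven_fusion(gait_score, face_score, distance_m,
                           near_m=2.0, far_m=10.0):
    """Weight gait more heavily as the subject moves away from the camera.

    Assumed schedule (not from the paper): the gait weight rises linearly
    from 0 at near_m to 1 at far_m and is clamped outside that range.
    """
    t = (distance_m - near_m) / (far_m - near_m)
    w_gait = min(1.0, max(0.0, t))
    return w_gait * gait_score + (1.0 - w_gait) * face_score
```

With this schedule the fused score equals the face score for a nearby subject and the gait score for a distant one, which mirrors the qualitative behaviour the abstract describes.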

Individual discriminative subspace for face recognition under uncontrolled conditions

Relevance:

80.00%

Publisher:

Abstract:

Most face recognition (FR) algorithms require the face images to satisfy certain restrictions on aspects such as view angle, illumination, and occlusion. What is generally needed, however, are techniques that can recognize any face image recognizable by human beings. This paper provides one potential solution to this problem. A method named Individual Discriminative Subspace (IDS) is proposed for robust face recognition under uncontrolled conditions. An IDS is a subspace in which only the images of one particular person converge around the origin, while those of others scatter. Each IDS can be used to distinguish one individual from others. There is no restriction on the face images fed into the algorithm, which makes it practical for real-life applications. In the experiments, IDS is tested on two large face databases with extensive variations and performs significantly better than 12 existing FR techniques.

On the simultaneous estimation of optical flow and object state

Relevance:

80.00%

Publisher:

Abstract:

This thesis focuses on the enhancement of differential optical flow techniques. The framework of differential optical flow has been built upon to improve object motion estimation within a video stream or image sequence. This augmentation comes in the form of a combined optical flow and object state estimation method (SEOS).

Data-driven indexing mechanism for the recognition of polyhedral objects

Relevance:

80.00%

Publisher:

Wavelets/multiwavelets bases and correspondence estimation problem : an analytic study

Relevance:

80.00%

Publisher:

Abstract:

Correspondence estimation is one of the most active research areas in computer vision, and a number of techniques have been proposed, each with its own advantages and shortcomings. Among the techniques reported, stereo correspondence estimation based on multiresolution analysis has attracted considerable research attention in recent years. Although wavelet and multiwavelet bases are the most widely employed medium for multiresolution analysis, relatively little work has been reported in this context. In this work we address some of the issues in the existing work in this domain and its inherited shortcomings. In light of these shortcomings, we propose a new technique to overcome some of the flaws that could significantly impact algorithm performance and have not been addressed in earlier proposals. The proposed algorithm uses multiresolution analysis reinforced with wavelet/multiwavelet transform modulus maxima to establish correspondences between the stereo pair of images. A variety of wavelet and multiwavelet bases, possessing distinct properties such as orthogonality, approximation order, short support, and shape, are employed to analyse their effect on the performance of correspondence estimation. The aim is to provide a knowledge base for understanding and establishing relationships between the properties of wavelets and multiwavelets and their effect on the quality of stereo correspondence estimation.

Human action recognition based on 3D SIFT and LDA model

Relevance:

80.00%

Publisher:

Abstract:

Recognizing human action efficiently and effectively from videos captured by modern cameras is a challenge in real applications. Traditional methods, which rely on professional analysts, are facing a bottleneck because of their shortcomings. To cope with this disadvantage, methods based on computer vision techniques, requiring little or no human intervention, have been proposed to analyse human actions in videos automatically. This paper provides a method combining the three-dimensional Scale Invariant Feature Transform (SIFT) detector and the Latent Dirichlet Allocation (LDA) model for human motion analysis. To represent videos effectively and robustly, we extract the 3D SIFT descriptor around each interest point, which is sampled densely from 3D space-time video volumes. After obtaining the representation of each video frame, the LDA model is adopted to discover the underlying structure, that is, the categorization of human actions in the collection of videos. Publicly available standard datasets are used to test our method. The concluding part discusses the research challenges and future directions.

Human action recognition based on pyramid histogram of oriented gradients

Relevance:

80.00%

Publisher:

Abstract:

Human action recognition has attracted considerable interest from computer vision researchers because of its many promising applications. In this paper, we employ the Pyramid Histogram of Oriented Gradients (PHOG) to characterize human figures for action recognition. Compared with silhouette-based features, the PHOG descriptor does not require extraction of human silhouettes or contours. Two state-space models, i.e., the Hidden Markov Model (HMM) and the Conditional Random Field (CRF), are adopted to model the dynamic human movement. The proposed PHOG descriptor and the state-space models are tested with different parameter settings on a standard dataset. We also test the robustness of the method under various unconstrained conditions and viewpoints. Promising experimental results demonstrate the effectiveness and robustness of our proposed method.

Polar transformation system for offline handwritten character recognition

Relevance:

80.00%

Publisher:

Abstract:

Offline handwriting recognition is an important automated process in the fields of pattern recognition and computer vision. This paper presents a polar coordinate-based handwriting recognition system that uses Support Vector Machine (SVM) classification to achieve high recognition performance. We compare and evaluate zoning feature extraction methods applied in the polar system. The SVM was trained and tested on a set of 650 handwritten character images. All the input images are segmented (isolated) handwritten characters. Compared with a Cartesian-based handwriting recognition system, the recognition rate is more stable and improves, reaching up to 86.63%.
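One way zoning features might be computed in a polar system is sketched below. The abstract does not describe the zoning layout, so the division into concentric rings and angular sectors, the function name `polar_zoning_features`, and the defaults `n_rings` and `n_sectors` are all assumptions. The resulting vector is the kind of input that could then be passed to an SVM classifier.

```python
import math


def polar_zoning_features(image, n_rings=3, n_sectors=4):
    """Fraction of foreground pixels falling in each polar zone.

    image is a list of rows of 0/1 values (1 = ink). Zones are indexed as
    ring * n_sectors + sector, measured from the image centre.
    (Hypothetical layout; the paper's zoning scheme is not specified.)
    """
    height, width = len(image), len(image[0])
    cy, cx = (height - 1) / 2.0, (width - 1) / 2.0
    # Largest possible radius: centre to a corner (guard against 1x1 images)
    max_r = math.hypot(cx, cy) or 1.0

    counts = [0] * (n_rings * n_sectors)
    total = 0
    for y, row in enumerate(image):
        for x, pixel in enumerate(row):
            if not pixel:
                continue
            # Cartesian pixel position -> polar coordinates about the centre
            r = math.hypot(x - cx, y - cy)
            theta = math.atan2(y - cy, x - cx) % (2.0 * math.pi)
            # Clamp so that the outermost radius and angle stay in range
            ring = min(int(r / max_r * n_rings), n_rings - 1)
            sector = min(int(theta / (2.0 * math.pi) * n_sectors),
                         n_sectors - 1)
            counts[ring * n_sectors + sector] += 1
            total += 1

    if total == 0:
        return [0.0] * len(counts)
    return [c / total for c in counts]
```

Normalising by the total ink count makes the features independent of stroke thickness to a first approximation; other normalisations would be equally plausible.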

An investigation into the use of physical modeling for the prediction of various feature types visible from different viewpoints

Relevance:

80.00%

Publisher:

Combining multiple tracking algorithms for improved general performance

Relevance:

80.00%

Publisher:

Abstract:

Automated tracking of objects through a sequence of images has remained one of the difficult problems in computer vision. Numerous algorithms and techniques have been proposed for this task. Some algorithms perform well in restricted environments, such as tracking using stationary cameras, but a general solution is not currently available. A frequent problem is that when an algorithm is refined for one application, it becomes unsuitable for other applications. This paper proposes a general tracking system based on a different approach. Rather than refine one algorithm for a specific tracking task, two tracking algorithms are employed, and used to correct each other during the tracking task. By choosing the two algorithms such that they have complementary failure modes, a robust algorithm is created without increased specialisation.

Modeling edges at subpixel accuracy using the local energy approach

Relevance:

80.00%

Publisher:

Abstract:

In this paper we describe a new technique for 1-D and 2-D edge feature extraction to subpixel accuracy using edge models and the local energy approach. A candidate edge is modeled as one of a number of parametric edge models, and the fit is refined by a least-squares error fitting technique.

Low-cost interactive active monocular range finder

Relevance:

80.00%

Publisher:

Abstract:

This paper describes a low-cost interactive active monocular range finder and illustrates the effect of introducing interactivity to the range acquisition process. The range finder consists of only one camera and a laser pointer, to which three LEDs are attached. When a user scans the laser along surfaces of objects, the camera captures the image of spots (one from the laser, and the others from LEDs), and triangulation is carried out using the camera's viewing direction and the optical axis of the laser. The user interaction allows the range finder to acquire range data in which the sampling rate varies across the object depending on the underlying surface structures. Moreover, the processes of separating objects from the background and/or finding parts in the object can be achieved using the operator's knowledge of the objects.
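The triangulation step can be illustrated with a generic two-ray computation. This is a sketch under the assumption that both rays are already known in a common coordinate frame: the camera's viewing ray through the laser spot, and the laser's optical axis. How the three LEDs are used to recover the laser pose is not described in the abstract and is not modelled here. The function name `triangulate` is hypothetical.

```python
def _dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def triangulate(cam_origin, cam_dir, laser_origin, laser_dir):
    """Return the 3D point closest to both rays.

    Each ray is origin + s * direction. With noisy measurements the rays
    rarely intersect exactly, so the midpoint of their common
    perpendicular is returned.
    """
    w = [c - l for c, l in zip(cam_origin, laser_origin)]
    a = _dot(cam_dir, cam_dir)
    b = _dot(cam_dir, laser_dir)
    c = _dot(laser_dir, laser_dir)
    d = _dot(cam_dir, w)
    e = _dot(laser_dir, w)

    denom = a * c - b * b
    if abs(denom) < 1e-12:
        raise ValueError("rays are parallel; depth cannot be triangulated")

    # Parameters of the closest points on the camera and laser rays
    s = (b * e - c * d) / denom
    t = (a * e - b * d) / denom
    p_cam = [o + s * k for o, k in zip(cam_origin, cam_dir)]
    p_laser = [o + t * k for o, k in zip(laser_origin, laser_dir)]
    return [(p + q) / 2.0 for p, q in zip(p_cam, p_laser)]
```

For example, a camera at the origin looking along the z-axis and a laser at (1, 0, 0) aimed along (-1, 0, 5) meet at the surface point (0, 0, 5).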
