Digital Library

289 results for Awards, Recognition

Truck-face recognition using Semantic Texton Forests

Improving Pothole Recognition through Vision Tracking for Automated Pavement Assessment

Abstract:

Pavement condition assessment is essential when developing road network maintenance programs. In practice, pavement sensing is largely automated for highway networks. Municipal roads, however, are predominantly surveyed manually because of the limited number of expensive inspection vehicles. As part of a research project that proposes an omnipresent passenger vehicle network for comprehensive and cheap condition surveying of municipal road networks, this paper deals with pothole recognition. Existing methods either rely on expensive and high-maintenance range sensors or make use of acceleration data, which can only provide preliminary and rough condition surveys. In our previous work we created a pothole detection method for pavement images. In this paper we present an improved recognition method for pavement videos that incrementally updates the texture signature for intact pavement regions and uses vision tracking to track detected potholes. The method is tested and results demonstrate its reasonable efficiency.

Pothole Properties Measurement through Visual 2D Recognition and 3D Reconstruction

Computer Vision and Pattern Recognition Technologies for Construction

Abstract:

This book will be of particular interest to academics, researchers, and graduate students at universities and industrial practitioners seeking to apply mobile and pervasive computing systems to improve construction industry productivity.

Concrete Columns Recognition for Real Time Concrete Damage Visual Assessment

Machine Vision-Based Concrete Column Recognition and Crack Properties Retrieval

Distinguishing Object Category Properties and Property Ranges in the IFC Standard for Visual Pattern Recognition

Speaker and Noise Factorization for Robust Speech Recognition

Achieving robust face recognition from video by combining a weak photometric model and a learnt generic face invariant

Abstract:

In spite of over two decades of intense research, illumination and pose invariance remain prohibitively challenging aspects of face recognition for most practical applications. The objective of this work is to recognize faces using video sequences both for training and recognition input, in a realistic, unconstrained setup in which lighting, pose and user motion pattern have a wide variability and face images are of low resolution. The central contribution is an illumination invariant, which we show to be suitable for recognition from video of loosely constrained head motion. In particular, there are three contributions: (i) we show how a photometric model of image formation can be combined with a statistical model of generic face appearance variation to exploit the proposed invariant and generalize in the presence of extreme illumination changes; (ii) we introduce a video sequence re-illumination algorithm to achieve fine alignment of two video sequences; and (iii) we use the smoothness of geodesically local appearance manifold structure and a robust same-identity likelihood to achieve robustness to unseen head poses. We describe a fully automatic recognition system based on the proposed method and an extensive evaluation on 323 individuals and 1474 video sequences with extreme illumination, pose and head motion variation. Our system consistently achieved a nearly perfect recognition rate (over 99.7% on all four databases). © 2012 Elsevier Ltd. All rights reserved.

Scale-invariant vote-based 3D recognition and registration from point clouds

Abstract:

This chapter presents a method for vote-based 3D shape recognition and registration, in particular using mean shift on 3D pose votes in the space of direct similarity transformations for the first time. We introduce a new distance between poses in this space, the SRT distance. It is left-invariant, unlike Euclidean distance, and has a unique, closed-form mean, in contrast to Riemannian distance, so it is fast to compute. We demonstrate improved performance over the state of the art in both recognition and registration on a (real and) challenging dataset, by comparing our distance with others in a mean shift framework, as well as with the commonly used Hough voting approach. © 2013 Springer-Verlag Berlin Heidelberg.

Syllable language models for Mandarin speech recognition: exploiting character language models.

Abstract:

Mandarin Chinese is based on characters which are syllabic in nature and morphological in meaning. All spoken languages have syllabiotactic rules which govern the construction of syllables and their allowed sequences. These constraints are not as restrictive as those learned from word sequences, but they can provide additional useful linguistic information. Hence, it is possible to improve speech recognition performance by appropriately combining these two types of constraints. For the Chinese language considered in this paper, character level language models (LMs) can be used as a first level approximation to allowed syllable sequences. To test this idea, word and character level n-gram LMs were trained on 2.8 billion words (equivalent to 4.3 billion characters) of texts from a wide collection of text sources. Both hypothesis and model based combination techniques were investigated to combine word and character level LMs. Significant character error rate reductions of up to 7.3% relative were obtained on a state-of-the-art Mandarin Chinese broadcast audio recognition task using an adapted history dependent multi-level LM that performs a log-linear combination of character and word level LMs. This supports the hypothesis that character or syllable sequence models are useful for improving Mandarin speech recognition performance.
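As a rough illustration of the log-linear combination mentioned in the abstract, the sketch below reranks recognition hypotheses by a weighted sum of word-level and character-level LM log-probabilities. The fixed weights, the function names (`log_linear_score`, `rerank`) and the toy probabilities are assumptions made for illustration only; the history dependent multi-level LM described in the abstract is not reproduced here.

```python
import math


def log_linear_score(logp_word, logp_char, weight_word=0.7, weight_char=0.3):
    """Weighted sum of two LM log-probabilities (weights are assumed values)."""
    return weight_word * logp_word + weight_char * logp_char


def rerank(hypotheses, weight_word=0.7, weight_char=0.3):
    """Sort (text, logp_word, logp_char) tuples by combined score, best first."""
    return sorted(
        hypotheses,
        key=lambda h: log_linear_score(h[1], h[2], weight_word, weight_char),
        reverse=True,
    )


# Toy, made-up probabilities for two competing hypotheses.
hyps = [
    ("hypothesis A", math.log(0.020), math.log(0.001)),
    ("hypothesis B", math.log(0.015), math.log(0.010)),
]

best_combined = rerank(hyps)[0][0]             # character LM shifts the ranking
best_word_only = rerank(hyps, 1.0, 0.0)[0][0]  # word LM alone
```

In this toy case the word-level LM alone prefers hypothesis A, while the combined score prefers hypothesis B because the character-level LM penalizes A heavily.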

Structured SVMs for Automatic Speech Recognition

Automatic Selection of Recognition Errors by Respeaking the Intended Text

Structured Support Vector Machines for Noise Robust Continuous Speech Recognition

User target intention recognition from cursor position using kalman filter

Abstract:

This paper discusses user target intention recognition algorithms for point-and-click tasks, with the aim of reducing users' pointing time and difficulty. An approach that predicts targets by comparing the bearing angles to candidate targets, proposed as one of the first such algorithms [1], is compared with a Kalman filter prediction algorithm. Accuracy and sensitivity of prediction are used as performance criteria. The outcomes of a standard point-and-click experiment, collected from both able-bodied and impaired users, are used for the performance comparison. © 2013 Springer-Verlag Berlin Heidelberg.
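To make the Kalman filter idea concrete, here is a minimal sketch of a constant-velocity Kalman filter that tracks one cursor coordinate and extrapolates it a few steps ahead. The motion model, the noise parameters `q` and `r`, the class name `CursorKalman1D` and the synthetic cursor trace are all assumptions chosen for illustration; the abstract does not specify the model used in the paper.

```python
class CursorKalman1D:
    """Constant-velocity Kalman filter for one cursor axis (illustrative)."""

    def __init__(self, q=1e-3, r=1.0, p0=100.0):
        self.pos, self.vel = 0.0, 0.0          # state estimate
        self.P = [[p0, 0.0], [0.0, p0]]        # state covariance
        self.q, self.r = q, r                  # process / measurement noise

    def step(self, z, dt=1.0):
        """Run one predict + update cycle with measured position z."""
        # Predict with x' = F x, P' = F P F^T + Q, where F = [[1, dt], [0, 1]].
        pos = self.pos + dt * self.vel
        vel = self.vel
        (a, b), (c, d) = self.P
        p00 = a + dt * (b + c) + dt * dt * d + self.q
        p01 = b + dt * d
        p10 = c + dt * d
        p11 = d + self.q
        # Update with measurement matrix H = [1, 0].
        s = p00 + self.r                       # innovation variance
        k0, k1 = p00 / s, p10 / s              # Kalman gain
        y = z - pos                            # innovation
        self.pos = pos + k0 * y
        self.vel = vel + k1 * y
        self.P = [[(1 - k0) * p00, (1 - k0) * p01],
                  [p10 - k1 * p00, p11 - k1 * p01]]

    def predict_ahead(self, horizon):
        """Extrapolate the position `horizon` time units into the future."""
        return self.pos + horizon * self.vel


# Synthetic trace: the cursor moves one pixel per time step along one axis.
kf = CursorKalman1D()
for x in range(30):
    kf.step(float(x))
predicted = kf.predict_ahead(5)
```

One possible use, again as an assumption, is to compare the extrapolated position with the locations of on-screen targets and pick the nearest one as the predicted target.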
