129 resultados para Face recognition makeup riconoscimento volto immagini trucco alterazione
Resumo:
This paper discusses a novel high-speed approach for human action recognition in H.264/AVC compressed domain. The proposed algorithm utilizes cues from quantization parameters and motion vectors extracted from the compressed video sequence for feature extraction and further classification using Support Vector Machines (SVM). The ultimate goal of the proposed work is to portray a much faster algorithm than pixel domain counterparts, with comparable accuracy, utilizing only the sparse information from compressed video. Partial decoding rules out the complexity of full decoding, and minimizes computational load and memory usage, which can result in reduced hardware utilization and faster recognition results. The proposed approach can handle illumination changes, scale, and appearance variations, and is robust to outdoor as well as indoor testing scenarios. We have evaluated the performance of the proposed method on two benchmark action datasets and achieved more than 85 % accuracy. The proposed algorithm classifies actions with speed (> 2,000 fps) approximately 100 times faster than existing state-of-the-art pixel-domain algorithms.
Resumo:
Fringe tracking and fringe order assignment have become the central topics of current research in digital photoelasticity. Isotropic points (IPs) appearing in low fringe order zones are often either overlooked or entirely missed in conventional as well as digital photoelasticity. We aim to highlight image processing for characterizing IPs in an isochromatic fringe field. By resorting to a global analytical solution of a circular disk, sensitivity of IPs to small changes in far-field loading on the disk is highlighted. A local theory supplements the global closed-form solutions of three-, four-, and six-point loading configurations of circular disk. The local theoretical concepts developed in this paper are demonstrated through digital image analysis of isochromatics in circular disks subjected to three-and four-point loads. (C) 2015 Society of Photo-Optical Instrumentation Engineers (SPIE)
Resumo:
With the increasing availability of wearable cameras, research on first-person view videos (egocentric videos) has received much attention recently. While some effort has been devoted to collecting various egocentric video datasets, there has not been a focused effort in assembling one that could capture the diversity and complexity of activities related to life-logging, which is expected to be an important application for egocentric videos. In this work, we first conduct a comprehensive survey of existing egocentric video datasets. We observe that existing datasets do not emphasize activities relevant to the life-logging scenario. We build an egocentric video dataset dubbed LENA (Life-logging EgoceNtric Activities) (http://people.sutd.edu.sg/similar to 1000892/dataset) which includes egocentric videos of 13 fine-grained activity categories, recorded under diverse situations and environments using the Google Glass. Activities in LENA can also be grouped into 5 top-level categories to meet various needs and multiple demands for activities analysis research. We evaluate state-of-the-art activity recognition using LENA in detail and also analyze the performance of popular descriptors in egocentric activity recognition.
Resumo:
Selective and discriminative detection of -NO2 containing high energy organic compounds such as picric acid (PA), 2,4,6-trinitrotoluene (TNT) and dinitrotoluene (DNT) has become a challenging task due to concerns over national security, criminal investigations and environment protections. Among various known detection methods, fluorescence techniques have gained special attention in recent time. A wide variety of fluorescent chemosensors have been developed for nitroaromatic explosive detection. In this review article, we provide an overview of the recent developments made in small molecule-based turn-off fluorescent sensors for nitroaromatic explosives with special focus on organic and H-bonded supramolecular sensors. The fluorescent sensors discussed in this review are classified and organized according to their functionality and their recognition of nitroaromatics by fluorescence quenching.
Resumo:
In optical character recognition of very old books, the recognition accuracy drops mainly due to the merging or breaking of characters. In this paper, we propose the first algorithm to segment merged Kannada characters by using a hypothesis to select the positions to be cut. This method searches for the best possible positions to segment, by taking into account the support vector machine classifier's recognition score and the validity of the aspect ratio (width to height ratio) of the segments between every pair of cut positions. The hypothesis to select the cut position is based on the fact that a concave surface exists above and below the touching portion. These concave surfaces are noted down by tracing the valleys in the top contour of the image and similarly doing it for the image rotated upside-down. The cut positions are then derived as closely matching valleys of the original and the rotated images. Our proposed segmentation algorithm works well for different font styles, shapes and sizes better than the existing vertical projection profile based segmentation. The proposed algorithm has been tested on 1125 different word images, each containing multiple merged characters, from an old Kannada book and 89.6% correct segmentation is achieved and the character recognition accuracy of merged words is 91.2%. A few points of merge are still missed due to the absence of a matched valley due to the specific shapes of the particular characters meeting at the merges.
Resumo:
In this work, we describe a system, which recognises open vocabulary, isolated, online handwritten Tamil words and extend it to recognize a paragraph of writing. We explain in detail each step involved in the process: segmentation, preprocessing, feature extraction, classification and bigram-based post-processing. On our database of 45,000 handwritten words obtained through tablet PC, we have obtained symbol level accuracy of 78.5% and 85.3% without and with the usage of post-processing using symbol level language models, respectively. Word level accuracies for the same are 40.1% and 59.6%. A line and word level segmentation strategy is proposed, which gives promising results of 100% line segmentation and 98.1% word segmentation accuracies on our initial trials of 40 handwritten paragraphs. The two modules have been combined to obtain a full-fledged page recognition system for online handwritten Tamil data. To the knowledge of the authors, this is the first ever attempt on recognition of open vocabulary, online handwritten paragraphs in any Indian language.
Resumo:
The relative energies of triangular face sharing condensed macro polyhedral carboranes: CB20H18 and C2B19H18+ derived from mono- and di-substitution of carbons in (4) B21H18- is calculated at B3LYP/6-31G* level. The relative energies, H center dot center dot center dot H non-bonding distances, NICS values, topological charge analysis and orbital overlap compatibility connotes the face sharing condensed macro polyhedral mono-carboranes, 8 (4-CB20H18) to be the lowest energy isomer. The di-carba- derivative, (36) 4,4'a-C2B19H18+ with carbons substituted in a different B-12 cage in (4) B21H18- in anti-fashion is the most stable isomer among 28 possibilities. This structure has less non-bonding H center dot center dot center dot H interaction and is in agreement with orbital-overlap compatibility, and these two have the pivotal role in deciding the stability of these clusters. An estimate of the inherent stability of these carboranes is made using near-isodesmic equations which show that CB20H18 (8) is in the realm of the possible. (C) 2015 Elsevier B.V. All rights reserved.
Resumo:
A new triphenylamine-based organometallic Pt-II luminogen (1) and its analogous organic compound (2) are reported. The molecules are decorated with aldehyde functionality to improve their photophysical properties by utilising donor-acceptor interactions. The single crystal X-ray structure analysis of PtII analogue 1 revealed that the neighbouring molecules were loosely organised by weak intermolecular C-H center dot center dot center dot pi interactions. Because of the twisted nature of the triphenylamine backbone the compounds showed aggregation-induced emission enhancement in THF/water mixture. Due to their loose crystal packing, upon application of external stimuli these luminogens exhibited mechano-fluorochromic behaviour. The crystalline forms of the compounds displayed a more superior emission efficiency than the grinded samples. Moreover, the compounds showed crystallization-induced emission enhancement (CIEE) and exhibited chemodosimetric response towards cysteine under physiological condition.
Resumo:
Image and video analysis requires rich features that can characterize various aspects of visual information. These rich features are typically extracted from the pixel values of the images and videos, which require huge amount of computation and seldom useful for real-time analysis. On the contrary, the compressed domain analysis offers relevant information pertaining to the visual content in the form of transform coefficients, motion vectors, quantization steps, coded block patterns with minimal computational burden. The quantum of work done in compressed domain is relatively much less compared to pixel domain. This paper aims to survey various video analysis efforts published during the last decade across the spectrum of video compression standards. In this survey, we have included only the analysis part, excluding the processing aspect of compressed domain. This analysis spans through various computer vision applications such as moving object segmentation, human action recognition, indexing, retrieval, face detection, video classification and object tracking in compressed videos.