7 resultados para Word and object behaviorism
em Indian Institute of Science - Bangalore - Índia
Resumo:
This paper describes the efforts at MILE lab, IISc, to create a 100,000-word database each in Kannada and Tamil for the design and development of Online Handwritten Recognition. It has been collected from over 600 users in order to capture the variations in writing style. We describe features of the scripts and how the number of symbols were reduced to be able to effectively train the data for recognition. The list of words include all the characters, Kannada and Indo-Arabic numerals, punctuations and other symbols. A semi-automated tool for the annotation of data from stroke to word level is used. It segments each word into stroke groups and also acts as a validation mechanism for segmentation. The tool displays the stroke, stroke groups and aksharas of a word and hence can be used to study the various styles of writing, delayed strokes and for assigning quality tags to the words. The tool is currently being used for annotating Tamil and Kannada data. The output is stored in a standard XML format.
Resumo:
We consider the problem of extracting a signature representation of similar entities employing covariance descriptors. Covariance descriptors can efficiently represent objects and are robust to scale and pose changes. We posit that covariance descriptors corresponding to similar objects share a common geometrical structure which can be extracted through joint diagonalization. We term this diagonalizing matrix as the Covariance Profile (CP). CP can be used to measure the distance of a novel object to an object set through the diagonality measure. We demonstrate how CP can be employed on images as well as for videos, for applications such as face recognition and object-track clustering.
Resumo:
Image and video analysis requires rich features that can characterize various aspects of visual information. These rich features are typically extracted from the pixel values of the images and videos, which require huge amount of computation and seldom useful for real-time analysis. On the contrary, the compressed domain analysis offers relevant information pertaining to the visual content in the form of transform coefficients, motion vectors, quantization steps, coded block patterns with minimal computational burden. The quantum of work done in compressed domain is relatively much less compared to pixel domain. This paper aims to survey various video analysis efforts published during the last decade across the spectrum of video compression standards. In this survey, we have included only the analysis part, excluding the processing aspect of compressed domain. This analysis spans through various computer vision applications such as moving object segmentation, human action recognition, indexing, retrieval, face detection, video classification and object tracking in compressed videos.
Resumo:
During lightning strike to a tall grounded object (TGO), reflections of current waves are known to occur at either ends of the TGO. These reflection modify the channel current and hence, the lightning electromagnetic fields. This study aims to identify the possible contributing factors to reflection at a TGO-channel junction for the current waves ascending on the TGO. Possible sources of reflection identified are corona sheath and discontinuity of resistance and radius. For analyzing the contribution of corona sheath and discontinuity of resistance at the junction, a macroscopic physical model for the return stroke developed in our earlier work is employed. NEC-2D is used for assessing the contribution of abrupt change in radii at a TGO-channel junction. The wire-cage model adopted for the same is validated using laboratory experiments. Detailed investigation revealed the following. The main contributor for reflection at a TGO-channel junction is the difference between TGO and channel core radii. Also, the discontinuity of resistance at a TGO-channel junction can be of some relevance only for the first microsecond regime. Further, corona sheath does not play any significant role in the reflection.
Resumo:
We consider the rotational motion of an elongated nanoscale object in a fluid under an external torque. The experimentally observed dynamics could be understood from analytical solutions of the Stokes equation, with explicit formulae derived for the dynamical states as a function of the object dimensions and the parameters defining the external torque. Under certain conditions, multiple analytical solutions to the Stokes equations exist, which have been investigated through numerical analysis of their stability against small perturbations and their sensitivity towards initial conditions. These experimental results and analytical formulae are general enough to be applicable to the rotational motion of any isolated elongated object at low Reynolds numbers, and could be useful in the design of non-spherical nanostructures for diverse applications pertaining to microfluidics and nanoscale propulsion technologies.
Resumo:
In this paper, we report a breakthrough result on the difficult task of segmentation and recognition of coloured text from the word image dataset of ICDAR robust reading competition challenge 2: reading text in scene images. We split the word image into individual colour, gray and lightness planes and enhance the contrast of each of these planes independently by a power-law transform. The discrimination factor of each plane is computed as the maximum between-class variance used in Otsu thresholding. The plane that has maximum discrimination factor is selected for segmentation. The trial version of Omnipage OCR is then used on the binarized words for recognition. Our recognition results on ICDAR 2011 and ICDAR 2003 word datasets are compared with those reported in the literature. As baseline, the images binarized by simple global and local thresholding techniques were also recognized. The word recognition rate obtained by our non-linear enhancement and selection of plance method is 72.8% and 66.2% for ICDAR 2011 and 2003 word datasets, respectively. We have created ground-truth for each image at the pixel level to benchmark these datasets using a toolkit developed by us. The recognition rate of benchmarked images is 86.7% and 83.9% for ICDAR 2011 and 2003 datasets, respectively.
Resumo:
Using coherent light interrogating a turbid object perturbed by a focused ultrasound (US) beam, we demonstrate localized measurement of dynamics in the focal region, termed the region-of-interest (ROI), from the decay of the modulation in intensity autocorrelation of light. When the ROI contains a pipe flow, the decay is shown to be sensitive to the average flow velocity from which the mean-squared displacement (MSD) of the scattering centers in the flow can be estimated. While the MSD estimated is seen to be an order of magnitude higher than that obtainable through the usual diffusing wave spectroscopy (DWS) without the US, it is seen to be more accurate as verified by the volume flow estimated from it. It is further observed that, whereas the MSD from the localized measurement grows with time as tau(alpha) with alpha approximate to 1.65, without using the US, a is seen to be much less. Moreover, with the local measurement, this super-diffusive nature of the pipe flow is seen to persist longer, i.e., over a wider range of initial tau, than with the unassisted DWS. The reason for the super-diffusivity of flow, i.e., alpha < 2, in the ROI is the presence of a fluctuating (thermodynamically nonequilibrium) component in the dynamics induced by the US forcing. Beyond this initial range, both methods measure MSDs that rise linearly with time, indicating that ballistic and near-ballistic photons hardly capture anything beyond the background Brownian motion. (C) 2015 Optical Society of America