Biblioteca Digital

893 resultados para Word Category Violations

Power-law transformation for enhanced recognition of born-digital word images

Relevância:

20.00% 20.00%

Publicador:

Resumo:

In this paper, we discuss the issues related to word recognition in born-digital word images. We introduce a novel method of power-law transformation on the word image for binarization. We show the improvement in image binarization and the consequent increase in the recognition performance of OCR engine on the word image. The optimal value of gamma for a word image is automatically chosen by our algorithm with fixed stroke width threshold. We have exhaustively experimented our algorithm by varying the gamma and stroke width threshold value. By varying the gamma value, we found that our algorithm performed better than the results reported in the literature. On the ICDAR Robust Reading Systems Challenge-1: Word Recognition Task on born digital dataset, as compared to the recognition rate of 61.5% achieved by TH-OCR after suitable pre-processing by Yang et. al. and 63.4% by ABBYY Fine Reader (used as baseline by the competition organizers without any preprocessing), we achieved 82.9% using Omnipage OCR applied on the images after being processed by our algorithm.

Language models for online handwritten Tamil word recognition

Relevância:

20.00% 20.00%

Publicador:

Resumo:

N-gram language models and lexicon-based word-recognition are popular methods in the literature to improve recognition accuracies of online and offline handwritten data. However, there are very few works that deal with application of these techniques on online Tamil handwritten data. In this paper, we explore methods of developing symbol-level language models and a lexicon from a large Tamil text corpus and their application to improving symbol and word recognition accuracies. On a test database of around 2000 words, we find that bigram language models improve symbol (3%) and word recognition (8%) accuracies and while lexicon methods offer much greater improvements (30%) in terms of word recognition, there is a large dependency on choosing the right lexicon. For comparison to lexicon and language model based methods, we have also explored re-evaluation techniques which involve the use of expert classifiers to improve symbol and word recognition accuracies.

Benchmarking recognition results on camera captured word image data sets

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We have benchmarked the maximum obtainable recognition accuracy on five publicly available standard word image data sets using semi-automated segmentation and a commercial OCR. These images have been cropped from camera captured scene images, born digital images (BDI) and street view images. Using the Matlab based tool developed by us, we have annotated at the pixel level more than 3600 word images from the five data sets. The word images binarized by the tool, as well as by our own midline analysis and propagation of segmentation (MAPS) algorithm are recognized using the trial version of Nuance Omnipage OCR and these two results are compared with the best reported in the literature. The benchmark word recognition rates obtained on ICDAR 2003, Sign evaluation, Street view, Born-digital and ICDAR 2011 data sets are 83.9%, 89.3%, 79.6%, 88.5% and 86.7%, respectively. The results obtained from MAPS binarized word images without the use of any lexicon are 64.5% and 71.7% for ICDAR 2003 and 2011 respectively, and these values are higher than the best reported values in the literature of 61.1% and 41.2%, respectively. MAPS results of 82.8% for BDI 2011 dataset matches the performance of the state of the art method based on power law transform.

NESP: Nonlinear enhancement and selection of plane for optimal segmentation and recognition of scene word images

Relevância:

20.00% 20.00%

Publicador:

Resumo:

In this paper, we report a breakthrough result on the difficult task of segmentation and recognition of coloured text from the word image dataset of ICDAR robust reading competition challenge 2: reading text in scene images. We split the word image into individual colour, gray and lightness planes and enhance the contrast of each of these planes independently by a power-law transform. The discrimination factor of each plane is computed as the maximum between-class variance used in Otsu thresholding. The plane that has maximum discrimination factor is selected for segmentation. The trial version of Omnipage OCR is then used on the binarized words for recognition. Our recognition results on ICDAR 2011 and ICDAR 2003 word datasets are compared with those reported in the literature. As baseline, the images binarized by simple global and local thresholding techniques were also recognized. The word recognition rate obtained by our non-linear enhancement and selection of plance method is 72.8% and 66.2% for ICDAR 2011 and 2003 word datasets, respectively. We have created ground-truth for each image at the pixel level to benchmark these datasets using a toolkit developed by us. The recognition rate of benchmarked images is 86.7% and 83.9% for ICDAR 2011 and 2003 datasets, respectively.

Addressing insecurities and violations of privacy

Relevância:

20.00% 20.00%

Publicador:

HMM word and phrase alignment for statistical machine translation

Relevância:

20.00% 20.00%

Publicador:

Modeling Environmental Stress

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The word stress when applied to ecosystems is ambiguous. Stress may be low-level, with accompanying near-linear strain, or it may be of finite magnitude, with nonlinear response and possible disintegration of the system. Since there are practically no widely accepted definitions of ecosystem strain, classification of models of stressed systems is tenuous. Despite appearances, most ecosystem models seem to fall into the low-level linear response category. Although they sometimes simulate systems behavior well, they do not provide necessary and sufficient information about sudden structural changes nor structure after transition. Dynamic models of finiteamplitude response to stress are rare because of analytical difficulties. Some idea as to future transition states can be obtained by regarding the behavior of unperturbed functions under limiting strain conditions. Preliminary work shows that, since community variables do respond in a coherent manner to stress, macroscopic analyses of stressed ecosystems offer possible alternatives to compartmental models.

Minimum Bayes-risk word alignments of bilingual texts

Relevância:

20.00% 20.00%

Publicador:

Large vocabulary decoding and confidence estimation using word posterior probabilities

Relevância:

20.00% 20.00%

Publicador:

Class-based language model adaptation using mixtures of word-class weights

Relevância:

20.00% 20.00%

Publicador:

Word frequency cues word order in adults: cross-linguistic evidence

Relevância:

20.00% 20.00%

Publicador:

Resumo:

[EN] One universal feature of human languages is the division between grammatical functors and content words. From a learnability point of view, functors might provide entry points or anchors into the syntactic structure of utterances due to their high frequency. Despite its potentially universal scope, this hypothesis has not yet been tested on typologically different languages and on populations of different ages. Here we report a corpus study and an artificial grammar learning experiment testing the anchoring hypothesis in Basque, Japanese, French, and Italian adults. We show that adults are sensitive to the distribution of functors in their native language and use them when learning new linguistic material. However, compared to infants’ performance on a similar task, adults exhibit a slightly different behavior, matching the frequency distributions of their native language more closely than infants do. This finding bears on the issue of the continuity of language learning mechanism.

Comparison of part-of-speech and automatically derived category-based language models for speech recognition

Relevância:

20.00% 20.00%

Publicador:

Fast implementation methods for Viterbi-based word-spotting

Relevância:

20.00% 20.00%

Publicador:

A variable-length category-based n-gram language model

Relevância:

20.00% 20.00%

Publicador:

Video mail retrieval: the effect of word spotting accuracy on precision

Relevância:

20.00% 20.00%

Publicador:

«
1
2
...
5
6
7
8
9
10
11
...
59
60
»