Multi-script and multi-oriented text localization from scene images


Autoria(s): Kasar, Thotreingam; Ramakrishnan, Angarai G
Data(s)

2012

Resumo

This paper describes a new method of color text localization from generic scene images containing text of different scripts and with arbitrary orientations. A representative set of colors is first identified using the edge information to initiate an unsupervised clustering algorithm. Text components are identified from each color layer using a combination of a support vector machine and a neural network classifier trained on a set of low-level features derived from the geometric, boundary, stroke and gradient information. Experiments on camera-captured images that contain variable fonts, size, color, irregular layout, non-uniform illumination and multiple scripts illustrate the robustness of the method. The proposed method yields precision and recall of 0.8 and 0.86 respectively on a database of 100 images. The method is also compared with others in the literature using the ICDAR 2003 robust reading competition dataset.

Formato

application/pdf

Identificador

http://eprints.iisc.ernet.in/46230/1/CBDAR11_Kasar_LNCS.pdf

Kasar, Thotreingam and Ramakrishnan, Angarai G (2012) Multi-script and multi-oriented text localization from scene images. In: 4th International Workshop, CBDAR 2011, September 22, 2011, Beijing, China.

Publicador

Springer

Relação

http://dx.doi.org/10.1007/978-3-642-29364-1_1

http://eprints.iisc.ernet.in/46230/

Palavras-Chave #Electrical Engineering
Tipo

Conference Paper

PeerReviewed