Hausdorff Distances for Searching in Binary Text Images


Author(s): Andreev, Andrey; Kirov, Nikolay
Date(s)

16/09/2009

2009

Abstract

This work has been partially supported by Grant No. DO 02-275, 16.12.2008, Bulgarian NSF, Ministry of Education and Science.

The Hausdorff distance (HD) appears to be the most efficient instrument for measuring how far two compact non-empty subsets of a metric space are from each other. This paper considers the possibilities offered by HD and by some of its modifications, which many authors have recently used to measure the resemblance between binary text images. After summarizing some of the existing word image matching methods that rely on HD, we investigate a new, similar parameterized method that contains almost all of them as particular cases. Numerical experiments on searching for words in binary text images are carried out with 333 pages of old Bulgarian typewritten text, 200 printed pages of Bulgarian Chrestomathy from 1884, and 200 handwritten pages of a Slavonic manuscript from 1574. The experiments indicate how the parameters should be set in order to exploit the advantages of the proposed method for word matching in scanned document images.
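
For readers unfamiliar with the measure, the following is a minimal sketch of the classical (unmodified) Hausdorff distance between two binary images, each treated as the set of its foreground pixel coordinates. It is not the parameterized method proposed in the paper, whose details are not given in this record. The function names, the use of the Euclidean pixel metric, and the two tiny example images are assumptions made purely for illustration.

```python
import math


def foreground_points(image):
    """Return the set of (row, col) coordinates of non-zero pixels."""
    return {
        (r, c)
        for r, row in enumerate(image)
        for c, value in enumerate(row)
        if value
    }


def directed_hausdorff(a, b):
    """h(A, B): the largest distance from a point of A to its nearest point in B."""
    return max(min(math.dist(p, q) for q in b) for p in a)


def hausdorff(a, b):
    """Classical symmetric Hausdorff distance H(A, B) = max(h(A, B), h(B, A))."""
    if not a or not b:
        raise ValueError("both point sets must be non-empty")
    return max(directed_hausdorff(a, b), directed_hausdorff(b, a))


# Hypothetical 2x4 binary "word images" (1 = ink, 0 = background).
word_a = [
    [1, 1, 0, 0],
    [0, 0, 0, 0],
]
word_b = [
    [1, 1, 0, 1],
    [0, 0, 0, 0],
]

points_a = foreground_points(word_a)
points_b = foreground_points(word_b)
distance = hausdorff(points_a, points_b)
print(distance)
```

In this example the single extra pixel in the second image determines the whole distance, which illustrates why the classical measure is sensitive to noise and why modified variants are of interest for scanned documents. The brute-force search is quadratic in the number of foreground pixels, so it is only suitable for small images.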

Identifier

Serdica Journal of Computing, Vol. 3, No. 1 (2009), pp. 23-46

1312-6555

http://hdl.handle.net/10525/363

Language(s)

en

Publisher

Institute of Mathematics and Informatics, Bulgarian Academy of Sciences

Keywords

Hausdorff Distance; Binary Text Image; Word Matching

Type

Article