Edge-based connected component approach for skew correction of complex document images


Autoria(s): Kumar, J; Kasar, T; Ramakrishnan, AG
Data(s)

2007

Resumo

Skew correction of complex document images is a difficult task. We propose an edge-based connected component approach for robust skew correction of documents with complex layout and content. The algorithm essentially consists of two steps - an 'initialization' step to determine the image orientation from the centroids of the connected components and a 'search' step to find the actual skew of the image. During initialization, we choose two different sets of points regularly spaced across the the image, one from the left to right and the other from top to bottom. The image orientation is determined from the slope between the two succesive nearest neighbors of each of the points in the chosen set. The search step finds succesive nearest neighbors that satisfy the parameters obtained in the initialization step. The final skew is determined from the slopes obtained in the 'search' step. Unlike other connected component based methods, the proposed method does not require any binarization step that generally precedes connected component analysis. The method works well for scanned documents with complex layout of any skew with a precision of 0.5 degrees.

Formato

application/pdf

Identificador

http://eprints.iisc.ernet.in/26264/1/get.pdf

Kumar, J and Kasar, T and Ramakrishnan, AG (2007) Edge-based connected component approach for skew correction of complex document images. In: IEEE Region 10 Conference ( TENCON 2007), OCT 30-NOV 02, 2007, Taipei.

Publicador

IEEE

Relação

http://ieeexplore.ieee.org/search/srchabstract.jsp?tp=&arnumber=4429083&queryText%3D%28edge-based+connected+component+approach+for+skew+correction+of+complex+document+images%29%26openedRefinements%3D*&tag=1

http://eprints.iisc.ernet.in/26264/

Palavras-Chave #Electrical Engineering
Tipo

Conference Paper

PeerReviewed