Go home now Header Background Image
Search
Submission Procedure
share: |
 
Follow us
 
 
 
 
Volume 17 / Issue 1

available in:   PDF (213 kB) PS (251 kB)
 
get:  
Similar Docs BibTeX   Write a comment
  
get:  
Links into Future
 
DOI:   10.3217/jucs-017-01-0048

 

An OCR Free Method for Word Spotting in Printed Documents: the Evaluation of Different Feature Sets

Israel Rios (Pontifical Catholic University of Parana, Brazil)

Alceu de Souza Britto Jr (Pontifical Catholic University of Parana, Brazil)

Alessandro Lameiras Koerich (Pontifical Catholic University of Parana, Brazil)

Luis Eduardo Soares Oliveira (Federal University of Parana, Brazil)

Abstract: An OCR free word spotting method is developed and evaluated under a strong experimental protocol. Different feature sets are evaluated under the same experimental conditions. In addition, a tuning process in the document segmentation step is proposed which provides a significant reduction in terms of processing time. For this purpose, a complete OCR-free method for word spotting in printed documents was implemented, and a document database containing document images and their corresponding ground truth text files was created. A strong experimental protocol based on 800 document images allows us to compare the results of the three feature sets used to represent the word image.

Keywords: document retrieval, word recognition, word spotting

Categories: I.5, I.7