Optical Character Recognition (OCR) automatically recognizes texts in an image. OCR is still a challenging problem in computer vision. A successful solution to OCR has important scanner applications such as Text-To-Speech (TTS) conversion and automati...
Optical Character Recognition (OCR) automatically recognizes texts in an image. OCR is still a challenging problem in computer vision. A successful solution to OCR has important scanner applications such as Text-To-Speech (TTS) conversion and automatic document classification.
In this work, we evaluate the current state-of-the-art OCR methods. One is based on convolutional neural networks (CNNs) and the other is Tesseract that is developed by HP. For this, we have designed a CNNs architecture for OCR and built our own dataset that contains upper and lower case characters. We have experimented in the presence of Salt and Pepper noise or Gaussian noise and reported their performance comparison in terms of recognition accuracy and processing time. Experimental results indicate that CNNs based OCR outperforms Tesseract in recognition accuracy but takes much more computational resources than Tesseract. In case that processing time has priority, we recommend Tesseract due to its processing speed.
In addition, CNNs is showed 97.49 % recognition accuracy using four-fold cross-validation.