Vol 8, No 5 (2017) > Electrical, Electronics and Computer Engineering >

Histogram Equalization Implementation in the Preprocessing Phase on Optical Character Recognition

Peter Pangestu, Dennis Gunawan, Seng Hansun

 

Abstract: A 2014 report from Digital Marketing Philippines stated that the number of web applications with visual content as their main product has increased significantly. Image processing technology has also undergone significant growth. One example of this is optical character recognition (OCR), which can convert the text on an image to plain text. However, a problem occurs when the image has low contrast and low exposure, which potentially results in information being hidden in the image. To address this problem, histogram equalization is used to enhance the image’s contrast so the hidden information can be shown. Similar to X-ray scanning used in the medical field, histogram equalization processes scanned images that have low brightness and low contrast. In this study, histogram equalization was successfully implemented using OCR preprocessing. The test was done with a dataset that contains dark background images with low light text; the successful outcome resulted in the ability to show 74.95% of the information hidden in the image.
Keywords: Contrast enhancement; Histogram equalization; Image processing; Information hiding; Optical character recognition

Full PDF Download

References


Abbyy-developers.eu, 2015. Image Processing and Binarisation for Camera OCR. Available online at https://abbyy.technology/en:features:ocr:cameraocr-preprocessing-binarisation

Ahmad, N., Hadinegoro, A., 2012. Metode Histogram Equalization untuk Perbaikan Citra Digital. In: Proceedings of Seminar Nasional Teknologi Informasi & Komunikasi Terapan 2012, Semarang: Indonesia, INFRM, pp. 439–445

Akhlis, I., Sugiyanto, 2011. Implementasi Metode Histogram Equalization untuk Meningkatkan Kualitas Citra Digital. Jurnal Fisika, Volume 1(2), pp. 70–74

Alginahi, Y., 2010. Preprocessing Techniques in Character Recognition, Character Recognition, Minoru Mori (Ed.), ISBN: 978-953-307-105-3, InTech. Available online at http://cdn.intechopen.com/pdfs-wm/11405.pdf

Digitalmarketingphilippines.com. 2014. Amazing Facts and Statistics about Visual Web. Available online at http://digitalmarketingphilippines.com/wp-content/uploads/2014/01/Amazing-Facts-and-Statistics-about-Visual-Web.jpg

Gonzalez, R., Woods, R., 2008. Digital Image Processing (3rd ed). New Jersey: Prentice-Hall

Holley, R., 2009. How Good Can It Get? Analysing and Improving OCR Accuracy in Large Scale Historic Newspaper Digitisation Programs. Available online at http://www.dlib.org/dlib/march09/holley/03holley.html

iapr-tc11.org, 2015. Datasets List - TC11. Available online at http://www.iapr-tc11.org/mediawiki/index.php/Datasets_List

Krutsch, R., Tenorio, D., 2011. Histogram Equalization. Guadalajara: Freescale Semiconductor Application Note Number AN4318, Rev 0

MathWorks.com, 2016. Image Processing and Computer Vision Examples. Available online at http://www.mathworks.com/examples/product-group/matlab-image-processing-and-computer-vision

Mithe, R., Indalkar, S., Divekar, N., 2013. Optical Character Recognition. International Journal of Recent Technology and Engineering (IJRTE), Volume 2 (1), pp. 72–75

Rachman, E.M.B.P., 2014. Histogram Equalisation. Available online at http://ilmukomputer.org/wp-content/uploads/2014/02/Histogram-Equalisation-Pengolahan-Citra-Digital.odt

Rice, S.V., Jenkins, F.R., Nartker, T.A., 1995. The Fourth Annual Test of OCR Accuracy. Available online at http://www.expervision.com/wp-content/uploads/2012/12/1995.The_Fourth_Annual_Test_of_OCR_Accuracy.pdf

Sánchez, J., Perronnin, F., de Campos, T., 2012. Modeling the Spatial Layout of Images Beyond Spatial Pyramids. Pattern Recognition Letters, Volume 33(16), pp. 2216–2223

Xcitex, Inc., 2010. Image Processing: Brightness, Contrast, Gamma, and Exponential/Logarithmic Settings in ProAnalyst. Available online at http://www.xcitex.com/Resource%20Center/ProAnalyst/Application%20Notes/App%20Note%20151%20-%20Image%20Processing%20Brightness,%20Contrast,%20Gamma%20and%20Exponential.pdf

Zybert, C., 2014. How does Optical Character Recognition Work. Available online at http://nedocs.com/how-does-optical-character-recognition-work/