Can Machine Automatically Discover Text Image from Overall Perspective

doi:10.23940/ijpe.19.01.p28.281287

Int J Performability Eng ›› 2019, Vol. 15 ›› Issue (1): 281-287.doi: 10.23940/ijpe.19.01.p28.281287

Previous Articles Next Articles

Can Machine Automatically Discover Text Image from Overall Perspective

Wei Jiang^a, Jiayi Wu^a, and Chao Yao^b*()

a School of Software, North China University of Water Resources and Electric Power,Zhengzhou, 450045,China
b School of Automation,Northwestern Polytechnic University,Xi’an,710071, China

Revised on ; Accepted on
Contact: Yao Chao E-mail:yaochao@nwpu.edu.cn
About author:Wei Jiang received the PH.D. degree from Xidan University, Xi’an, China, in 2014. He now works as a senior lecturer in School of Software, North China University of Water Resources and Electric Power in Zhengzhou, China. His interest is scene text detection and recognition.|Jiayi Wu received her master degree from the University of Warwick, England, in 2010. She now works for School of Software, North China University of Water Resources and Electric Power in Zhengzhou, China.|Chao Yao received the PH.D. degree from Xidan University, Xi’an, China, in 2014. He has visited Concordia University in Montreal, Canada as joint PH.D. Student from 2011 to 2012. He has finished post-doc in 2017 and works as assistant professor in School of Automation, Northwestern Polytechnic University, in Xi’an, China. His interest is text recognition and dimension reduction.

Abstract

Abstract:

Recently, more and more researchers have focused on the problem about how to automatically distinguish text images from non-text ones. Most of previous works have originated from local features, which are computational expensive, and usually employ GPU in their procedure. To address this problem, we propose a new and simple but effective scheme from an overall perspective. In the proposed scheme, a sort of holistic feature is first extracted from Fourier spectrum, which describes the characteristic of the image or the sub-image as a whole without local feature extraction; then, random forests are utilized to classify images into text and non-text ones. Experimental results in several public datasets demonstrate that this scheme is efficient and effective.

Key words: natural images, holistic feature, text/non-text image classification, random forests

Wei Jiang, Jiayi Wu, and Chao Yao. Can Machine Automatically Discover Text Image from Overall Perspective [J]. Int J Performability Eng, 2019, 15(1): 281-287.

Add to citation manager EndNote|Reference Manager|ProCite|BibTeX|RefWorks

Figures/Tables 11

Figure 1

Figure 2

Figure 3

Figure 4

Figure 5

Figure 6

Figure 7

Table 1

Table 2

Table 3

Figure 8

References 14

[1]	X. Bai, B. Shi, C. Zhang, X. Cai, L. Qi , “Text/Non-Text Image Classification in the Wild With Convolutional Neural Networks,”Pattern Recognition, Vol. 66, No. 6, pp. 437-446,2017
[2]	N.G. Alessi, S. Battiato, G. Gallo, M. Mancuso, F. Stanco , “Automatic Discrimination of Text Images,”inProceedings of SPIE, pp. 351-359, 2003
[3]	E. Indermuhle, H. Bunke, F. Shafait, T. Breuel, “Text Versus Non-Text Distinction in Online Handwritten Documents,”in Proceedings of the 2010 ACM Symposium on Applied Computing, pp. 3-7, 2010
[4]	V. Vidya, T. R. Indhu, V. K. Bhadran , “Classification of Handwritten Document Image into Text and Non-Text Regions,” inProceedings of theFourth International Conference on Signal and Image Processing, pp. 103-112, 2012
[5]	P. Shivakumara, A. Dutta, T.Q. Phan, C. L. Tan, U. Pal , “A Novel Mutual Nearest Neighbor based Symmetry for Text Frame Classification in Video,” Pattern Recognition, Vol. 44, No. 8, pp. 1671-1683, 2011
[6]	P. Shivakumara, A. Dutta, C.L. Tan , “A New Symmetry based on Proximity of Wavelet-Moments for Text Frame Classification in Video,” inProceedings of International Conference on Pattern Recognition, pp. 129-132, 2010
[7]	P. Shivakumara and C.L. Tan , “Novel Edge Features for Text Frame Classification in Video,” inProceedings of International Conference on Pattern Recognition, pp. 3191-3194, 2010
[8]	C. Zhang, C. Yao, B. Shi, X. Bai , “Automatic Discrimination of Text and Non-Text Natural Images,” in Proceedings of International Conference on Document Analysis and Recognition, pp. 886-889, 2015
[9]	P. Lyu, B. Shi, C. Zhang, X. Bai , “Distinguishing Text/Non-Text Natural Images with Multi-Dimensional Recurrent Neural Networks,” inProceedings of International Conference on Pattern Recognition, pp. 3981-3986, 2016
[10]	X. D. Hou and L. Q. Zhang , “Saliency Detection: ASpectral Residual Approach,”inProceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp. 1-8, 2007
[11]	W. Jiang, Z. Y. Lu, J. Li, X. P. Liu, C. Yao , “Visual Saliency and Text Confidence Map based Background Suppression for Scene Text,” Chinese Journal of Electronics, Vol. 43, No. 1, pp. 62-68, 2015
[12]	N. Sharma, P. Shivakumara, U. Pal, M. Blumenstein, C. L. Tan , “Piece-wise Linearity based Method for Text Frame Classification in Video,” Pattern Recognition, Vol. 48, No. 3, pp. 862-881, 2015
[13]	C. Yao, X. Bai, W. Y. Liu , “AUnified Framework for Multi-Oriented Text Detection and Recognition,” IEEE Transaction on Image Processing, Vol. 23, No. 11, pp. 4737-4749, 2014
[14]	L. Neumann and J. Matas , “A Method for Text Localization and Recognition in Real-World,” inProceedings of ACCV, pp. 770-783,2010

Methods	Text (%)	Non-text (%)
Proposed Method	100	100
Bai	100	100
Shivakumara[12]	97.62	100
Shivakumara[5]	75.54	24.46

Methods	Text (%)	Error (%)
Proposed Method	82.21	17.79
Bai et al.[1]	89.2	10.8
Shivakumara et al.[12]	80.97	19.03
Shivakumara et al.[5]	81.12	18.88

Methods	Precision (%)	Recall (%)	F-Measuremen
Proposed Method	80.5	91.3	85.5
Bai[1]	93.7	95.4	94.6
Zhang[8]	75.4	97.9	85.1
Yao[13]	80.8	90.2	85.3
Neumann[14]	52.5	98.4	68.5

Can Machine Automatically Discover Text Image from Overall Perspective

RichHTML

PDF

Knowledge

Abstract

Cite this article

share this article

Figures/Tables 11

References 14

Related Articles 0

Recommended 0