Detection of Text from Lecture Video Images using CTPN Algorithm

Detection of Text from Lecture Video Images using CTPN Algorithm

  IJETT-book-cover           
  
© 2022 by IJETT Journal
Volume-70 Issue-1
Year of Publication : 2022
Authors : Geeta S Hukkeri, R H Goudar
DOI :  10.14445/22315381/IJETT-V70I1P220

How to Cite?

Geeta S Hukkeri, R H Goudar, "Detection of Text from Lecture Video Images using CTPN Algorithm," International Journal of Engineering Trends and Technology, vol. 70, no. 3, pp. 179-184, 2022. Crossref, https://doi.org/10.14445/22315381/IJETT-V70I1P220

Abstract
Text data in lecture video images play a crucial indication in understanding lecture videos. Different stages are involved in the detection of such data from images that are pictured by text detection methods. The goal of this paper is to study the existing text detection methods and reveal the best method among them. This paper presents a deep learning-based methodology known as CTPN that precisely confines text lines in a common picture in the content discovery stage. The CTPN operates dependably on multi-scale and multilanguage text minus any additional post-preparing, withdrawing from past base-up techniques needing multistep post filtering. The result of detected text has been shown in this paper.

Keywords
Text detection, text localization, Conventional approach, CTPN Algorithm.

Reference
[1] Nils L. Westhausen1 and Bernd T. Meyer, Dual-Signal Transformation LSTM Network for Real-Time Noise Suppression, SPRINGER, 1-5 (2020).
[2] Niddal h. Imam, Vassilios g. Vassilakis, An approach for detecting image spam in OSNs, SPRINGER, (2020) 1-8.
[3] Langcai Cao, Hongwei Li, Rongbiao Xie, And Jinrong Zhu, A Text Detection Algorithm for Image of Student Exercises Based on CTPN and Enhanced YOLOv3, IEEE Access, 8 (2020) 176924- 176934.
[4] Xiangwen Liu, Joe Meehan, Weida Tong, Leihong Wu, Xiaowei Xu, and Joshua Xu, DLI-IT: a deep learning approach to drug label identification through image and text embedding, BMC Medical Informatics and Decision Making, (2020) 1-9.
[5] Anurag Agrahari, Rajib Ghosh, Multi-Oriented text detection in natural scene images based on the intersection of MSER with the locally binarized image, Third International conference on computing and network communications (CoCoNet’ 19), Elsevier, (2020) 322–330.
[6] Rituraj Soni, Bijendra Kumar, and Satish Chand, Text Region Extraction From Scene Images Using AGF and MSER, International Journal of Image and Graphics, 20(2) (2020) 1-23.
[7] A. Vishnu Vardhan, Padarthi Maheeja, Manukonda SaiSudhanvi, Nuthalapati Preethi, Mannava Veena, Detection of Text by Enhancing Stroke Width Transform and Maximally Stable Extremal Regions, © 2020 JCRT, 8(3) (2020).
[8] S.Shiyamala, S.Suganya, Detection, Localization of Text in Images by Mser and Enhanced Swt, International Journal of Innovative Technology and Exploring Engineering (IJITEE), 8 (2019) 2873- 2875.
[9] Wenxian Zeng, Qinglin Meng, Shuqing Zhang, Natural Scene Chinese Character Text Detection Method Based on Improved CTPN, IOP Conf. Series: Journal of Physics: Conf. Serie, (2019) 1-7.
[10] Chenhui Huang, Jinhua Xu, An Anchor-Free Oriented Text Detector with Connectionist Text Proposal Network, Proceedings of Machine Learning Research, (2019) 631.
[11] JIANG Mengdi, CHENG Jianghua, CHEN Minghui, and KU Xishu, An Improved Text Localization Method for Natural Scene Images, IOP Conf. Series: Journal of Physics: Conf. Series, (2018) 1-8.
[12] Xuejian Rong, Chucai Yi, and Yingli Tian, Unambiguous Text Localization and Retrieval for Cluttered Scenes, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), (2017) 5494.
[13] X. Zhou, C. Yao, H. Wen, Y. Wang, S. Zhou, W. He, J. Liang, EAST: an efficient and accurate scene text detector, in Proceedings of the CVPR, (2017) 2642.
[14] Z. Tian, W. Huang, T. He, P. He, Y. Qiao, Detecting text in natural image with connectionist text proposal network, in Proceedings of the ECCV, (2016) 56.
[15] B. Epshtein, E. Ofek, and Y. Wexler. Detecting text in natural scenes with stroke width transform, CVPR, (2010) 2963.
[16] H. Lin, P. Yang, and F. Zhang, Review of scene text detection and recognition, Arch. Comput. Methods Eng., 27(2) (2020) 433–454.
[17] Z. Tian, W. Huang, T. He, P. He, Y. Qiao, Detecting text in natural image with connectionist text proposal network, in Proceedings of the ECCV, (2016) 56.
[18] Jaderberg, M., Vedaldi, A., Zisserman, A., Deep features for text spotting, in European Conference on Computer Vision (ECCV), (2014) 512.
[19] Tian, S., Pan, Y., Huang, C., Lu, S., Yu, K., Tan, C.L., Text flow: A unified text detection system in natural scene images, in IEEE International Conference on Computer Vision (ICCV), (2015) 1.
[20] Wang, K., Babenko, B., Belongie, S., End-to-end scene text recognition, in IEEE International Conference on Computer Vision (ICCV), (2011) 1457.
[21] Huang, W., Lin, Z., Yang, J., Wang, J., Text localization in natural images using stroke feature transform and text covariance descriptors, in IEEE International Conference on Computer Vision (ICCV), (2013) 1241.
[22] Huang, W., Qiao, Y., Tang, X., Robust scene text detection with convolutional neural networks induced mser trees, in European Conference on Computer Vision (ECCV), (2014) 497.
[23] Simonyan, K., Zisserman, A., Very deep convolutional networks for large-scale image recognition, in International Conference on Learning Representation (ICLR), (2015) 1.
[24] D. Chen, J.-M. Odobez, and H. Bourlard, Text detection and recognition in images and video frames, Pattern Recognit., 37(3) (2004) 595–608.
[25] M. R. Lyu, J. Song, and M. Cai, A comprehensive method for multilingual video text detection, localization, and extraction, IEEE Trans. Circuits Syst. Video Technol.,15(2) (2005) 243–255.
[26] T. E. de Campos, B. R. Babu, and M. Varma, Character recognition in natural images, in Proc. VISAPP, (2009) 121–132.
[27] B. Epshtein, E. Ofek, and Y. Wexler, Detecting text in natural scenes with stroke width transform, in Proc. IEEE CVPR, (2010) 2963.
[28] K. Wang and S. Belongie, Word were spotting in the wild, in Proc. 11th ECCV, (2010) 591.
[29] L. Neumann and J. Matas, A method for text localization and recognition in real-world images, in Proc. 10th ACCV, (2010) 770.
[30] C. Yi and Y. Tian, Text string detection from natural scenes by structure-based partition and grouping, IEEE Trans. Image Process., 20(9) (2011) 2594–2605.
[31] A. Coates et al., Text detection and character recognition in scene images with unsupervised feature learning, in Proc. ICDAR, (2011) 440.
[32] Neumann and J. Matas, Real-time scene text localization and recognition, in Proc. IEEE CVPR, (2012) 3538.
[33] T. Novikova, O. Barinova, P. Kohli, and V. Lempitsky, Largelexicon attribute-consistent text recognition in natural images, in Proc. 12th ECCV, (2012) 752.
[34] A. Mishra, K. Alahari, and C. V. Jawahar, Top-down and bottomup cues for scene text recognition, in Proc. IEEE CVPR, (2012) 2687.
[35] A. Ikica and P. Peer, An improved edge profile based method for text detection in images of natural scenes, in Proc. IEEE EUROCON, (2011) 1.
[36] R. Lienhart and F. Stuber, Automatic text recognition in digital videos, Dept. Math. Comput. Sci., Univ. Mannheim, Mannheim, Germany, Tech. Rep, (1995).
[37] Y. Zhong, K. Karu, and A. K. Jain, Locating text in complex color images, Pattern Recognit., 28(10) (1995) 1523–1535.
[38] V. Wu, R. Manmatha, and E. M. Riseman, Finding text in images, in Proc. 2nd ACM Int. Conf. Digital Libraries, (1997) 3.
[39] A. K. Jain and B. Yu, Automatic text location in images and video frames, Pattern Recognit., 31(12) (1998) 2055.
[40] V. Wu, R. Manmatha, and E. M. Riseman, Textfinder: An automatic system to detect and recognize text in images, IEEE Trans. Pattern Anal. Mach. Intell., 21(11) (1999) 1224–1229.
[41] Y. Zhong, H. Zhang, and A. K. Jain, Automatic caption localization in compressed video, IEEE Trans. Pattern Anal. Mach. Intell., 220(4) (2000) 385–392.
[42] H. Li, D. Doermann, and O. Kia, Automatic text detection and tracking in digital video, IEEE Trans. Image Process., 9(1) (2000) 147–156.
[43] Y.-F. Pan, X. Hou, and C.-L. Liu, A hybrid approach to detect and localize texts in natural scene images, IEEE Trans. Image Process., 20(3) (2011) 800–813.
[44] C. Yao, X. Zhang, X. Bai, W. Liu, Y. Ma, and Z. Tu, Rotationinvariant features for multi-oriented text detection in natural images, PLoS ONE, 8(8) (2013) 1-15.
[45] X.-C. Yin, X. Yin, K. Huang, and H.-W. Hao, Accurate and robust text detection: A step-in for text retrieval in natural scene images, in Proc. 36th SIGIR, (2013) 1091.
[46] D. Chen, J. Luettin, and K. Shearer, A survey of text detection and recognition in images and videos, Institut Dalle Molle d’ Intelligence Artificielle Perceptual, EPFL, Martigny, Switzerland, Tech. Rep. (2000) 1-21.
[47] K. Jung, K. Kim, and A. Jain, Text information extraction in images and video: A survey, Pattern Recognit., 37(5) (2004) 977– 997.
[48] J. Liang, D. Doermann, and H. Li, Camera-based analysis of text and documents: A survey, Int. J. Document Anal. Recognit., 7(2) (2005) 84–104.
[49] M Krishna Goriparthy and B Geetha Lakshmi, Balanced Islanding Detection of Distributed Generator using Discrete Wavelet Transform and Artificial Neural Network, International Journal of Engineering Trends and Technology, 69(10) (2021) 57-71.
[50] Shankargoud Patil and Kappargaon S. Prabhushetty, Bi-Attention LSTM with CNN based Multi-task Human Activity Detection in Video Surveillance, International Journal of Engineering Trends and Technology, 69(11) (2021) 192-204.
[51] Vencelin Gino V and Amit KR Ghosh, Enhancing Cyber Security Measures For Online Learning Platforms, IJETT International Journal of Computer Science and Engineering, 8(11) (2021) 1-5.
[52] Neha Chaudhary and Priti Dimri, Enhancing the Latent Fingerprint Segmentation Accuracy Using Hybrid Techniques – WCO and BiLSTM, International Journal of Engineering Trends and Technology,. 69(11) (2021) 161-169.
[53] Emmanoel Pratama Putra Hastono and Gede Putra Kusuma, Evaluation of Blockchain Model for Educational Certificate Using Continuous-Time Markov Chain, International Journal of Engineering Trends and Technology, 69(11) (2021) 61-70.
[54] Ningxia He, Image Sampling Using Q-Learning, IJETT International Journal of Computer Science and Engineering, 8(1) (2021) 5-12,.
[55] Walther Calvo-Niño, Brian Meneses-Claudio, and Alexi Delgado, Implementation of an FM Radio Station to Facilitate Remote Education in the District of Cojata, International Journal of Engineering Trends and Technology, 69(10) (2021) 118-127.
[56] Osama R.Shahin, Zeinab M. Abdel Azim, and Ahmed I Taloba, Maneuvering Digital Watermarking In Face Recognition , International Journal of Engineering Trends and Technology, 69(11) (2021) 104-112.
[57] Yasser Mohammad Al-Sharo, Amer Tahseen Abu-Jassar, Svitlana Sotnik, and Vyacheslav Lyashenko, Neural Networks As A Tool For Pattern Recognition of Fasteners, International Journal of Engineering Trends and Technology, 69(10) (2021) 151-160.
[58] Hoang Thi Phuong, Researching Robot Arms Control System Based On Computer Vision Application And Artificial Intelligence Technology, IJETT International Journal of Computer Science and Engineering, 8(1) (2021) 24-29.
[59] Nilesh Yadav and Dr. Narendra Shekokar, SQLI Detection Based on LDA Topic Model, International Journal of Engineering Trends and Technology, 69(11) (2021) 47-52.
[60] M. Ayaz Ahmad, Volodymyr Gorokhovatskyi, Iryna Tvoroshenko, Nataliia Vlasenko, and Syed Khalid Mustafa, The Research of Image Classification Methods Based on the Introducing Cluster Representation Parameters for the Structural Description, International Journal of Engineering Trends and Technology, 69(10) (2021) 186-192.