- Papangkorn Inkeaw1,
- Jakramate Bootkrajang1,
- Phasit Charoenkwan2,
- Sanparith Marukatat3,
- Shinn-Ying Ho4,5 &
- …
- Jeerayut Chaijaruwanich1
841Accesses
8Citations
Abstract
Character segmentation is an important task in optical character recognition (OCR). The quality of any OCR system is highly dependent on character segmentation algorithm. Despite the availability of various character segmentation methods proposed to date, existing methods cannot satisfyingly segment characters belonging to some complex writing styles such as the Lanna Dhamma characters. In this paper, a new character segmentation method named graph partitioning-based character segmentation is proposed to address the problem. The proposed method can deal with multi-level writing style as well as touching and broken characters. It is considered as a generalization of existing approaches to multi-level writing style. The proposed method consists of three phases. In the first phase, a newly devised over-segmentation technique based on morphological skeleton is used to obtain redundant fragments of a word image. The fragments are then used to form a segmentation hypotheses graph. In the last phase, the hypotheses graph is partitioned into subgraphs each corresponding to a segmented character using the partitioning algorithm developed specifically for character segmentation purpose. Experimental results based on handwritten Lanna Dhamma characters datasets showed that the proposed method achieved high correct segmentation rate and outperformed existing methods for the Lanna Dhamma alphabet.
This is a preview of subscription content,log in via an institution to check access.
Access this article
Subscribe and save
- Get 10 units per month
- Download Article/Chapter or eBook
- 1 Unit = 1 Article or 1 Chapter
- Cancel anytime
Buy Now
Price includes VAT (Japan)
Instant access to the full article PDF.


















Similar content being viewed by others
References
Inkeaw, P., Chueaphun, C., Chaijaruwanich, J., Klomsae, A., Marukatat, S.: Lanna Dharma handwritten character recognition on palm leaves manuscript based on Wavelet transform. In: 2015 IEEE International Conference on Signal and Image Processing Applications (ICSIPA), 19–21, pp 253–258 (2015)
Inkeaw, P., Charoenkwan, P., Huang, H.-L., Marukatat, S., Ho, S.-Y., Chaijaruwanich, J.: Recognition of handwritten Lanna Dhamma characters using a set of optimally designed moment features. IJDAR20(4), 259–274 (2017)
Thammano, A., Pravesjit, S.: Recognition of archaic Lanna handwritten manuscripts using a hybrid bio-inspired algorithm. Memet. Comput.7(1), 3–17 (2015)
Casey, R.G., Lecolinet, E.: A survey of methods and strategies in character segmentation. IEEE Trans. Pattern Anal. Mach. Intell.18(7), 690–706 (1996)
Shi, Z., Govindaraju, V.: Segmentation and recognition of connected handwritten numeral strings. Pattern Recogn.30(9), 1501–1504 (1997)
Elnagar, A., Alhajj, R.: Segmentation of connected handwritten numeral strings. Pattern Recogn.36(3), 625–634 (2003)
Pal, U., Belad, A., Choisy, C.: Touching numeral segmentation using water reservoir concept. Pattern Recogn. Lett.24(1–3), 261–272 (2003)
Pravesjit, S., Thammano, A.: Segmentation of historical Lanna handwritten manuscripts. In: 2012 6th IEEE International Conference Intelligent Systems, 6–8, pp 332–337 (2012)
Ribas, F.C., Oliveira, L.S., Britto, A.S., Sabourin, R.: Handwritten digit segmentation: a comparative study. IJDAR16(2), 127–137 (2013)
Kovalevski, V.A.: Character Readers and Pattern Recognition. Spartan Books, Washington (1968)
Casey, R.G., Nagy, G.: Recursive segmentation and classification of composite character patterns. In: Proceedings of Sixth International Conference on Pattern Recognition (1982)
Elagouni, K., Garcia, C., Mamalet, F., Sebillot, P.: Combining multi-scale character recognition and linguistic knowledge for natural scene text OCR. In: 2012 10th IAPR International Workshop on Document Analysis Systems, 27–29, pp 120–124 (2012)
Fujisawa, H., Nakano, Y., Kurino, K.: Segmentation methods for character recognition: from segmentation to document structure analysis. Proc. IEEE80(7), 1079–1092 (1992)
Xiu, P., Peng, L., Ding, X., Wang, H.: Offline handwritten Arabic character segmentation with probabilistic model. In: Bunke, H., Spitz, A.L. (eds) Proceedings of Document Analysis Systems VII: 7th International Workshop, DAS 2006, Nelson, New Zealand, February 13–15, 2006. Springer, Berlin, pp. 402–412 (2006)
Oliveira, L.S., Sabourin, R., Bortolozzi, F., Suen, C.Y.: Automatic recognition of handwritten numerical strings: a recognition and verification strategy. IEEE Trans. Pattern Anal. Mach. Intell.24(11), 1438–1454 (2002)
Chatchinarat, A.: Thai handwritten segmentation using proportional invariant recognition technique. In: 2009 International Conference on Future Computer and Communication, 3–5, pp. 283–287 (2009)
Chen, Y.-K., Wang, J.-F.: Segmentation of single- or multiple-touching handwritten numeral string using background and foreground analysis. IEEE Trans. Pattern Anal. Mach. Intell.22(11), 1304–1317 (2000)
Fenrich, R., Krishnamoorthy, K.: Segmentation diverse quality handwritten digit strings in near real-time. In: the 5th USPS Advance Technology Conference, pp. 523–537 (1990)
Ji, J., Peng, L., Li, B.: Graph model optimization based historical Chinese character segmentation method. In: 2014 11th IAPR International Workshop on Document Analysis Systems, 7–10, pp 282–286 (2014)
Stentiford, F.W.M., Mortimer, R.G.: Some new heuristics for thinning binary handprinted characters for OCR. IEEE Trans. Syst. Man Cybern.13(1), 81–84 (1983)
Jang, B.K., Chin, R.T.: One-pass parallel thinning: analysis, properties, and quantitative evaluation. IEEE Trans. Pattern Anal. Mach. Intell.14(11), 1129–1140 (1992)
Chen, W., Sui, L., Xu, Z., Lang, Y.: Improved Zhang–Suen thinning algorithm in binary line drawing applications. In: 2012 International Conference on Systems and Informatics (ICSAI2012), 19–20, pp. 1947–1950 (2012)
Juneam, N., Kantabutra, S.: Fast and efficient parallel coarsest refinement. Fundam. Inform.150(2), 211–220 (2017)
Ping, Z., Lihui, C.: A novel feature extraction method and hybrid tree classification for handwritten numeral recognition. Pattern Recogn. Lett.23(1), 45–56 (2002)
Kamranian, Z., Monadjemi, S.A., Nematbakhsh, N.: A novel free format Persian/Arabic handwritten zip code recognition system. Comput. Electr. Eng.39(7), 1970–1979 (2013)
Basu, S., Das, N., Sarkar, R., Kundu, M., Nasipuri, M., Basu, D.K.: A hierarchical approach to recognition of handwritten Bangla characters. Pattern Recogn.42(7), 1467–1484 (2009)
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’05), 25–25, vol. 881, pp. 886–893 (2005)
Kim, J., Hwang, I., Kim, Y.-H., Moon, B.-R.: Genetic approaches for graph partitioning: a survey. In: Paper presented at the Proceedings of the 13th Annual Conference on Genetic and Evolutionary Computation, Dublin, Ireland (2011)
Klomsae, A.: Image feature extraction for Lanna Dharma handwritten character recognition. Master Thesis, Chiang Mai University, Thailand (2012)
Lanna Digital Archives. (2013) Chiang Mai University.http://library.cmu.ac.th/lanna_ebook/. Accessed 9 June 2016
McLachlan, G.J.: Discriminant analysis and statistical pattern recognition. Wiley series in probability and mathematical statistics. Applied probability and statistics; Wiley series in probability and mathematical statistics. Applied probability and statistics. Wiley, New York (1992)
Haykin, S.S.: Neural Networks and Learning Machines, 3rd edn. Prentice Hall, New York (2009)
Fan, R.-E., Chang, K.-W., Hsieh, C.-J., Wang, X.-R., Lin, C.-J.: LIBLINEAR: a library for large linear classification. J. Mach. Learn. Res.9, 1871–1874 (2008)
Freund, Y., Schapire, R.E.: A decision-theoretic generalization of on-line learning and an application to boosting. J. Comput. Syst. Sci.55(1), 119–139 (1997)
Acknowledgements
This study was funded under the Royal Golden Jubilee Ph.D. Program by the Thailand Research Fund (Grant No. PHD/0185/2556). We would like to thank Chiang Mai University, Thailand, for financial support and collection of digital Lanna archives. We also thank National Chiao Tung University, Taiwan, for supporting this work in the Intelligent Computing Laboratory.
Author information
Authors and Affiliations
Department of Computer Science, Faculty of Science, Chiang Mai University, Chiang Mai, Thailand
Papangkorn Inkeaw, Jakramate Bootkrajang & Jeerayut Chaijaruwanich
College of Arts, Media and Technology, Chiang Mai University, Chiang Mai, Thailand
Phasit Charoenkwan
National Electronics and Computer Technology Center, Pathum Thani, Thailand
Sanparith Marukatat
Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu, Taiwan
Shinn-Ying Ho
Department of Biological Science and Technology, National Chiao Tung University, Hsinchu, Taiwan
Shinn-Ying Ho
- Papangkorn Inkeaw
You can also search for this author inPubMed Google Scholar
- Jakramate Bootkrajang
You can also search for this author inPubMed Google Scholar
- Phasit Charoenkwan
You can also search for this author inPubMed Google Scholar
- Sanparith Marukatat
You can also search for this author inPubMed Google Scholar
- Shinn-Ying Ho
You can also search for this author inPubMed Google Scholar
- Jeerayut Chaijaruwanich
You can also search for this author inPubMed Google Scholar
Corresponding author
Correspondence toJeerayut Chaijaruwanich.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Inkeaw, P., Bootkrajang, J., Charoenkwan, P.et al. Recognition-based character segmentation for multi-level writing style.IJDAR21, 21–39 (2018). https://doi.org/10.1007/s10032-018-0302-5
Received:
Revised:
Accepted:
Published:
Issue Date:
Share this article
Anyone you share the following link with will be able to read this content:
Sorry, a shareable link is not currently available for this article.
Provided by the Springer Nature SharedIt content-sharing initiative