Movatterモバイル変換

Part of the book series:Lecture Notes in Computer Science ((LNCS,volume 14809))

Included in the following conference series:

International Conference on Document Analysis and Recognition

601Accesses

Abstract

In recent years, the development of Optical Music Recognition (OMR) has progressed significantly. However, music cultures with smaller communities have only recently been considered in this process. This results in a lack of adequate ground truth datasets needed for the development and benchmarking of OMR systems. In this work, the KuiSCIMA (Jiang Kui Score Images for Musicological Analysis) dataset is introduced. KuiSCIMA is the first machine-readable dataset of thesuzipu notations in Jiang Kui’s collectionBaishidaoren Gequ from 1202. Collected from five different woodblock print editions, the dataset contains 21797 manually annotated instances on 153 pages in total, from which 14500 are text character annotations, and 7297 aresuzipu notation symbols. The dataset comes with an open-source tool which allows editing, visualizing, and exporting the contents of the dataset files. In total, this contribution promotes the preservation and understanding of cultural heritage through digitization.

This is a preview of subscription content,log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

¥17,985 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Chapter: JPY 3498; Price includes VAT (Japan)

eBook: JPY 8465; Price includes VAT (Japan)

Softcover Book: JPY 10581; Price includes VAT (Japan)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Optical Music Recognition: Recent Advances, Current Challenges, and Future Directions

Museum Exhibit Identification Challenge for the Supervised Domain Adaptation and Beyond

Drawing the Line: Deep Segmentation for Extracting Art from Ancient Etruscan Mirrors

Notes

1.
The abbreviation MUSCIMA stands for MUsic SCore IMAges, being the inspiration for the name of the KuiSCIMA dataset.
2.
https://github.com/SuziAI/gui-tools/blob/main/json_schema_suzipu.json (taggedv2.0).
3.
https://github.com/SuziAI/KuiSCIMA/blob/main/annotation_remarks.pdf (taggedv1.0).
4.
https://github.com/SuziAI/SuziOMR/tree/main/baseline (taggedv1.0).
5.
The dataset extraction procedure is described inhttps://github.com/SuziAI/gui-tools/blob/main/readme_files/README_ANNOTATION_TOOL.md#extract-omr-dataset-from-corpus (taggedv2.0).
6.
https://github.com/SuziAI/KuiSCIMA (taggedv1.0).
7.
https://github.com/SuziAI/gui-tools (taggedv2.0).
8.
http://www.music-encoding.org.

References

Berten, O.: GregoBase: A database of Gregorian scores (2013).https://gregobase.selapa.net
Bradski, G.: The OpenCV library. Dr. Dobb’s J. Softw. Tools (2000)
Google Scholar
Calvo-Zaragoza, J., Toselli, A.H., Vidal, E.: Handwritten music recognition for mensural notation with convolutional recurrent neural networks. Pattern Recogn. Lett.128 (2019).https://doi.org/10.1016/j.patrec.2019.08.021
Chen, G.-F., Sheu, J.-S.: An optical music recognition system for traditional Chinese Kunqu Opera scores written in Gong-Che Notation. EURASIP J. Audio Speech Music Process. pp. 7–17.https://doi.org/10.1186/1687-4722-2014-7
Cheng, H., et al.: SCUT-CAB: a new benchmark dataset of ancient chinese books with complex layouts for document layout analysis, November 2022, pp. 436–451. ISBN 978-3-031-21647-3.https://doi.org/10.1007/978-3-031-21648-0_30
Cheng, Y.: Xi’an Guyue Xi’an old music in new China. ‘Living fossil’ or ‘flowing river’? Dissertation. School of Oriental and African Studies, University of London (2005).https://eprints.soas.ac.uk/29336/ 1/10731431.pdf. Accessed 03 Aug 2023
Fornés, A., et al.: CVC-MUSCIMA: a ground-truth of handwritten music score images for writer identification and staff removal. Int. J. Doc. Anal. Recogn.15(3), 243–251 (2012).https://doi.org/10.1007/s10032-011-0168-2.
Haji Jr., J., Pecina, P.: The MUSCIMA++ dataset for handwritten optical music recognition. In: 14th International Conference on Document Analysis and Recognition. ICDAR 2017, Kyoto, Japan, pp. 39–46 (2017)
Google Scholar
Joshi, P.: Fashion mNIST with Pytorch (93% accuracy) (2019).https://www.kaggle.com/code/pankajj/fashion-mnist-with-pytorch-93-accuracy. Accessed 10 Feb 2024
Lam, J.S.C.: Ci songs from the song dynasty: a Ménage à Trois of lyrics, music, and performance. New Liter. Hist.46(4), 623–646. (2015). ISSN 00286087, 1080661X.http://www.jstor.org/stable/24772762. Accessed 2 Aug 2023
Ma, W., et al.: Joint layout analysis, character detection and recognition for historical document digitization. In: 2020 17th International Conference on Frontiers in Handwriting Recognition (ICFHR), pp. 31– 36 (2020).https://doi.org/10.1109/ICFHR2020.2020.00017
Martinez-Sevilla, J.C., et al.: On the performance of optical music recognition in the absence of specific training data. In: Proceedings of the 24th International Society for Music Information Retrieval Conference (Milan, Italy). ISMIR, November 2023, pp. 319–326 (2023).https://doi.org/ 10.5281/zenodo.10265289
Repolusk, T., Veas, E.: The Suzipu musical annotation tool for the creation of machine-readable datasets of ancient Chinese music. In: Calvo-Zaragoza, J., Pacha, A., Shatri, E. (eds.) Proceedings of the 5th International Workshop on Reading Music Systems, Milan, Italy, pp. 7–11 (2023).https://doi.org/10.48550/arXiv.2311.04091.https://sites.google.com/view/worms2023/proceedings
Saini, R., et al.: ICDAR 2019 historical document reading challenge on large structured chinese family records. In: 2019 International Conference on Document Analysis and Recognition (ICDAR), pp. 1499–1504 (2019).https://doi.org/10.1109/ICDAR.2019.00241
Shen, T., et al.: Semantic recognition of common musical notes in Guqin score based on optimal statistical features. In: 4th International Conference on Advances in Computer Technology, Information Science and Communications (CTISC), pp. 1–4 (2022).https://doi.org/10.1109/CTISC54888.2022.9849792
Sturgeon, D.: Chinese Text Project (2011).https://ctext.org/library.pl. Accessed 30 June 2023
Sturgeon, D.: Large-scale optical character recognition of pre-modern Chinese texts. Int. J. Buddhist Thought Cult.28(2), 11–44 (2018)
Google Scholar
Tang, C.-W., Liu, C.-L., Chiu, P.-S.: HRCenterNet: an anchorless approach to Chinese character segmentation in historical documents. In: 2020 IEEE International Conference on Big Data (Big Data), pp. 1924–1930 (2020)
Google Scholar
West, A.C.: Musical notation for flute in Tangut manuscripts. In: Popova, I. (ed.) Tanguty v Central’noj Azii, pp. 443–454. Vostonaja literatura, Moskva (2012)
Google Scholar
Yang, H., et al.: Dense and tight detection of Chinese characters in historical documents: datasets and a recognition guided detector. IEEE Access6, 30174–30183 (2018).https://doi.org/10.1109/ACCESS.2018.2840218
Article Google Scholar
Yang, Y.: Plum blossom on the far side of the stream. The renaissance of Jiang Kui’s Lyric Oeuvre with facsimiles and a new critical edition of the songs of the Whitestone Daoist. Hong Kong University Press, Hong Kong (2019)
Google Scholar
Wu, S.. Songci Yinyue Zhuanti Yanjiu. Dissertation. Yangzhou University (2013)
Google Scholar
Kui, J.: Baishidaoren Gequ. (Ed. by, Zumou, Z.). Guian: Zhushi (1913)
Google Scholar
Kui, J.,. Baishidaoren Gequ. (Ed. by, Lu, Z.). reprinted in [16], [1743] (2011).https://ctext.org/library.pl?if=en&res=775747. Accessed 30 June 2023
Kui, J.. Baishidaoren Gequ. (Ed. by, Zhang, Y). reprinted in [21], pp. 259–323, [1749] (2019)
Google Scholar
Kui, J.. Baishidaoren Gequ. (Ed. by, Lu, Z., Min, H., Wang, Z.) reprinted in [21], pp. 193–254, [c.1736] (2019)
Google Scholar
Kui J.,. Baishidaoren Gequ. In: Siku Quanshu, vol. 1. reprinted in [16].https://ctext.org/library.pl?res=106386. Accessed 30 June 2023

Download references

Author information

Authors and Affiliations

Know-Center GmbH, Graz, Austria
Tristan Repolusk & Eduardo Veas

Authors

Tristan Repolusk
View author publications
You can also search for this author inPubMed Google Scholar
Eduardo Veas
View author publications
You can also search for this author inPubMed Google Scholar

Corresponding author

Correspondence toTristan Repolusk.

Editor information

Editors and Affiliations

Luleå Tekniska Universitet, Luleå, Sweden
Elisa H. Barney Smith
Luleå Tekniska Universitet, Luleå, Sweden
Marcus Liwicki
Tsinghua University, Beijing, China
Liangrui Peng

Ethics declarations

Disclosure of Interests

The authors have no competing interests to declare that are relevant to the content of this article.

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Repolusk, T., Veas, E. (2024). The KuiSCIMA Dataset for Optical Music Recognition of Ancient Chinese Suzipu Notation. In: Barney Smith, E.H., Liwicki, M., Peng, L. (eds) Document Analysis and Recognition - ICDAR 2024. ICDAR 2024. Lecture Notes in Computer Science, vol 14809. Springer, Cham. https://doi.org/10.1007/978-3-031-70552-6_3

Download citation

DOI:https://doi.org/10.1007/978-3-031-70552-6_3
Published:11 September 2024
Publisher Name:Springer, Cham
Print ISBN:978-3-031-70551-9
Online ISBN:978-3-031-70552-6
eBook Packages:Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

The International Association for Pattern Recognition (opens in a new tab)

Movatterモバイル変換

The KuiSCIMA Dataset for Optical Music Recognition of Ancient Chinese Suzipu Notation

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Optical Music Recognition: Recent Advances, Current Challenges, and Future Directions

Museum Exhibit Identification Challenge for the Supervised Domain Adaptation and Beyond

Drawing the Line: Deep Segmentation for Extracting Art from Ancient Etruscan Mirrors

Notes

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Ethics declarations

Disclosure of Interests

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Societies and partnerships

Access this chapter

Subscribe and save

Buy Now