Proceedings of the 2024 10th International Conference on Humanities and Social Science Research (ICHSSR 2024)

A Study of Data Cleaning in Northwest Folk Songs

Authors
Chengqi Dai1, *, Yan Xu1, Xin Wen1
1School of Information Science, Beijing Language and Culture University, Beijing, 100083, China
*Corresponding author. Email: dd04236979@163.com
Corresponding Author
Chengqi Dai
Available Online 2 September 2024.
DOI
10.2991/978-2-38476-277-4_115How to use a DOI?
Keywords
digital humanities; northwest folk songs; digitization
Abstract

The arrival of the digital modernization era has brought a new paradigm of digital humanities research to the humanities, and the integration of digital data, the construction of corpus and the combination of computer analysis and processing have become an important part of digital humanities research. In order to solve the problems of complicated data format and low information content of the Northwest folk song “Hua’er”, Python tools are used to clean the data, and through noise reduction, mean filtering, corrosion and other means, the text recognition degree of the picture material is improved, and cleaner data is obtained, which lays the foundation for the establishment of the database of “Hua’er” and lays the foundation for the establishment of the database of “Hua’er”. It lays the foundation for the establishment of the “flower” database, and provides a referable program for the preprocessing of folk song data.

Copyright
© 2024 The Author(s)
Open Access
Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

Download article (PDF)

Volume Title
Proceedings of the 2024 10th International Conference on Humanities and Social Science Research (ICHSSR 2024)
Series
Advances in Social Science, Education and Humanities Research
Publication Date
2 September 2024
ISBN
978-2-38476-277-4
ISSN
2352-5398
DOI
10.2991/978-2-38476-277-4_115How to use a DOI?
Copyright
© 2024 The Author(s)
Open Access
Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

Cite this article

TY  - CONF
AU  - Chengqi Dai
AU  - Yan Xu
AU  - Xin Wen
PY  - 2024
DA  - 2024/09/02
TI  - A Study of Data Cleaning in Northwest Folk Songs
BT  - Proceedings of the 2024 10th International Conference on Humanities and Social Science Research (ICHSSR 2024)
PB  - Atlantis Press
SP  - 1028
EP  - 1038
SN  - 2352-5398
UR  - https://doi.org/10.2991/978-2-38476-277-4_115
DO  - 10.2991/978-2-38476-277-4_115
ID  - Dai2024
ER  -