Proceedings of the 2023 3rd International Conference on Business Administration and Data Science (BADS 2023)

A Digital Management System for Archive Resources Based on Cloud Storage in the New Media Era

Authors
Qi Yang¹, Jinghuan Zhu¹, Shilong Wang¹,*
¹Guangxi Science & Technology Normal University, Laibin, China
*Corresponding author. Email: wangshilong@gxstnu.edu.cn
Corresponding Author
Shilong Wang
Available Online 30 December 2023.
DOI
10.2991/978-94-6463-326-9_33
Keywords
Cloud storage; Digitization of archives; Management system
Abstract

To address incomplete information extraction during the digital conversion of archives, this paper describes a digital management system built from six units. The preset area scanning unit activates the OCR scanner of the paper scanning device to scan a preset area when the user operation interface receives an archive upload command, yielding a first OCR scan file that carries table distribution characteristics and text distribution characteristics. The archive template matching unit applies an archive template matching algorithm to these table and text distribution characteristics to generate an archive matching template. The template comparison unit compares the archive matching template with the first OCR scan file to obtain the missing attribute information and the distribution positions of the missing attributes. The local compensation scanning unit activates the OCR scanner to perform a local compensation scan, guided by the missing attribute information and the positions of the missing attributes, and generates a second OCR scan file. The archive retrieval result unit synchronizes the second OCR scan file, the archive matching template, and the project ID information to the cloud server; using the project ID information and the archive matching template, a secondary retrieval is performed in the cloud storage repository embedded in the cloud server to generate digital archive retrieval results. Finally, the archive classification result unit activates the archive classification module to adjust the timing of the second OCR scan file and the digital archive retrieval results, generates archive classification results, and updates the cloud storage repository.
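The template comparison and local compensation steps described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the names `Region`, `Template`, `find_missing`, and `compensate`, the field names, and the stand-in `fake_rescan` function are all hypothetical, and a real system would call an actual OCR scanner where the sketch calls `rescan`. The sketch assumes the matched template lists each expected attribute together with the page region where it should appear.

```python
from dataclasses import dataclass
from typing import Callable, Dict


@dataclass(frozen=True)
class Region:
    """Rectangular page area (hypothetical pixel coordinates)."""
    x: int
    y: int
    width: int
    height: int


@dataclass
class Template:
    """Hypothetical archive matching template: attribute name -> expected region."""
    name: str
    fields: Dict[str, Region]


def find_missing(template: Template, first_scan: Dict[str, str]) -> Dict[str, Region]:
    """Compare the template with the first scan.

    Returns the attributes that are absent or blank in the first scan,
    together with the region where each one is expected.
    """
    return {
        attr: region
        for attr, region in template.fields.items()
        if not first_scan.get(attr, "").strip()
    }


def compensate(
    first_scan: Dict[str, str],
    missing: Dict[str, Region],
    rescan: Callable[[Region], str],
) -> Dict[str, str]:
    """Rescan only the missing regions and merge the text into a second scan.

    `rescan` stands in for the OCR scanner; the first scan is left unmodified.
    """
    second_scan = dict(first_scan)
    for attr, region in missing.items():
        text = rescan(region).strip()
        if text:
            second_scan[attr] = text
    return second_scan


# Illustrative data only; none of these values come from the paper.
template = Template(
    name="example_form",
    fields={
        "name": Region(10, 10, 200, 30),
        "id_number": Region(10, 50, 200, 30),
        "date": Region(10, 90, 200, 30),
    },
)
first_scan = {"name": "Example Name", "id_number": ""}


def fake_rescan(region: Region) -> str:
    """Stand-in for a local OCR rescan, keyed by the region's y offset."""
    return {50: "A-1024", 90: "2023-12-30"}.get(region.y, "")


missing = find_missing(template, first_scan)
second_scan = compensate(first_scan, missing, fake_rescan)
```

Restricting the second pass to the regions returned by `find_missing` mirrors the abstract's idea of a local compensation scan: only the areas where attributes were not extracted are scanned again, rather than the whole page.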

Copyright
© 2023 The Author(s)
Open Access
Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.


Volume Title
Proceedings of the 2023 3rd International Conference on Business Administration and Data Science (BADS 2023)
Series
Atlantis Highlights in Computer Sciences
Publication Date
30 December 2023
ISBN
978-94-6463-326-9
ISSN
2589-4900

Cite this article

TY  - CONF
AU  - Qi Yang
AU  - Jinghuan Zhu
AU  - Shilong Wang
PY  - 2023
DA  - 2023/12/30
TI  - A Digital Management System for Archive Resources Based on Cloud Storage in the New Media Era
BT  - Proceedings of the 2023 3rd International Conference on Business Administration and Data Science (BADS 2023)
PB  - Atlantis Press
SP  - 312
EP  - 319
SN  - 2589-4900
UR  - https://doi.org/10.2991/978-94-6463-326-9_33
DO  - 10.2991/978-94-6463-326-9_33
ID  - Yang2023
ER  -