Identification Methods for Bibliographic Reference Entries in Editable Scientific and Technical Documents
- DOI
- 10.2991/cset-16.2016.42
- Keywords
- Bibliographic reference format regularization, format checking, text description segmentation method, extraction and identification
- Abstract
Based on an in-depth analysis of the characteristics and rules of bibliographic descriptions and of the way the OOXML format records Word documents, this paper studies the reference extraction process, text description segmentation methods, and identification and determination methods for editable scientific and technical documents, and proposes checking reference format through a process of reference extraction, segmentation, and identification. Experiments indicate that combining the regular expression method, the fuzzy longest common subsequence matching method, and other methods improves the accuracy of identifying bibliographic description entries. This work is important for automatic reference format checking and reference format regularization.
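The abstract names regular expressions and fuzzy longest common subsequence (LCS) matching but gives no implementation details. The following is a minimal Python sketch of how the two techniques might be combined. It is not the authors' method. The type-marker pattern, the normalized score (LCS length divided by the longer string's length), the 0.8 threshold, and all function names are assumptions made for illustration.

```python
import re
from typing import Optional, Sequence

# Assumed pattern: a document-type marker such as [J], [C] or [M],
# as used in some bibliographic description styles.
TYPE_MARKER = re.compile(r"\[([A-Z]{1,2})\]")


def lcs_length(a: str, b: str) -> int:
    """Length of the longest common subsequence (row-by-row dynamic programming)."""
    prev = [0] * (len(b) + 1)
    for ch_a in a:
        curr = [0]
        for j, ch_b in enumerate(b, start=1):
            if ch_a == ch_b:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def lcs_similarity(a: str, b: str) -> float:
    """Assumed fuzzy score in [0, 1]: LCS length over the longer string's length."""
    if not a or not b:
        return 0.0
    a, b = a.lower(), b.lower()
    return lcs_length(a, b) / max(len(a), len(b))


def best_match(text: str, candidates: Sequence[str],
               threshold: float = 0.8) -> Optional[str]:
    """Return the candidate most similar to text, or None if below the threshold."""
    scored = [(lcs_similarity(text, c), c) for c in candidates]
    score, candidate = max(scored, default=(0.0, None))
    return candidate if score >= threshold else None


def find_type_marker(entry: str) -> Optional[str]:
    """Return the letter code inside the first type marker, if any."""
    match = TYPE_MARKER.search(entry)
    return match.group(1) if match else None
```

In this sketch the regular expression handles fields with a rigid surface form, while the LCS score tolerates small spelling deviations when comparing an extracted segment against a list of known names, for example `best_match("Jounal of Software", ["Journal of Software", "Computer Science"])`.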
- Copyright
- © 2016, the Authors. Published by Atlantis Press.
- Open Access
- This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).
Cite this article
TY - CONF
AU - Yang Lei
AU - Ying-ai Tian
AU - Ning Li
AU - Qi Liang
AU - Wei Zhao
PY - 2016/08
DA - 2016/08
TI - Identification Methods for Bibliographic Reference Entries in Editable Scientific and Technical Documents
BT - Proceedings of the 2016 International Conference on Computer Science and Electronic Technology
PB - Atlantis Press
SP - 172
EP - 178
SN - 2352-538X
UR - https://doi.org/10.2991/cset-16.2016.42
DO - 10.2991/cset-16.2016.42
ID - Lei2016/08
ER -