Improvement Research of the Software of Transforming Semi-Structured Html File into Structured Text File
- DOI
- 10.2991/icmse-17.2017.60How to use a DOI?
- Keywords
- Big data; html file; text file;Expert system shell Pro/3;File scan and transformation;Java
- Abstract
An application research work had improved some functions of a file scan and transformation software (FileScanner) in Pro/3(an expert system shell) by exploring transformation of semi-structured files (Html format ) into structured text files. Some existing problems such as line feed failure and Chinese characters incorrectly displaying in the result file transformed had been solved by improving its Java programming. After a .Html format file be scanned and transformed,a .txt format file produced can implement effectively line feed when it is directly opened, and can display correctly Chinese characters. The structured text file transformed can directly interact with other application programs or databases so as to facilitate the analysis of semi-structured data and mining some values of the information behind the data.
- Copyright
- © 2017, the Authors. Published by Atlantis Press.
- Open Access
- This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).
Cite this article
TY - CONF AU - Qiming Cui AU - Xue Wang AU - Guodong Chen AU - Yongbin Zhao AU - Bo Li AU - Yi Ning AU - Shuting Cui AU - Zirong Zhang AU - Rui Zhao AU - Hongyu Meng AU - Yao Zhang AU - Zhenqiang Fu PY - 2017/04 DA - 2017/04 TI - Improvement Research of the Software of Transforming Semi-Structured Html File into Structured Text File BT - Proceedings of the 2017 7th International Conference on Manufacturing Science and Engineering (ICMSE 2017) PB - Atlantis Press SP - 323 EP - 327 SN - 2352-5401 UR - https://doi.org/10.2991/icmse-17.2017.60 DO - 10.2991/icmse-17.2017.60 ID - Cui2017/04 ER -