Proceedings of the 2015 International Conference on Computer Science and Intelligent Communication

Text Mining and Its Applications

Authors
Shengyu Guo, Buyang Cao
Corresponding Author
Shengyu Guo
Available Online July 2015.
DOI
10.2991/csic-15.2015.17How to use a DOI?
Keywords
Information retrieval, Log analysis, Text mining, TF-IDF
Abstract

As nowadays data centers are processing more jobs and collecting more data, the system status monitoring and analyzing functionality ensuring the availability, scalability and efficiency becomes more and more important. In order to build an automated status monitoring and alerting system we need to group jobs performed at a data center upon jobs’ characteristics. Since the job names are generated by system users at will, it is very hard to group them in order to monitor the job status efficiently. Thus we need to find some methods to sort out the system log, and help to group jobs that are beneficial for improving the accuracy and efficiency of the system analysis. This paper proposes a text mining algorithm and its application in grouping jobs for log analysis.

Copyright
© 2015, the Authors. Published by Atlantis Press.
Open Access
This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).

Download article (PDF)

Volume Title
Proceedings of the 2015 International Conference on Computer Science and Intelligent Communication
Series
Advances in Computer Science Research
Publication Date
July 2015
ISBN
978-94-62520-84-4
ISSN
2352-538X
DOI
10.2991/csic-15.2015.17How to use a DOI?
Copyright
© 2015, the Authors. Published by Atlantis Press.
Open Access
This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).

Cite this article

TY  - CONF
AU  - Shengyu Guo
AU  - Buyang Cao
PY  - 2015/07
DA  - 2015/07
TI  - Text Mining and Its Applications
BT  - Proceedings of the 2015 International Conference on Computer Science and Intelligent Communication
PB  - Atlantis Press
SP  - 72
EP  - 78
SN  - 2352-538X
UR  - https://doi.org/10.2991/csic-15.2015.17
DO  - 10.2991/csic-15.2015.17
ID  - Guo2015/07
ER  -