An Architecture for Unstructured Data Management
- DOI
- 10.2991/iccia.2012.109How to use a DOI?
- Keywords
- unstructured data, classification, storage
- Abstract
As the information age is coming, there is a vast amount of information available in the Internet. Most of data on Web are unstructured. But the significant data should be organized and stored in a suitable way for future purposes. One of the unsolved problems is the management of unstructured data. The unstructured data such as presentation, spreadsheet, text document, memo, images and web pages are difficult to manage while the data become a large scale and the users have different requirements and interests. In this paper, we proposed an architecture for unstructured data management by integrating source query, data collection and data management to solve these problems. The data collection layer extracts the data we care about, we use the existing tools to extract automatic and we can also add the data to the repository manually. The data management layer manage all the collection data by classifying the data, selecting nodes to store and managing centralized as index. The source query layer allows users to query and get the data diversity according the adaptive query service and recommendation service. Finally, we implemented a prototype system OCourse based on this system architecture to show it’s feasible and efficient.
- Copyright
- © 2013, the Authors. Published by Atlantis Press.
- Open Access
- This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).
Cite this article
TY - CONF AU - Yaohu Lin AU - Xuelian Lin PY - 2014/05 DA - 2014/05 TI - An Architecture for Unstructured Data Management BT - Proceedings of the 2012 2nd International Conference on Computer and Information Application (ICCIA 2012) PB - Atlantis Press SP - 454 EP - 457 SN - 1951-6851 UR - https://doi.org/10.2991/iccia.2012.109 DO - 10.2991/iccia.2012.109 ID - Lin2014/05 ER -