Volume 7, Issue 3, July 2019, Pages 100 - 106
Semantic Schema Matching for String Attribute with Word Vectors and its Evaluation
Authors
Kenji Nozaki1, *, Teruhisa Hochin2, Hiroki Nomiya2
1Graduate School of Information Science, Kyoto Institute of Technology, Matsugasaki, Sakyo-ku, Kyoto 606-8585, Japan
2Faculty of Information and Human Sciences, Kyoto Institute of Technology, Matsugasaki, Sakyo-ku, Kyoto 606-8585, Japan
*Corresponding author. Email: ken23mybgbc2@gmail.com
Corresponding Author
Kenji Nozaki
Received 7 February 2019, Accepted 6 May 2019, Available Online 23 July 2019.
- DOI
- 10.2991/ijndc.k.190710.001How to use a DOI?
- Keywords
- Instance-based schema matching; schema matching; semantic matching; Word2Vec
- Abstract
Instance-based schema matching is to determine the correspondences between heterogeneous databases by comparing instances. Heterogeneous databases consist of an enormous number of tables containing various attributes, causing the data heterogeneity. In such cases, it is effective to consider semantic information. In this paper, we propose the instance-based schema matching considering attributes’ semantics. We used Word2Vec to match attributes of character strings. The result shows a possibility to detect matching between attributes with high semantic similarity.
- Copyright
- © 2019 The Authors. Published by Atlantis Press SARL.
- Open Access
- This is an open access article distributed under the CC BY-NC 4.0 license (http://creativecommons.org/licenses/by-nc/4.0/).
Download article (PDF)
View full text (HTML)
Cite this article
TY - JOUR AU - Kenji Nozaki AU - Teruhisa Hochin AU - Hiroki Nomiya PY - 2019 DA - 2019/07/23 TI - Semantic Schema Matching for String Attribute with Word Vectors and its Evaluation JO - International Journal of Networked and Distributed Computing SP - 100 EP - 106 VL - 7 IS - 3 SN - 2211-7946 UR - https://doi.org/10.2991/ijndc.k.190710.001 DO - 10.2991/ijndc.k.190710.001 ID - Nozaki2019 ER -