A Mathematical Indexing Method Based on the Hierarchical Features of Operators in Formulae
- DOI
- 10.2991/icacie-17.2017.11How to use a DOI?
- Keywords
- mathematical expression retrieval; index; hierarchical features; operators
- Abstract
Full text search engines widely used today still have no math searching function, which brings inconvenience for people finding their scientific documents with mathematical query words. It is necessary to research and develop the theory and technology of mathematical expression retrieval. This paper proposed an index model of mathematical expressions for realizing math retrieval through analyzing the characteristics of formulae. Firstly, the FDS data was obtained from the formulae expressed in LaTeX description with recursive analysis. Then, the index features including the level and location features of operators were extracted from the FDS data of formulae. Finally, the extracted features were used to construct a feature vector for dividing formulae into several classes and the math index was constructed for the classes respectively. The experiment was carried out on 134199 formulae and the result shows its effectiveness for improving the efficiency of mathematical expression retrieval.
- Copyright
- © 2017, the Authors. Published by Atlantis Press.
- Open Access
- This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).
Cite this article
TY - CONF AU - Xuedong Tian PY - 2017/08 DA - 2017/08 TI - A Mathematical Indexing Method Based on the Hierarchical Features of Operators in Formulae BT - Proceedings of the 2017 2nd International Conference on Automatic Control and Information Engineering (ICACIE 2017) PB - Atlantis Press SP - 49 EP - 52 SN - 2352-5401 UR - https://doi.org/10.2991/icacie-17.2017.11 DO - 10.2991/icacie-17.2017.11 ID - Tian2017/08 ER -