Proceedings of the 3rd International Conference on Electric and Electronics

A Re-ranking Method Based on Tag-Topic Model

Authors
Maoyuan Zhang, Shuiyin Chen, Fanli He
Corresponding Author
Maoyuan Zhang
Available Online December 2013.
DOI
10.2991/eeic-13.2013.36How to use a DOI?
Keywords
Information Retrieval; Re-ranking; LDA;Tag-topic model
Abstract

We describe a novel approach to improve the accuracy of the IR which combines the tag-topic model with natural language processing. Tag-topic model extended the Latent Dirichlet We describe a novel approach to improve the accuracy of the IR which combines the tag-topic model with natural language processing. We get tag though semantic fingerprint. Tag-topic model extended the Latent Dirichlet Allocation(LDA) model by adding the data set with 901446 documents, and train the model with different number of topics, we can obtain three important distributions: the document-tag distribution, the tag-topic distribution and the topic-word distribution by using the Tag-topic model. Then through the matrix operation we can get the tag-word distribution which quantify the importance of each word of the document. Finally, based on this distribution these documents are re-ranked. Experiments on NTCIR-5 document collection for SLIR(Single Language IR) show that this method achieves an 13.6% and 19.6% improvement comparing to the initial retrieval method without any re-ranking.

Copyright
© 2013, the Authors. Published by Atlantis Press.
Open Access
This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).

Download article (PDF)

Volume Title
Proceedings of the 3rd International Conference on Electric and Electronics
Series
Advances in Intelligent Systems Research
Publication Date
December 2013
ISBN
978-90786-77-92-5
ISSN
1951-6851
DOI
10.2991/eeic-13.2013.36How to use a DOI?
Copyright
© 2013, the Authors. Published by Atlantis Press.
Open Access
This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).

Cite this article

TY  - CONF
AU  - Maoyuan Zhang
AU  - Shuiyin Chen
AU  - Fanli He
PY  - 2013/12
DA  - 2013/12
TI  - A Re-ranking Method Based on Tag-Topic Model
BT  - Proceedings of the 3rd International Conference on Electric and Electronics
PB  - Atlantis Press
SP  - 154
EP  - 157
SN  - 1951-6851
UR  - https://doi.org/10.2991/eeic-13.2013.36
DO  - 10.2991/eeic-13.2013.36
ID  - Zhang2013/12
ER  -