Modified LDA Vector and Feedback Analysis for Short Query Information Retrieval Systems

Logic Journal of the IGPL (forthcoming)
  Copy   BIBTEX

Abstract

Information Retrieval systems benefit from the use of long queries containing a large volume of search-relevant information. This situation is not common, as users of such systems tend to use very short and precise queries with few keywords. In this work we propose a modification of the Latent Dirichlet Allocation (LDA) technique using data from the document collection and its vocabulary for a better representation of short queries. Additionally, a study is carried out on how the modification of the proposed LDA weighted vectors increase the performance using relevant documents as feedback. The work shown in this paper is tested using three biomedical corpora (TREC Genomics 2004, TREC Genomics 2005 and OHSUMED) and one legal corpus (FIRE 2017). Results prove that the application of the proposed representation technique, as well as the feedback adjustment, clearly outperforms the baseline methods (BM25 and non-modified LDA).

Links

PhilArchive



    Upload a copy of this work     Papers currently archived: 92,369

External links

Setup an account with your affiliations in order to access resources via your University's proxy server

Through your library

Similar books and articles

概念間の関連度に基づく情報ランク付けを用いた知的検索手法.小島 一秀 藤井 啓彰 - 2002 - Transactions of the Japanese Society for Artificial Intelligence 17:684-689.
トランスダクティブ学習による最小文書判定からのクエリ拡張.山田 誠二 岡部 正幸 - 2006 - Transactions of the Japanese Society for Artificial Intelligence 21 (4):398-405.
Agent Community based Peer-to-Peer Information Retrieval.Matsuno Daisuke Mine Tsunenori - 2004 - Transactions of the Japanese Society for Artificial Intelligence 19:421-428.
Information Technology-Based Patent Retrieval Models.Carson Leung, Wookey Lee & Justin Jongsu Song - 2019 - In Wolfgang Glänzel, Henk F. Moed, Ulrich Schmoch & Mike Thelwall (eds.), Springer Handbook of Science and Technology Indicators. Springer Verlag. pp. 859-874.
Innovative techniques for legal text retrieval.Marie-Francine Moens - 2001 - Artificial Intelligence and Law 9 (1):29-57.
Querying linguistic treebanks with monadic second-order logic in linear time.Stephan Kepser - 2004 - Journal of Logic, Language and Information 13 (4):457-470.

Analytics

Added to PP
2024-05-06

Downloads
3 (#1,716,608)

6 months
3 (#984,149)

Historical graph of downloads
How can I increase my downloads?

Citations of this work

No citations found.

Add more citations

References found in this work

No references found.

Add more references