A semantically enhanced text retrieval framework with abstractive summarization-Reference-Cited by-同舟云学术

A semantically enhanced text retrieval framework with abstractive summarization

Published:2023-09-28 Issue: Volume: Page:
ISSN:0824-7935
Container-title:Computational Intelligence
language:en
Short-container-title:Computational Intelligence

Author:

Pan Min¹²,Li Teng¹,Liu Yu¹,Pei Quanli²,Huang Ellen Anne³,Huang Jimmy X.²^ORCID

Affiliation:

1. School of Computer and Information Engineering Hubei Normal University Huangshi China

2. Information Retrieval and Knowledge Management Research Lab, School of Information Technology York University Toronto Canada

3. Department of Computer Science Western University London Canada

Abstract

AbstractRecently, large pretrained language models (PLMs) have led a revolution in the information retrieval community. In most PLMs‐based retrieval frameworks, the ranking performance broadly depends on the model structure and the semantic complexity of the input text. Sequence‐to‐sequence generative models for question answering or text generation have proven to be competitive, so we wonder whether these models can improve ranking effectiveness by enhancing input semantics. This article introduces SE‐BERT, a semantically enhanced bidirectional encoder representation from transformers (BERT) based ranking framework that captures more semantic information by modifying the input text. SE‐BERT utilizes a pretrained generative language model to summarize both sides of the candidate passage and concatenate them into a new input sequence, allowing BERT to acquire more semantic information within the constraints of the input sequence's length. Experimental results from two Text Retrieval Conference datasets demonstrate that our approach's effectiveness increasing as the length of the input text increases.

Funder

China Scholarship Council

National Natural Science Foundation of China

Natural Sciences and Engineering Research Council of Canada

Publisher

Wiley

Subject

Artificial Intelligence,Computational Mathematics

Link

https://onlinelibrary.wiley.com/doi/pdf/10.1111/coin.12603

Reference42 articles.

1. DevlinJ ChangM‐W LeeK ToutanovaK.BERT: pre‐training of deep bidirectional transformers for language understanding. Proceedings of the 17th Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies(NAACL‐HLT'19); 2019:4171‐4186.

2. NogueiraR ChoK.Passage re‐ranking with BERT. arXiv Preprint arXiv:1901.04085; 2019.

3. THE PROBABILITY RANKING PRINCIPLE IN IR

4. ZhaoJ HuangJX BenH.CRTER: Using cross terms to enhance probabilistic information retrieval. Proceedings of the 34th International ACM SIGIR conference on research and development in Information Retrieval; 2011:155‐164.

5. ZhaoJ HuangJX YeZ.Modeling term associations for probabilistic information retrieval. ACM Transactions on Information Systems (TOIS) vol. 32: 1‐47.

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Utilizing passage‐level relevance and kernel pooling for enhancing BERT‐based document reranking;Computational Intelligence;2024-06