DC-Graph: a chunk optimization model based on document classification and graph learning-Reference-Cited by-同舟云学术

DC-Graph: a chunk optimization model based on document classification and graph learning

Published:2024-05-16 Issue:6 Volume:57 Page:
ISSN:1573-7462
Container-title:Artificial Intelligence Review
language:en
Short-container-title:Artif Intell Rev

Author:

Zhou Jingjing,Zhang Guohao,Alfarraj Osama,Tolba Amr,Li Xuefeng,Zhang Hao

Abstract

AbstractExisting machine reading comprehension methods use a fixed stride to chunk long texts, which leads to missing contextual information at the boundaries of the chunks and a lack of communication between the information within each chunk. This paper proposes DC-Graph model, addressing existing issues in terms of reconstructing and supplementing information in long texts. Knowledge graphs contain extensive knowledge, and the semantic relationships between entities exhibit strong logical characteristics, which can assist the model in semantic understanding and reasoning. By categorizing the questions, this paper filters the content of long texts based on categories and reconstructs the content that aligns with the question category, compressing and optimizing the long text to minimize the number of document chunks when inputted into BERT. Additionally, unstructured text is transformed into a structured knowledge graph, and features are extracted using graph convolutional networks. These features are then added as global information to each chunk, aiding answer prediction. Experimental results on the CoQA, QuAC, and TriviaQA datasets demonstrate that our method outperforms both BERT and Recurrent Chunking Mechanisms, which share the same improvement approach, in terms of F1 and EM score. The code is available at (https://github.com/guohaozhang/DC-Graph.git).

Funder

King Saud University, Riyadh, Saudi Arabia

Publisher

Springer Science and Business Media LLC

Link

https://link.springer.com/content/pdf/10.1007/s10462-024-10771-w.pdf

Reference44 articles.

1. Beltagy I, Peters ME, Cohan A (2020) Longformer: the long-document transformer. In: Proceedings of the conference on empirical methods in natural language processing (EMNLP), pp 4123–4133

2. Bollacker K, Evans C, Paritosh P et al (2008) Freebase: a collaboratively created graph database for structuring human knowledge. In: Proceedings of the 2008 ACM SIGMOD international conference on Management of data, pp 1247–1250

3. Choi E, He H, Iyyer M et al (2018) Quac: question answering in context. In: Proceedings of the 2018 conference on empirical methods in natural language processing, pp 2174–2184

4. Clark C, Gardner M (2018) Simple and effective multi-paragraph reading comprehension. In: Proceedings of the 56th annual meeting of the association for computational linguistics (volume 1: long papers), pp 845–855

5. Diao S, Xu T, Xu R et al (2023) Mixture-of-domain-adapters: decoupling and injecting domain knowledge to pre-trained language models’ memories. In: Proceedings of the 61st annual meeting of the association for computational linguistics (volume 1: long papers), pp 5113–5129