Authors:
Bai Haitao, Wang Pinghui, Zhang Ruofei, Su Zhou
Abstract
Topic segmentation aims to reveal the latent structure of a document and divide it into multiple parts. However, current neural solutions are limited in their context modeling of sentences and in their feature representation of candidate boundaries, so they suffer from inefficient sentence context encoding and interference from noisy information. In this paper, we design a new text segmentation model, SegFormer, with unidirectional attention blocks to better model sentence representations. To alleviate interference from noisy information, SegFormer uses a novel additional context aggregator and a topic classification loss to guide the model to aggregate information within an appropriate range. In addition, SegFormer applies an iterative prediction algorithm to progressively search for optimal boundaries. We evaluate SegFormer's generalization ability, multilingual ability, and application ability on multiple challenging real-world datasets. Experiments show that our model significantly improves performance, by 7.5% on the WIKI-SECTION benchmark, compared to several strong baselines. Applying SegFormer to a real-world dataset to separate normal and advertisement segments in product marketing essays also achieves superior performance compared with other cutting-edge models.
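The abstract gives no implementation details, but "unidirectional attention blocks" generally refers to self-attention restricted by a causal mask so that each sentence attends only to itself and earlier sentences. The following is a minimal, hypothetical sketch of such a block over sentence embeddings, assuming PyTorch and arbitrary dimensions; it illustrates the general technique only and is not the authors' actual SegFormer code.

import torch
import torch.nn as nn

class UnidirectionalAttentionBlock(nn.Module):
    """Self-attention block where each sentence position may attend only to
    itself and earlier positions (causal mask), so context flows one way."""

    def __init__(self, dim: int = 256, num_heads: int = 4):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.norm1 = nn.LayerNorm(dim)
        self.ffn = nn.Sequential(
            nn.Linear(dim, 4 * dim), nn.GELU(), nn.Linear(4 * dim, dim)
        )
        self.norm2 = nn.LayerNorm(dim)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, num_sentences, dim) sentence embeddings.
        seq_len = x.size(1)
        # Boolean upper-triangular mask blocks attention to future sentences.
        causal_mask = torch.triu(
            torch.ones(seq_len, seq_len, dtype=torch.bool, device=x.device),
            diagonal=1,
        )
        attn_out, _ = self.attn(x, x, x, attn_mask=causal_mask)
        x = self.norm1(x + attn_out)
        x = self.norm2(x + self.ffn(x))
        return x

if __name__ == "__main__":
    block = UnidirectionalAttentionBlock()
    sentences = torch.randn(2, 10, 256)  # 2 documents, 10 sentences each
    print(block(sentences).shape)  # torch.Size([2, 10, 256])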
Publisher
Association for the Advancement of Artificial Intelligence (AAAI)
Cited by
2 articles.