Author:
Shi Yunmei,Zhang Haiying,Li Ning,Yang Teng
Abstract
AbstractThe sentence ordering task aims to organize complex, unordered sentences into readable text. This improves accuracy, validity, and reliability in various natural language processing domains, including automatic text generation, text summarization, and machine translation. We begin by analyzing and summarizing the sentence ordering algorithm from two perspectives: the input data approach and the implementation technique approach. Based on the different ways of input data formats, they are classified into pointwise, pairwise, and listwise, and the advantages, disadvantages and representative algorithmic features of each are discussed. Based on the different implementation technologies, we classify them into sentence ordering algorithms based on learning to rank and deep learning, and the core ideas, typical algorithms and research progress of these two categories of methods were specifically explained. We summarize the datasets and evaluation metrics of currently commonly used sentence ordering tasks. Additionally, we analyze the problems and challenges of sentence ordering tasks and look forward to the future direction of this field.
Funder
National Key Research and Development Program of China
National Natural Science Foundation of China
Publisher
Springer Science and Business Media LLC
Reference85 articles.
1. Okazaki, N., Matsuo, Y., Ishizuka, M.: Improving chronological sentence ordering by precedence relation. In: COLING 2004: Proceedings of the 20th International Conference on Computational Linguistics, pp. 750–756 (2004)
2. Paice, C.D.: Constructing literature abstracts by computer: techniques and prospects. Inf. Process. Manag. 26(1), 171–186 (1990)
3. McKeown, K., Klavans, J.L., Hatzivassiloglou, V., Barzilay, R., Eskin, E.: Towards multidocument summarization by reformulation: progress and prospects. In: AAAI/IAAI (1999). https://api.semanticscholar.org/CorpusID:8115414
4. Barzilay, R., Elhadad, N., McKeown, K.: Inferring strategies for sentence ordering in multidocument news summarization. J. Artif. Intell. Res. 17, 35–55 (2002)
5. Barzilay, R., Lee, L.: Catching the drift: Probabilistic content models, with applications to generation and summarization. arXiv preprint https://arxiv.org/pdf/cs/0405039.pdf(2004)