Evaluating Document Coherence Modeling

Author:

Shen Aili1,Mistica Meladel2,Salehi Bahar3,Li Hang4,Baldwin Timothy5,Qi Jianzhong6

Affiliation:

1. The University of Melbourne, Australia. aili.shen@unimelb.edu.au

2. The University of Melbourne, Australia. misticam@unimelb.edu.au

3. The University of Melbourne, Australia. baharsalehi@gmail.com

4. AI Lab at ByteDance, China. lihang.lh@bytedance.com

5. The University of Melbourne, Australia. tbaldwin@unimelb.edu.au

6. The University of Melbourne, Australia. jianzhong.qi@unimelb.edu.au

Abstract

Abstract While pretrained language models (LMs) have driven impressive gains over morpho-syntactic and semantic tasks, their ability to model discourse and pragmatic phenomena is less clear. As a step towards a better understanding of their discourse modeling capabilities, we propose a sentence intrusion detection task. We examine the performance of a broad range of pretrained LMs on this detection task for English. Lacking a dataset for the task, we introduce INSteD, a novel intruder sentence detection dataset, containing 170,000+ documents constructed from English Wikipedia and CNN news articles. Our experiments show that pretrained LMs perform impressively in in-domain evaluation, but experience a substantial drop in the cross-domain setting, indicating limited generalization capacity. Further results over a novel linguistic probe dataset show that there is substantial room for improvement, especially in the cross- domain setting.

Publisher

MIT Press - Journals

Subject

Artificial Intelligence,Computer Science Applications,Linguistics and Language,Human-Computer Interaction,Communication

Cited by 4 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Cohewl: Assessing the Semantic Coherence of Short Text at The Word Level;2023

2. Knowledge-Based Techniques for Document Fraud Detection: A Comprehensive Study;Computational Linguistics and Intelligent Text Processing;2023

3. Electronic educational resources of the entrepreneurial university: Kazakhstan practice;Grand Altai Research & Education / Наука и образование Большого Алтая;2022-07-11

4. Detecting Documents With Inconsistent Context;IEEE Access;2022

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3