1. Text Segmentation by Cross Segment Attention
2. Two-level transformer and auxiliary coherence modeling for im-proved text segmentation;glavas;AAAI,0
3. Google's neural machine translation system: Bridging the gap between human and machine translation;yonghui;CoRR,2016
4. BERT: pre-training of deep bidi-rectional transformers for language understanding;devlin;NAACL-HLT,2019
5. Roberta: A ro-bustly optimized BERT pretraining approach;liu;CoRR,2019