CERT: Contrastive Self-supervised Learning for Language Understanding

Published: 2020-05-21

Author:

Fang Hongchao, Xie Pengtao

Abstract

Pretrained language models such as BERT and GPT have shown great effectiveness in language understanding. The auxiliary predictive tasks in existing pretraining approaches are mostly defined on tokens and thus may not capture sentence-level semantics very well. To address this issue, we propose CERT: Contrastive self-supervised Encoder Representations from Transformers, which pretrains language representation models using contrastive self-supervised learning at the sentence level. CERT creates augmentations of original sentences using back-translation. It then finetunes a pretrained language encoder (e.g., BERT) by predicting whether two augmented sentences originate from the same sentence. CERT is simple to use and can be flexibly plugged into any pretraining-finetuning NLP pipeline. We evaluate CERT on three language understanding tasks: CoLA, RTE, and QNLI. CERT outperforms BERT significantly.
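The sentence-level contrastive objective described in the abstract can be illustrated with a small sketch. The abstract does not state the exact loss, so the code below assumes an InfoNCE-style loss with cosine similarity and a temperature of 0.1. The function names `cosine` and `info_nce_loss` and the toy 2-D vectors are hypothetical stand-ins for encoder outputs. Back-translation and the BERT encoder are not implemented here.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def info_nce_loss(queries, keys, temperature=0.1):
    """Average InfoNCE-style loss (an assumed choice, not confirmed by the abstract).

    queries[i] and keys[i] are embeddings of two augmentations of the
    same original sentence; every other key acts as a negative.
    """
    total = 0.0
    for i, q in enumerate(queries):
        logits = [cosine(q, k) / temperature for k in keys]
        # Log-sum-exp with max subtraction for numerical stability.
        m = max(logits)
        log_denominator = m + math.log(sum(math.exp(l - m) for l in logits))
        # Negative log-probability of picking the true partner.
        total += log_denominator - logits[i]
    return total / len(queries)


# Toy embeddings standing in for encoder outputs of augmented sentences.
views_a = [[1.0, 0.0], [0.0, 1.0]]
views_b = [[0.9, 0.1], [0.1, 0.9]]

aligned = info_nce_loss(views_a, views_b)
shuffled = info_nce_loss(views_a, list(reversed(views_b)))
print(aligned < shuffled)
```

In this toy setup the loss should be lower when each augmentation is paired with its true partner than when the pairs are shuffled, which is the signal a contrastive objective of this kind uses to train the encoder.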

Publisher

Institute of Electrical and Electronics Engineers (IEEE)

Cited by 47 articles.

1. DCCN: A dual-cross contrastive neural network for 3D point cloud representation learning;Expert Systems with Applications;2024-09

2. Soft Contrastive Sequential Recommendation;ACM Transactions on Information Systems;2024-08-19

3. Cryptocurrency Prediction Mining in Web 3.0 Environment;Advances in Web Technologies and Engineering;2024-08-16

4. Towards a cyberbullying detection approach: fine-tuned contrastive self-supervised learning for data augmentation;International Journal of Data Science and Analytics;2024-07-17

5. CMCS: contrastive-metric learning via vector-level sampling and augmentation for code search;Scientific Reports;2024-06-24