Automated Paraphrase Quality Assessment Using Language Models and Transfer Learning

Published: 2021-12-06
Volume: 10
Issue: 12
Page: 166
ISSN: 2073-431X
Journal: Computers
Language: en

Author:

Nicula Bogdan, Dascalu Mihai, Newton Natalie N., Orcutt Ellen, McNamara Danielle S.

Abstract

Learning to paraphrase supports both writing ability and reading comprehension, particularly for less skilled learners. As such, educational tools that integrate automated evaluations of paraphrases can be used to provide timely feedback to enhance learner paraphrasing skills more efficiently and effectively. Paraphrase identification is a popular NLP classification task that involves establishing whether two sentences share a similar meaning. Paraphrase quality assessment is a slightly more complex task, in which pairs of sentences are evaluated in-depth across multiple dimensions. In this study, we focus on four dimensions: lexical, syntactical, semantic, and overall quality. Our study introduces and evaluates various machine learning models using handcrafted features combined with Extra Trees, Siamese neural networks using BiLSTM RNNs, and pretrained BERT-based models, together with transfer learning from a larger general paraphrase corpus, to estimate the quality of paraphrases across the four dimensions. Two datasets are considered for the tasks involving paraphrase quality: ULPC (User Language Paraphrase Corpus) containing 1998 paraphrases and a smaller dataset with 115 paraphrases based on children’s inputs. The paraphrase identification dataset used for the transfer learning task is the MSRP dataset (Microsoft Research Paraphrase Corpus) containing 5801 paraphrases. On the ULPC dataset, our BERT model improves upon the previous baseline by at least 0.1 in F1-score across the four dimensions. When using fine-tuning from ULPC for the children dataset, both the BERT and Siamese neural network models improve upon their original scores by at least 0.11 F1-score. The results of these experiments suggest that transfer learning using generic paraphrase identification datasets can be successful, while at the same time obtaining comparable results in fewer epochs.
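To make the abstract's terms concrete, the following is a minimal sketch of what handcrafted pair features and an F1-score computation can look like. It is an illustration only, not the authors' feature set or evaluation code: the function names `tokenize`, `pair_features`, and `f1_score`, the two features (Jaccard word overlap and length ratio), and the binary labels are all assumptions introduced here.

```python
from typing import Dict, List


def tokenize(sentence: str) -> List[str]:
    """Lowercase, split on whitespace, and strip surrounding punctuation."""
    tokens = (tok.strip(".,;:!?\"'()") for tok in sentence.lower().split())
    return [tok for tok in tokens if tok]


def pair_features(source: str, paraphrase: str) -> Dict[str, float]:
    """Two illustrative (hypothetical) lexical features for a sentence pair."""
    src, par = tokenize(source), tokenize(paraphrase)
    src_set, par_set = set(src), set(par)
    union = src_set | par_set
    # Jaccard overlap: shared word types divided by all word types.
    jaccard = len(src_set & par_set) / len(union) if union else 0.0
    # Length ratio: shorter token count divided by longer token count.
    if src and par:
        length_ratio = min(len(src), len(par)) / max(len(src), len(par))
    else:
        length_ratio = 0.0
    return {"jaccard": jaccard, "length_ratio": length_ratio}


def f1_score(y_true: List[int], y_pred: List[int], positive: int = 1) -> float:
    """F1 for one class: harmonic mean of precision and recall."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    print(pair_features("The cat sat on the mat.", "A cat was sitting on the mat."))
    print(f1_score([1, 1, 0, 0], [1, 0, 1, 0]))
```

Features of this kind would typically be fed to a tree ensemble such as Extra Trees, with one classifier and one F1-score per quality dimension; the neural models named in the abstract learn their representations directly from the sentence pair instead.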

Funder

Office of Naval Research

Institute of Education Sciences

Romanian National Authority for Scientific Research and Innovation, CNCS – UEFISCDI

Publisher

MDPI AG

Subject

Computer Networks and Communications, Human-Computer Interaction

Link

https://www.mdpi.com/2073-431X/10/12/166/pdf

Cited by 4 articles.

1. Geographical origin identification of Khao Dawk Mali 105 rice using combination of FT-NIR spectroscopy and machine learning algorithms;Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy;2024-10

2. A Systematic Literature Review: Are Automated Essay Scoring Systems Competent in Real-Life Education Scenarios?;IEEE Access;2024

3. A breast cancer risk predication and classification model with ensemble learning and big data fusion;Decision Analytics Journal;2023-09

4. Predicting Breast Cancer from Risk Factors Using SVM and Extra-Trees-Based Feature Selection Method;Computers;2022-09-12