Reaching quality and efficiency with a parameter-efficient controllable sentence simplification approach

Published: 2024
Issue: 3
Volume: 21
Page: 899-921
ISSN: 1820-0214
Container-title: Computer Science and Information Systems
Language: en
Short-container-title: ComSIS

Author:

Menta Antonio¹, Garcia-Serrano Ana¹

Affiliation:

1. E.T.S.I. Informática (UNED) C. de Juan del Rosal, Madrid, Spain

Abstract

Automatic Text Simplification (ATS) aims to transform texts to improve their readability and comprehensibility. Current solutions are based on Large Language Models (LLMs). These models perform well but require powerful computing resources and large amounts of data to be fine-tuned for specific and technical domains, which prevents most researchers from adapting them to their area of study. The main contributions of this research are as follows: (1) an accurate solution for settings where powerful resources are not available, which exploits transfer learning across domains by combining a set of linguistic features with a reduced-size pre-trained language model (T5-small), making the approach accessible to a broader range of researchers and individuals; (2) an evaluation of our model on two well-known datasets, TurkCorpus and ASSET, and an analysis of the influence of control tokens on the SimpleText corpus, focusing on the domains of Computer Science and Medicine. Finally, a detailed discussion comparing our approach with state-of-the-art models for sentence simplification is included.

Publisher

National Library of Serbia
