Affiliation:
1. School of Business, University of Applied Sciences and Arts Northwestern Switzerland, 4600 Olten, Switzerland
2. Institute for Information Systems, University of Applied Sciences and Arts Northwestern Switzerland, 4600 Olten, Switzerland
Abstract
The tremendous increase in documents available on the Web has turned finding the relevant pieces of information into a challenging, tedious, and time-consuming activity. Text summarization is an important natural language processing (NLP) task used to reduce the reading requirements of text. Automatic text summarization is an NLP task that consists of creating a shorter version of a text document which is coherent and maintains the most relevant information of the original text. In recent years, automatic text summarization has received significant attention, as it can be applied to a wide range of applications such as the extraction of highlights from scientific papers or the generation of summaries of news articles. In this research project, we are focused mainly on abstractive text summarization that extracts the most important contents from a text in a rephrased form. The main purpose of this project is to summarize texts in German. Unfortunately, most pretrained models are only available for English. We therefore focused on the German BERT multilingual model and the BART monolingual model for English, with a consideration of translation possibilities. As the source of the experiment setup, took the German Wikipedia article dataset and compared how well the multilingual model performed for German text summarization when compared to using machine-translated text summaries from monolingual English language models. We used the ROUGE-1 metric to analyze the quality of the text summarization.
Reference38 articles.
1. Patel, A., Siddiqui, T.J., and Tiwary, U.S. (June, January 30). A language independent approach to multilingual text summarization. Proceedings of the Conference RIAO 2007, Pittsburgh, PA, USA.
2. Parida, S., and Motlicek, P. (2019, January 3–7). Abstract Text Summarization: A Low Resource Challenge. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), Hong Kong, China.
3. Bornea, M., Pan, L., Rosenthal, S., Florian, R., and Sil, A. (2021, January 2–9). Multilingual transfer learning for QA using translation as data augmentation. Proceedings of the AAAI Conference on Artificial Intelligence, Vancouver, BC, Canada.
4. Review of automatic text summarization techniques & methods;Widyassari;J. King Saud Univ.—Comput. Inf. Sci.,2022
5. Moratanch, N., and Chitrakala, S. (2016, January 18–19). A survey on abstractive text summarization. Proceedings of the 2016 International Conference on Circuit, Power and Computing Technologies (ICCPCT), Nagercoil, India.
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献