Content Analysis Using Specific Natural Language Processing Methods for Big Data-Reference-Cited by-同舟云学术

Content Analysis Using Specific Natural Language Processing Methods for Big Data

Published:2024-01-31 Issue:3 Volume:13 Page:584
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Pirnau Mironela¹,Botezatu Mihai Alexandru²^ORCID,Priescu Iustin¹,Hosszu Alexandra³,Tabusca Alexandru²^ORCID,Coculescu Cristina²,Oncioiu Ionica⁴⁵^ORCID

Affiliation:

1. Department of Informatics, Faculty of Informatics, Titu Maiorescu University, 040051 Bucharest, Romania

2. Department of Informatics, Statistics and Mathematics, School of Computer Science for Business Management, Romanian American University, 012101 Bucharest, Romania

3. Department of Sociology, Faculty of Sociology and Social Work, University of Bucharest, 030018 Bucharest, Romania

4. Faculty of Economic Sciences, Titu Maiorescu University, 040051 Bucharest, Romania

5. Faculty of Economics and Business Administration, “Eugeniu Carada” Doctoral School of Economic Sciences, University of Craiova, 200585 Craiova, Romania

Abstract

Researchers from different fields have studied the effects of the COVID-19 pandemic and published their results in peer-reviewed journals indexed in international databases such as Web of Science (WoS), Scopus, PubMed. Focusing on efficient methods for navigating the extensive literature on COVID-19 pandemic research, our study conducts a content analysis of the top 1000 cited papers in WoS that delve into the subject by using elements of natural language processing (NLP). Knowing that in WoS, a scientific paper is described by the group Paper = {Abstract, Keyword, Title}; we obtained via NLP methods the word dictionaries with their frequencies of use and the word cloud for the 100 most used words, and we investigated if there is a degree of similarity between the titles of the papers and their abstracts, respectively. Using the Python packages NLTK, TextBlob, VADER, we computed sentiment scores for paper titles and abstracts, analyzed the results, and then, using Azure Machine Learning-Sentiment analysis, extended the range of comparison of sentiment scores. Our proposed analysis method can be applied to any research topic or theme from papers, articles, or projects in various fields of specialization to create a minimal dictionary of terms based on frequency of use, with visual representation by word cloud. Complementing the content analysis in our research with sentiment and similarity analysis highlights the different or similar treatment of the topics addressed in the research, as well as the opinions and feelings conveyed by the authors in relation to the researched issue.

Publisher

MDPI AG

Link

https://www.mdpi.com/2079-9292/13/3/584/pdf

Reference81 articles.

1. WHO Declares COVID-19 a Pandemic;Cucinotta;Acta Biomed.,2020

2. (2023, September 01). World Health Organization. Available online: https://www.who.int/.

3. Mapping the research landscape of COVID-19 from social sciences perspective: A bibliometric analysis;Roychowdhury;Scientometrics,2022

4. Akl, E.A., Meho, L.I., Farran, S.H., Nasrallah, A.A., and Ghandour, B. (2020). The Pandemic of the COVID-19 Literature: A Bibliometric Analysis, Running Title: Bibliometric Analysis of the COVID-19 Literature. Res. Sq., 1–20.

5. Ageel, M. (2022). Pandemic Critical Care Research during the COVID-19 (2020–2022): A Bibliometric Analysis Using VOSviewer. BioMed. Res. Int., 2022.

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Enhancing personalized learning: AI-driven identification of learning styles and content modification strategies;International Journal of Cognitive Computing in Engineering;2024