Identifying missing data handling methods with text mining-Reference-Cited by-同舟云学术

Identifying missing data handling methods with text mining

Published:2024-06-17 Issue: Volume: Page:
ISSN:2364-415X
Container-title:International Journal of Data Science and Analytics
language:en
Short-container-title:Int J Data Sci Anal

Author:

Boros Krisztián,Kmetty Zoltán

Abstract

AbstractMissing data is an inevitable aspect of every empirical research. Researchers developed several techniques to handle missing data to avoid information loss and biases. Over the past 50 years, these methods have become more and more efficient and also more complex. Building on previous review studies, this paper aims to analyze what kind of missing data handling methods are used among various scientific disciplines. For the analysis, we used nearly 50.000 scientific articles published between 1999 and 2016. JSTOR provided the data in text format. We utilized a text-mining approach to extract the necessary information from our corpus. Our results show that the usage of advanced missing data handling methods, such as Multiple Imputation or Full Information Maximum Likelihood estimation, is steadily growing in the examination period. Additionally, simpler methods, like listwise and pairwise deletion, are still in widespread use.

Funder

HUN-REN Centre for Social Sciences

Publisher

Springer Science and Business Media LLC

Link

https://link.springer.com/content/pdf/10.1007/s41060-024-00582-1.pdf

Reference45 articles.

1. Dong, Y., Peng, C.-Y.J.: Principled missing data methods for researchers. Springerplus 2(1), 222 (2013). https://doi.org/10.1186/2193-1801-2-222

2. Enders, C.K.: Applied Missing Data Analysis. Methodology in the social sciences. Guilford Press, New York (2010)

3. Graham, J.W., Cumsille, P.E., Shevock, A.E.: Methods for Handling Missing Data. In: Handbook of Psychology, 2nd edn., pp. 109–141. Wiley, Hoboken, NJ (2013). https://doi.org/10.1002/9781118133880.hop202004

4. Little, T.D., Jorgensen, T.D., Lang, K.M., Moore, E.W.G.: On the Joys of Missing Data. J. Pediatr. Psychol. 39(2), 151–162 (2014). https://doi.org/10.1093/jpepsy/jst048

5. Little, T.D., Lang, K.M., Wu, W., Rhemtulla, M.: Statistical Issues: What Happens When Data Go Missing? In: Developmental Psychopathology, Third edition edn., p. 37. Wiley, Hoboken, NJ (2016). ISBN: 978-1-118-12179-5