Affiliation:
1. Tokyo Medical University , Hachioji , Japan
Abstract
Abstract
The authors of this paper applied a new approach combining text mining and principal component analysis (PCA) to objectively determine the actual state of regional COVID-19 strategy meetings and verified its utility. The authors used text mining to analyze meeting minutes and extracted words with high phase ubiquity by co-occurrence analysis. Then, they selected words symbolizing the meeting contents (“report,” “prevention,” “rules,” and “decision”) and performed PCA using the occurrence rates of these words as variables. Two principal components (PC1, PC2) were set. For PC1, we observed maximum factor loading for “decision” (0.81) and minimum for “report” (-0.72), so we considered this axis to show the “depth of meeting discussions.” For PC2, we observed maximum factor loading for “prevention” (0.81) and minimum for “rule” (-0.76). We considered this axis to show “regional infection status.” When we created a plot of all 44 meetings, Phase 1 occurred in quadrants 3 to 4 (knowledge sharing), phase 2 began in quadrant 1 (preparation for spread), and phase 3 shifted to quadrant 2 (response to spread) with significant differences between these phases. Our findings suggest that the actual state of regional COVID-19 strategy meetings could be objectively determined by using a combination of text mining and PCA.
Reference27 articles.
1. Abegaz, F., Chaichoompu, K., Génin, E., Fardo, D. W., König, I. R., John, J. M. M., & Van Steen, K. (2019). Principals about principal components in statistical genetics. Brief Bioinform, 20(6), 2200–2216. https://doi.org/10.1093/bib/bby08110.1093/bib/bby081
2. Alameda-Hernández, Á. (2008). SFL and CDA: Contributions of the analysis of the transitivity system in the study of the discursive construction of national identity (case study: Gibraltar). The Linguistics Journal, 3(3), 160–175.
3. Blaschke, C., Yeh, A., Hirschman, L., & Valencia, A. (2003). ISMB 2003 Text mining SIG meeting report. Computer Funct Genomics, 4, 667–673. https://doi.org/10.1002/cfg.33810.1002/cfg.338
4. Clark, N. R., & Ma’ayan, A. (2011). Introduction to statistical methods to analyze large data sets: Principal components analysis. Science Signaling, 4(190), 3. https://doi.org/10.1126/scisignal.200196710.1126/scisignal.2001967
5. Dancy-Scott, N., Dutcher, G. A., Keselman, A., Hochstein, C., Copty, C., Ben-Senia, D., Rajan, S., Asencio, M. G., & Choi, J. J. (2018). Trends in HIV terminology: Text mining and data visualization assessment of International AIDS Conference abstracts over 25 years. JMIR Public Health Surveill, 4, e50. https://doi.org/10.2196/publichealth.855210.2196/publichealth.8552