Abstract
AbstractIn this study, we incorporated several NLP techniques to identify the most important factors in the open-ended responses part of theKnowledge, Attitudes, and Practices: Survey of Zoonoses in Wildlife Trade (KAP)in Cambodia. These included: TF-IDF, ngrams, Latent Semantic allocation (LSA), k-means, Latent Dirichlet Allocation (LDA), and Top2Vec. The top topics participants identified included 1) stating that they handled wildlife by setting traps and mist nets, 2) stating they were bitten by bat or rat, 3) which zoonotic symptoms caused sickness, 4) describing how they would go to the hospital when they came down with zoonotic symptoms, and 5) saying that they were aware of avian flu and its symptoms.Based on our findings, recommendations for Cambodian public health officials include: 1) they need to educate participants to wear protective gear to prevent from being bitten by bats and rats during their jobs with these animals, and 2) they need to educate participants about the danger of different types of zoonotic diseases including Ebolavirus, Mojianvirus, etc., so that these participants can recognize the risks when handling bats and rats, and so they can take early action by seeking medical help as soon as they are bitten.
Publisher
Cold Spring Harbor Laboratory
Reference35 articles.
1. Alashwal, H. , el Halaby, M. , Crouse, J. J. , Abdalla, A. , & Moustafa, A. A. (2019). The application of unsupervised clustering methods to Alzheimer’s disease. In Frontiers in Computational Neuroscience (Vol. 13). Frontiers Media S.A. https://doi.org/10.3389/fncom.2019.00031
2. Angelov, D . (2020). Top2Vec: Distributed Representations of Topics. http://arxiv.org/abs/2008.09470
3. Aziz, S.A. , Olival, K.J. , Bumrungsri, S. , Richards, G.C. , and Racey, P.A . (2015). The conflict between pteropid bads and fruit growers: Species, legislation, and mitigation. In: Voigt, C. , Kingston, T . (eds) Bats in the Anthropocene: Conservation of Bats in a Changing World. Springer, Cham. https://doi.org/10.1007/978-3-319-25220-9_13
4. Baclic, O. , Tunis, M. , Young, K. , Doan, C. , & Swerdfeger, H . (2020). Challenges and opportunities for public health made possible by advances in natural language processing. Canada Communicable Disease Report, 161–168. https://doi.org/10.14745/ccdr.v46i06a02
5. Bruce, P. , Bruce, A. , Gedeck, P. (2020). Practical statistics for data scientists: 50+ essential concepts using R and Python. O’Reilly.