Affiliation:
1. School of Information Technology and Engineering, Vellore Institute of Technology (VIT), Vellore-632014, Tamil Nadu, India
2. Department of Applied Data Science, Noroff University College, Norway
Abstract
Bacterial antimicrobial resistance (AMR) analysis has recently become an active research topic. AMR data comprise information such as the antibiotic product name, class name, subclass name, type, subtype, and gene type, which can help in fighting illness. However, the tagging language used to describe the data is free text. This text often contains ambiguous data, which makes retrieving, organizing, merging, and finding the relevant data highly challenging. Manually reading and labelling this text is time-consuming. Topic modeling addresses these challenges and provides efficient results in categorizing topics and identifying the data. To this end, this research work designs an ensemble of artificial intelligence models for categorizing AMR gene data and determining the relationships between antibiotics. The novelty of the work lies in the proposed weighted-voting ensemble model, which incorporates Latent Dirichlet Allocation (LDA) and Hierarchical Recurrent Neural Networks (HRNN). The model is used to determine the number of “topics” that cluster, utilizing a multidimensional scaling approach. In addition, the proposed model includes a data pre-processing stage that removes stop words and punctuation, applies lower casing, etc. Moreover, an exploratory data analysis using a word cloud verifies proper functionality before the model training process proceeds. Three approaches, namely perplexity, harmonic mean, and random initialization of K, are employed to determine the number of topics. For experimental validation, an openly accessible bacterial AMR reference gene database is employed. The experimental results show that perplexity provided the optimal number of topics for the AMR gene data of more than 6500 samples. The proposed model therefore helps to find the appropriate antibiotic for bacterial and viral spread and to discover how to increase the proper antibiotic in human bodies.
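The pre-processing and weighted-voting steps named in the abstract can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the abstract does not give the stop-word list, the voting rule, or the model weights, so `STOP_WORDS`, `preprocess`, `weighted_vote`, and the example probabilities are hypothetical, and the voting rule shown (a weighted average of per-model topic probabilities) is only one plausible reading of "weighted voting".

```python
import string

# Hypothetical stop-word list; the abstract does not say which list is used.
STOP_WORDS = {"the", "a", "an", "of", "and", "to", "in", "is", "for"}


def preprocess(text):
    """Lower-case the text, replace punctuation with spaces, drop stop words."""
    table = str.maketrans(string.punctuation, " " * len(string.punctuation))
    tokens = text.lower().translate(table).split()
    return [t for t in tokens if t not in STOP_WORDS]


def weighted_vote(distributions, weights):
    """Combine per-model topic distributions by a weighted average.

    Assumed voting rule (not taken from the paper): each model contributes
    its topic probabilities scaled by its weight. Returns the index of the
    winning topic and the combined distribution.
    """
    if len(distributions) != len(weights):
        raise ValueError("need exactly one weight per model")
    total = sum(weights)
    n_topics = len(distributions[0])
    combined = [
        sum(w * d[k] for d, w in zip(distributions, weights)) / total
        for k in range(n_topics)
    ]
    best = max(range(n_topics), key=combined.__getitem__)
    return best, combined


# Illustrative inputs only: made-up text and made-up topic probabilities.
tokens = preprocess("The beta-lactam class of Antibiotics!")
lda_probs = [0.6, 0.3, 0.1]   # stand-in for an LDA topic distribution
hrnn_probs = [0.2, 0.7, 0.1]  # stand-in for an HRNN topic distribution
best, combined = weighted_vote([lda_probs, hrnn_probs], weights=[0.4, 0.6])
```

With these made-up numbers the second topic wins, because the more heavily weighted model favours it. In practice the weights would have to come from some validation measure, which the abstract does not specify.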
Publisher
World Scientific Pub Co Pte Ltd
Subject
Artificial Intelligence, Information Systems, Control and Systems Engineering, Software
Cited by
1 article.