Affiliation:
1. KLE Institute of Technology
Abstract
Abstract
Traditional diagnostic techniques are prone to human error and are time-consuming. Computer-aided diagnostic procedures improve performance and reduce expenses. This paper presents machine learning-based classifiers to detect diabetes in India, and Indian Demographic & Health Survey (2019–21) dataset is considered for the analysis. Classifiers like Support Vector Machine, Decision Tree, Extreme Gradient Boosting, and Random Forest are considered. The four phases adopted in work include preprocessing, feature extraction, classification, and analysis. Principal Component Analysis is used for feature reduction. Hyper-tuning parameters are tuned to achieve good performance of the classifiers. From the results, Random Forest has given the maximum classification accuracy, precision, recall, and area under the curve compared with other models. The work finds application in healthcare for the predictive analysis of diabetes.
Publisher
Research Square Platform LLC
Reference18 articles.
1. S. Webber, International Diabetes Federation Atlas, Vol. 102, No. 2. 2021. doi: 10.1016/j.diabres.2013.10.013.
2. E. Summary, “National Family Health Survey India Report,” Minist. Heal. Fam. Welfare, Govt. India, pp. 1–713, 2021.
3. P. Thotad, D. G. R. Bharamagouda, and D. S. G. Totad, “SURVEY ON MINING DIABETES DATA AND ITS APPLICATIONS ON DIAGNOSING METHODS IN DISEASE MANAGEMENT USING BIG DATA,” Int. J. Adv. Res. Comput. Sci., vol. 9, no. 1, pp. 341–345, Feb. 2018, doi: 10.26483/IJARCS.V9I1.5261.
4. S. Kumari, D. Kumar, and M. Mittal, “An ensemble approach for classification and prediction of diabetes mellitus using soft voting classifier,” Int. J. Cogn. Comput. Eng., vol. 2, no. November 2020, pp. 40–46, 2021, doi: 10.1016/j.ijcce.2021.01.001.
5. Early and accurate prediction of diabetics based on FCBF feature selection and SMOTE;Kishor A;Int. J. Syst. Assur. Eng. Manag.,2021