Lung Cancer Prediction using Machine Learning
-
Published:2021-05-19
Issue:
Volume:
Page:21-27
-
ISSN:2581-9429
-
Container-title:International Journal of Advanced Research in Science, Communication and Technology
-
language:en
-
Short-container-title:IJARSCT
Author:
Khan Faraz1, Pradhan Khusboo1, Sinha Deependra1
Affiliation:
1. Galgotias College of Engineering and Technology, Greater Noida, India
Abstract
Prevention is always a better option than cure, especially when it comes to deadly disease like cancer. According to the latest reports by the International Agency for Research on Cancer, Lung cancer is the second leading cause of death. Along with the family history, main causes for lung cancer are Genetic Factors, Smoking and the unhealthy lifestyle in the world. In a developing country like India, cancer treatments are very costly and hard to access to all the sections of the society. In a country where 220 million Indians sustained with an expenditure level of Rs 32/day, affording cancer treatments is impossible. Therefore, predicting the disease acts as the saviour to the millions of people in the country. For this purpose, we have identified the specific genes responsible for causing lung cancer in the Human race. Selecting a small number of genes can lead to a better accuracy. In this paper , we have used Kruskal-Wallis test. This helped us to select the genes expression data. Finally, we have identified12 influential genes responsible for causing lung cancer. The accuracy of the model is 84.375% using the Random Forest algorithm. All the files and codes used in the work is available at https://github.com/Farazkhan0516/Lung-Cancer-Prediction-using-Machine-Learning.git.
Publisher
Naksh Solutions
Reference20 articles.
1. Das, U., Hasan, M.A.M. and Rahman, J., 2019,February. Influential Gene Identification for Cancer Classification. In 2019 International Conference on Electrical, Computer and Communication Engineering(ECCE)(pp. 1-6)IEEE. 2. Wang,Y.,Tetko,I.V.,M.A.,Frank,E.,Facius,A.,Mayer,K.F.andMewes,H.W.,2005.Gene Selection from Microarray Data for Cancer Classification-A Machine Learning Approach. Communication Biologyand Chemistry,29(1), pp.37-46. 3. Vishabh Goel,R, 2018. Building a Simple Machine Learning Model on Breast Cancer Data. Towards Data Science, pp(1-6). 4. Brodrick, P., Wang, Y., Vijayakrishnan., J.,Matakidou, A.,Spitz,M.R.,Eisen,T.,Amos,C.I. and Houlston, R.S.,2009.Deciphering the impact of common genetic variation on lung cancer risk: agenome-wide association study.Cancer Research, 69(16),pp.6633-6641 5. Chen,K.H, Wang,K.J., Tsai, M.L.,Wang, K.M., Adrian, A.M., Cheng, W.C., Yang, T.S., Teng, N.C.,Tan, K.P.and Chang,K.S.,2014.Gene selection for cancer identification: a decision tree model empowered by particle swarm optimization algorithm. BMC Bioinformatics,15(1),p49.
|
|