Lung cancer clustering by identification of similarities and discrepancies of DNA copy numbers using maximal information coefficient
Author:
N. Kachouie Nezamoddin,
Deebani Wejdan,
Shutaywi Meshal,
Christiani David C.
Abstract
Lung cancer is the second most commonly diagnosed cancer and the leading cause of cancer-related death for both men and women in the United States. Early detection is essential because patient survival is suboptimal and the recurrence rate is high. Copy number (CN) changes in cancer populations have been broadly investigated to identify CN gains and deletions associated with cancer. In this research, the similarities between cancer and paired peripheral blood samples are identified using the maximal information coefficient (MIC), and the spatial locations with substantially high MIC scores in each chromosome are used for clustering analysis. The results show that a sizable reduction of the feature set can be obtained by using only the subset of locations with high MIC values. Clustering performance was evaluated using both true rate and normalized mutual information (NMI). Clustering with the reduced feature set outperformed clustering with the entire feature set in several chromosomes that are highly associated with lung cancer and contain several identified oncogenes.
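The two computational steps named in the abstract, keeping only locations with high MIC scores and scoring a clustering with NMI, can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes the per-location MIC scores have already been computed elsewhere, the function names and the threshold parameter are hypothetical, and the NMI normalization shown (twice the mutual information divided by the sum of the two entropies) is one common convention that may differ from the one used in the paper.

```python
import math
from collections import Counter


def select_high_score_locations(scores, threshold):
    """Return the indices of locations whose score is at least `threshold`.

    `scores` is assumed to hold one precomputed MIC value per location;
    `threshold` is a hypothetical cut-off chosen by the analyst.
    """
    return [i for i, s in enumerate(scores) if s >= threshold]


def normalized_mutual_information(labels_true, labels_pred):
    """NMI between two labelings: 2 * I(U;V) / (H(U) + H(V)).

    This arithmetic-mean normalization is an assumption; other
    normalizations (min, max, geometric mean) are also in use.
    """
    n = len(labels_true)
    if n == 0 or n != len(labels_pred):
        raise ValueError("label sequences must be non-empty and of equal length")

    count_true = Counter(labels_true)
    count_pred = Counter(labels_pred)
    count_joint = Counter(zip(labels_true, labels_pred))

    def entropy(counts):
        # Shannon entropy (natural log) of the empirical label distribution.
        return -sum((c / n) * math.log(c / n) for c in counts.values())

    # Mutual information from the joint and marginal label counts.
    mutual_info = sum(
        (c / n) * math.log((c * n) / (count_true[u] * count_pred[v]))
        for (u, v), c in count_joint.items()
    )

    h_true = entropy(count_true)
    h_pred = entropy(count_pred)
    if h_true + h_pred == 0:
        # Both labelings put every sample in one cluster, so they agree trivially.
        return 1.0
    return 2 * mutual_info / (h_true + h_pred)
```

In this sketch, a clustering that reproduces the true grouping up to a renaming of cluster labels scores 1, and a clustering that is statistically independent of the true grouping scores 0.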
Funder
National Institutes of Health
Publisher
Public Library of Science (PLoS)