The Feature Selection Problem in Computer–Assisted Cytology-Reference-Cited by-同舟云学术

The Feature Selection Problem in Computer–Assisted Cytology

Published:2018-12-01 Issue:4 Volume:28 Page:759-770
ISSN:2083-8492
Container-title:International Journal of Applied Mathematics and Computer Science
language:en
Short-container-title:

Author:

Kowal Marek¹,Skobel Marcin¹,Nowicki Norbert¹²

Affiliation:

1. Institute of Control and Computation Engineering University of Zielona Góra, ul. Szafrana 2, 65-516 Zielona Góra , Poland

2. Department of Medical Physics University Hospital in Zielona Góra, ul. Zyty 26, 65-046 Zielona Góra , Poland

Abstract

Abstract Modern cancer diagnostics is based heavily on cytological examinations. Unfortunately, visual inspection of cytological preparations under the microscope is a tedious and time-consuming process. Moreover, intra- and inter-observer variations in cytological diagnosis are substantial. Cytological diagnostics can be facilitated and objectified by using automatic image analysis and machine learning methods. Computerized systems usually preprocess cytological images, segment and detect nuclei, extract and select features, and finally classify the sample. In spite of the fact that a lot of different computerized methods and systems have already been proposed for cytology, they are still not routinely used because there is a need for improvement in their accuracy. This contribution focuses on computerized breast cancer classification. The task at hand is to classify cellular samples coming from fine-needle biopsy as either benign or malignant. For this purpose, we compare 5 methods of nuclei segmentation and detection, 4 methods of feature selection and 4 methods of classification. Nuclei detection and segmentation methods are compared with respect to recall and the F1 score based on the Jaccard index. Feature selection and classification methods are compared with respect to classification accuracy. Nevertheless, the main contribution of our study is to determine which features of nuclei indicate reliably the type of cancer. We also check whether the quality of nuclei segmentation/detection significantly affects the accuracy of cancer classification. It is verified using the test set that the average accuracy of cancer classification is around 76%. Spearman’s correlation and chi-square test allow us to determine significantly better features than the feature forward selection method.

Publisher

Walter de Gruyter GmbH

Subject

Applied Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)

Link

https://www.sciendo.com/pdf/10.2478/amcs-2018-0058

Reference29 articles.

1. Araújo, T., Aresta, G., Castro, E., Rouco, J., Aguiar, P., Eloy, C., Polónia, A. and Campilho, A. (2017). Classification of breast cancer histology images using convolutional neural networks, PLOS ONE 12(6): 1-14.

2. Breiman, L., Friedman, J., Olshen, R. and Stone, C. (1984). Classification and Regression Trees, Wadsworth & Brooks/Cole Advanced Books & Software, Monterey, CA.

3. Cheng, J. and Rajapakse, J.C. (2009). Segmentation of clustered nuclei with shape markers and marking function, IEEE Transactions on Biomedical Engineering 56(3): 741-748.

4. Cortes, C. and Vapnik, V. (1995). Support-vector networks, Machine Learning 20(3): 273-297.

5. Cover, T. and Hart, P. (1967). Nearest neighbor pattern classification, IEEE Transactions on Information Theory 13(1): 21-27.

Cited by 18 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. OptiFeat: Enhancing Feature Selection, A Hybrid Approach Combining Subject Matter Expertise and Recursive Feature Elimination Method;2024-08-13

2. Compatible-domain Transfer Learning for Breast Cancer Classification with Limited Annotated Data;Computers in Biology and Medicine;2023-03

3. A contemporary multi-objective feature selection model for depression detection using a hybrid pBGSK optimization algorithm;International Journal of Applied Mathematics and Computer Science;2023

4. Choice of the p-norm for high level classification features pruning in modern convolutional neural networks with local sensitivity analysis;International Journal of Applied Mathematics and Computer Science;2023

5. Global and local attentional feature alignment for domain adaptive nuclei detection in histopathology images;Artificial Intelligence in Medicine;2022-10