Author:
Walsh Reece,Abdelpakey Mohamed H.,Shehata Mohamed S.,Mohamed Mostafa M.
Abstract
AbstractClassifying and analyzing human cells is a lengthy procedure, often involving a trained professional. In an attempt to expedite this process, an active area of research involves automating cell classification through use of deep learning-based techniques. In practice, a large amount of data is required to accurately train these deep learning models. However, due to the sparse human cell datasets currently available, the performance of these models is typically low. This study investigates the feasibility of using few-shot learning-based techniques to mitigate the data requirements for accurate training. The study is comprised of three parts: First, current state-of-the-art few-shot learning techniques are evaluated on human cell classification. The selected techniques are trained on a non-medical dataset and then tested on two out-of-domain, human cell datasets. The results indicate that, overall, the test accuracy of state-of-the-art techniques decreased by at least 30% when transitioning from a non-medical dataset to a medical dataset. Reptile and EPNet were the top performing techniques tested on the BCCD dataset and HEp-2 dataset respectively. Second, this study evaluates the potential benefits, if any, to varying the backbone architecture and training schemes in current state-of-the-art few-shot learning techniques when used in human cell classification. To this end, the best technique identified in the first part of this study, EPNet, is used for experimentation. In particular, the study used 6 different network backbones, 5 data augmentation methodologies, and 2 model training schemes. Even with these additions, the overall test accuracy of EPNet decreased from 88.66% on non-medical datasets to 44.13% at best on the medical datasets. Third, this study presents future directions for using few-shot learning in human cell classification. In general, few-shot learning in its current state performs poorly on human cell classification. The study proves that attempts to modify existing network architectures are not effective and concludes that future research effort should be focused on improving robustness towards out-of-domain testing using optimization-based or self-supervised few-shot learning techniques.
Publisher
Springer Science and Business Media LLC
Reference50 articles.
1. Link, D. Programming enter: Christopher strachey’s draughts program. Comput. Resurrection. Bull. Comput. Conserv. Soc. 60, 23–31 (2012).
2. McCorduck, P. & Cfe, C. Machines Who Think: A Personal Inquiry into the History and Prospects of Artificial Intelligence (CRC Press, 2004).
3. Jackson, P. Introduction to Expert Systems (Addison-Wesley Longman Publishing Co. Inc, 1998).
4. Rumelhart, D. E., Hinton, G. E. & Williams, R. J. Learning representations by back-propagating errors. Nature 323, 533–536 (1986).
5. Bai, B., Li, G., Wang, S., Wu, Z. & Yan, W. Time series classification based on multi-feature dictionary representation and ensemble learning. Exp. Syst. Appl. 169, 114162 (2021).
Cited by
15 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献