Affiliation:
1. IT-Infrastructure for Translational Medical Research, University of Augsburg, 86159 Augsburg, Germany
2. Medical Data Integration Center, Institute for Digital Medicine, University Hospital Augsburg, 86156 Augsburg, Germany
Abstract
Performance measures are an important tool for assessing and comparing different medical image segmentation algorithms. Unfortunately, the current measures have their weaknesses when it comes to assessing certain edge cases. These limitations arise when images with a very small region of interest or without a region of interest at all are assessed. As a solution to these limitations, we propose a new medical image segmentation metric: MISm. This metric is a composition of the Dice similarity coefficient and the weighted specificity. MISm was investigated for definition gaps, an appropriate scoring gradient, and different weighting coefficients used to propose a constant value. Furthermore, an evaluation was performed by comparing the popular metrics in the medical image segmentation and MISm using images of magnet resonance tomography from several fictitious prediction scenarios. Our analysis shows that MISm can be applied in a general way and thus also covers the mentioned edge cases, which are not covered by other metrics, in a reasonable way. In order to allow easy access to MISm and therefore widespread application in the community, as well as reproducibility of experimental results, we included MISm in the publicly available evaluation framework MISeval.
Funder
German Ministry of Education
Research
Reference23 articles.
1. Deep Learning in Medical Image Analysis;Shen;Annu. Rev. Biomed. Eng.,2017
2. A survey on deep learning in medical image analysis;Litjens;Med. Image Anal.,2017
3. Lee, K., Zung, J., Li, P.H., Jain, V., and Seung, H. (2017). Superhuman Accuracy on the SNEMI3D Connectomics Challenge. arXiv.
4. Chicco, D., and Jurman, G. (2020). The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation. BMC Genom., 21.
5. Müller, D., Soto-Rey, I., and Kramer, F. (2022). Towards a Guideline for Evaluation Metrics in Medical Image Segmentation. arXiv.