Author:
Yeom Han-Gyeol,Lee Byung-Do,Lee Wan,Lee Taehan,Yun Jong Pil
Abstract
AbstractThis study suggests a hybrid method based on ResNet50 and vision transformer (ViT) in an age estimation model. To this end, panoramic radiographs are used for learning by considering both local features and global information, which is important in estimating age. Transverse and longitudinal panoramic images of 9663 patients were selected (4774 males and 4889 females with a mean age of 39 years and 3 months). To compare ResNet50, ViT, and the hybrid model, the mean absolute error, mean square error, root mean square error, and coefficient of determination (R2) were used as metrics. The results confirmed that the age estimation model designed using the hybrid method performed better than those using only ResNet50 or ViT. The estimation is highly accurate for young people at an age with distinct growth characteristics. When examining the basis for age estimation in the hybrid model through attention rollout, the proposed model used logical and important factors rather than relying on unclear elements as the basis for age estimation.
Funder
Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education
Korea Institute of Industrial Technology
Publisher
Springer Science and Business Media LLC