PENet: A phenotype encoding network for automatic extraction and representation of morphological discriminative features

Author:

Zhao Zhengyu (1, 2, 3), Lu Yuanyuan (1, 2), Tong Yijie (1, 2), Chen Xin (4), Bai Ming (1, 2, 3, 5)

Affiliation:

1. Key Laboratory of Animal Biodiversity Conservation and Integrated Pest Management (Chinese Academy of Sciences), Institute of Zoology, Chinese Academy of Sciences, Beijing, China

2. Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China

3. University of Chinese Academy of Sciences, Beijing, China

4. Cangzhou Normal University, Cangzhou, Hebei Province, China

5. Northeast Asia Biodiversity Research Centre, Northeast Forestry University, Harbin, China

Abstract

Digitalized natural history collections serve as vital resources for ecological and evolutionary research. Specimen retrieval based on morphological features allows similar specimens to be acquired rapidly from these collections, helping to maximize the use of their resources and to meet the requirements of related research. However, achieving this objective requires effective feature extraction and representation techniques. We developed a phenotype encoding network (PENet), a deep learning-based model that combines hashing methods to automatically extract discriminative features and encode them into hash codes. We evaluated the performance of PENet on six data sets, including a newly constructed beetle data set (6566 images), which covers over 60% of the genera within the six subfamilies of Scarabaeidae. PENet showed high performance in feature extraction and image retrieval, allowing users to input an image of a specimen and efficiently retrieve all specimens with similar morphology. Two visualization methods, t-SNE and Grad-CAM, were used to evaluate the representation ability of the hash codes. Additionally, a phenetic distance tree was constructed for the beetle data set from the hash codes generated by PENet. The result indicated that the hash codes could reveal the phenetic distances and relationships among categories to a certain extent. PENet provides an automatic and efficient method to extract and represent morphological discriminative features. The generated hash code can be used as a low-dimensional carrier of these features, enabling efficient specimen retrieval. Moreover, the distance information carried by these hash codes suggests their potential in systematics, which deserves further exploration.
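
The retrieval step described in the abstract, comparing hash codes to find morphologically similar specimens, can be illustrated with a minimal sketch. This is not the PENet implementation: the abstract does not specify how features are binarized or how codes are compared, so the sign-threshold binarization, the Hamming-distance ranking, the 4-bit code length, the function names (`binarize`, `hamming`, `retrieve`) and the toy feature vectors below are all assumptions made for illustration only.

```python
from typing import List, Sequence, Tuple


def binarize(features: Sequence[float]) -> int:
    """Pack the signs of a real-valued feature vector into an integer hash code.

    Assumption: a positive value maps to bit 1, anything else to bit 0.
    """
    code = 0
    for value in features:
        code = (code << 1) | (1 if value > 0 else 0)
    return code


def hamming(a: int, b: int) -> int:
    """Number of bit positions in which two hash codes differ."""
    return bin(a ^ b).count("1")


def retrieve(
    query: int, database: List[Tuple[str, int]], top_k: int = 3
) -> List[Tuple[str, int]]:
    """Return the top_k (label, distance) pairs closest to the query code."""
    ranked = sorted(
        ((label, hamming(query, code)) for label, code in database),
        key=lambda pair: pair[1],
    )
    return ranked[:top_k]


# Toy, made-up feature vectors standing in for network outputs (hypothetical).
database = [
    ("specimen_a", binarize([0.9, -0.2, 0.4, -0.7])),
    ("specimen_b", binarize([0.8, -0.1, -0.3, -0.6])),
    ("specimen_c", binarize([-0.5, 0.7, -0.9, 0.2])),
]
query = binarize([0.7, -0.4, 0.5, -0.1])

for label, distance in retrieve(query, database, top_k=2):
    print(label, distance)
```

Because each code is a short bit string, comparing two specimens reduces to an XOR and a bit count, which is one reason hash codes are attractive as low-dimensional carriers for retrieval over large collections. The same pairwise distances could in principle feed a distance-based tree-building method, although the abstract does not say which method was used for the phenetic distance tree.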

Funder

China Postdoctoral Science Foundation

National Natural Science Foundation of China

Publisher

Wiley

Subject

Ecological Modeling; Ecology, Evolution, Behavior and Systematics
