Risk stratification of cervical lesions using capture sequencing and machine learning method based on HPV and human integrated genomic profiles

Author:

Tian Rui1,Cui Zifeng1,He Dan2,Tian Xun3,Gao Qinglei4,Ma Xin5,Yang Jian-rong6,Wu Jun7,Das Bhudev C8,Severinov Konstantin9,Hitzeroth Inga Isabel10,Debata Priya Ranjan11,Xu Wei1,Zhong Haolin12,Fan Weiwen1,Chen Yili1,Jin Zhuang1,Cao Chen2,Yu Miao1,Xie Weiling1,Huang Zhaoyue1,Bao Yuxian213,Xie Hongxian213,Yao Shuzhong1,Hu Zheng14

Affiliation:

1. Department of Obstetrics and Gynecology, Precision Medicine Institute, Sun Yat-sen University, Yuexiu, Guangzhou, Guangdong, China

2. Department of Neurology, The First Affiliated Hospital, Sun Yat-sen University, Yuexiu, Guangzhou, Guangdong, China

3. Department of Obstetrics and Gynecology, Academician Expert Workstation, The Central Hospital of Wuhan, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, China

4. Department of Obstetrics and Gynecology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei Province, China

5. Department of Urology, The General Hospital of the People’s Liberation Army, Beijing, China

6. Department of Biology, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China

7. School of Biomedical Engineering, Sun Yat-sen University, Guangzhou, China

8. Amity Institute of Molecular Medicine and Stem Cell Research, Amity University, Noida, Uttar Pradesh, India

9. Skolkovo Institute of Science and Technology, Skolkovo, Moscow Region, Russia

10. Biopharming Research Unit, Department of Molecular and Cell Biology, University of Cape Town, South Africa

11. Department of Zoology, North Orissa University, Baripada

12. College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, China

13. Generulor Company Bio-X Lab, Guangzhou, Guangdong, China

Abstract

Abstract From initial human papillomavirus (HPV) infection and precursor stages, the development of cervical cancer takes decades. High-sensitivity HPV DNA testing is currently recommended as primary screening method for cervical cancer, whereas better triage methodologies are encouraged to provide accurate risk management for HPV-positive women. Given that virus-driven genomic variation accumulates during cervical carcinogenesis, we designed a 39 Mb custom capture panel targeting 17 HPV types and 522 mutant genes related to cervical cancer. Using capture-based next-generation sequencing, HPV integration status, somatic mutation and copy number variation were analyzed on 34 paired samples, including 10 cases of HPV infection (HPV+), 10 cases of cervical intraepithelial neoplasia (CIN) grade and 14 cases of CIN2+ (CIN2: n = 1; CIN2-3: n = 3; CIN3: n = 9; squamous cell carcinoma: n = 1). Finally, the machine learning algorithm (Random Forest) was applied to build the risk stratification model for cervical precursor lesions based on CIN2+ enriched biomarkers. Generally, HPV integration events (11 in HPV+, 25 in CIN1 and 56 in CIN2+), non-synonymous mutations (2 in CIN1, 12 in CIN2+) and copy number variations (19.1 in HPV+, 29.4 in CIN1 and 127 in CIN2+) increased from HPV+ to CIN2+. Interestingly, ‘common’ deletion of mitochondrial chromosome was significantly observed in CIN2+ (P = 0.009). Together, CIN2+ enriched biomarkers, classified as HPV information, mutation, amplification, deletion and mitochondrial change, successfully predicted CIN2+ with average accuracy probability score of 0.814, and amplification and deletion ranked as the most important features. Our custom capture sequencing combined with machine learning method effectively stratified the risk of cervical lesions and provided valuable integrated triage strategies.

Funder

National Science and Technology Major Project of the Ministry of Science and Technology of China

Nature and Science Foundation of China

Guangzhou Science and Technology Programme

Fundamental Research Funds for the Central Universities

Three Big Constructions

Publisher

Oxford University Press (OUP)

Subject

Cancer Research,General Medicine

Reference35 articles.

Cited by 23 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3