The influence of marker number and sequencing depth on the ability to identify mismatch repair deficient tumours

Author:

Law Joseph,Gallon Richard,Teare Ethan,Koref Ivan Santibanez,Phelps Rachel,Burn John,Jackson Michael,Koref Mauro Santibanez

Abstract

1.AbstractAnalysis of somatic mutation patterns is widely used to infer exposure to exogenous and endogenous mutagenic influences. This raises the question of the amount of sequence data required to detect factors of interest. A common use of mutation pattern analysis is the identification of increased microsatellite instability to uncover mismatch repair (MMR) defects in tumours and normal tissues. Here we explore the effects of sequencing depth and the number of loci analysed on the ability to detect MMR deficiency using artificial neural networks and publicly available amplicon sequencing data from colorectal tumours on 24 short quasi monomorphic microsatellites (up to 12 bp in length, PMID 31471937) split in a training (99 samples) and a test set (95 samples). We show that, at a sequencing depth of 200, pairs mononucleotide repeats can achieve discrimination between MMR proficient and deficient colorectal tumours similar to that obtained with the full 24 marker panel, with accuracies above 97% and ROC AUCs in excess of 99% in the test set. Our results indicate that for short monomorphic microsatellites considering the length distribution of the different alleles at each locus, representing these distributions as two-dimensional structures and including convolutional layers in the network can facilitate discrimination between MMR deficient and proficient tumour material. They also indicate that, despite the limitations imposed by amplification, sequencing accuracy and the limited divergence time between the sequences from one locus, high depth sequencing can be used to identify MMR deficiency from a limited number of loci. However, they also suggest that, for a fixed total number of reads per sample, increasing sensitivity by increasing the number of targets is more efficient than by increasing per target sequencing depth. These results are of interest for screening large numbers of samples and for assessing the impact MMR deficiency in different areas of the genome.

Publisher

Cold Spring Harbor Laboratory

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3