Author:
Wang Dong,Tao Ruiyang,Li Zhiqiang,Pan Dun,Wang Zhuo,Li Chengtao,Shi Yongyong
Abstract
Abstract
Background
Short tandem repeats (STRs) are important polymorphism makers for human identification and kinship analyses in forensic science. With the continuous development of massively parallel sequencing (MPS), more laboratories have utilized this technology for forensic applications. Existing STR genotyping tools, mostly developed for whole-genome sequencing data, are not effective for MPS data. More importantly, their backward compatibility with the conventional capillary electrophoresis (CE) technology has not been evaluated and guaranteed.
Results
In this study, we developed a new end-to-end pipeline called STRsearch for STR-MPS data analysis. The STRsearch can not only determine the allele by counting repeat patterns and INDELs that are actually in the STR region, but it also translates MPS results into standard STR nomenclature (numbers and letters). We evaluated the performance of STRsearch in two forensic sequencing datasets, and the concordance with CE genotypes was 75.73 and 75.75%, increasing 12.32 and 9.05% than the existing tool named STRScan, respectively. Additionally, we trained a base classifier using sequence properties and used it to predict the probability of correct genotyping at a given locus, resulting in the highest accuracy of 96.13%.
Conclusions
All these results demonstrated that STRsearch was a better tool to protect the backward compatibility with CE for the targeted STR profiling in MPS data. STRsearch is available as open-source software at https://github.com/AnJingwd/STRsearch.
Funder
Shanghai Municipal Science and Technology Major Project
the National Key R&D Program of China
the Natural Science Foundation of China
Shanghai Hospital Development Center
Shanghai Science and Technology Committee
shanghai municipal health commission
Publisher
Springer Science and Business Media LLC
Subject
Genetics,General Medicine
Reference36 articles.
1. Fan H, Chu JY. A brief review of short tandem repeat mutation. Genomics Proteomics Bioinformatics. 2007;5(1):7–14.
2. Gill P. Role of short tandem repeat DNA in forensic casework in the UK--past, present, and future perspectives. BioTechniques. 2002;32(2):366–8 370, 372, passim.
3. Butler JM. Short tandem repeat typing technologies used in human identity testing. BioTechniques. 2007;43(4):ii–v.
4. Carracedo A, Lareu M. Development of new STRs for forensic casework: criteria for selection, Sequencing & Population Data and forensic validation. In: Proceedings of the ninth international symposium on human identification; 1998.
5. Yang M, Yin C, Lv Y, Yang Y, Chen J, Yu Z, Liu X, Xu M, Chen F, Wu H, et al. Development of a rapid 21-plex autosomal STR typing system for forensic applications. Electrophoresis. 2016;37(21):2789–99.
Cited by
6 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献