Benchmarking immunoinformatic tools for the analysis of antibody repertoire sequences

Author:

Smakaj Erand1,Babrak Lmar1ORCID,Ohlin Mats2,Shugay Mikhail3,Briney Bryan4,Tosoni Deniz1,Galli Christopher1,Grobelsek Vendi5,D’Angelo Igor6,Olson Branden78,Reddy Sai5,Greiff Victor9ORCID,Trück Johannes10,Marquez Susanna11,Lees William12ORCID,Miho Enkelejda113ORCID

Affiliation:

1. Institute of Biomedical Engineering and Medical Informatics, School of Life Sciences, FHNW University of Applied Sciences and Arts Northwestern Switzerland, Muttenz 4132, Switzerland

2. Department of Immunotechnology, Lund University, Lund 223, Sweden

3. Center of Life Sciences, Skolkovo Institute of Science and Technology, Moscow 121205, Russia

4. Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA 92037, USA

5. Department of Biosystems Science and Engineering, ETH Zurich, Basel 4058, Switzerland

6. One Amgen Center Drive, Amgen, Inc., Therapeutic Discovery/Molecular Engineering, Thousand Oaks, CA 91320, USA

7. Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA

8. Department of Statistics, University of Washington, Seattle, WA 98195, USA

9. Department of Immunology, University of Oslo, Oslo 0372, Norway

10. Paediatric Immunology, Children’s Research Center, University Children's Hospital, University of Zurich, Zurich 8032, Switzerland

11. Department of Pathology, Yale School of Medicine, New Haven, CT 06511, USA

12. Department of Biological Sciences and Institute of Structural and Molecular Biology, Birkbeck College, University of London, London WC1E 7HX, UK

13. aiNET GmbH, Switzerland Innovation Park Basel Area AG, Basel 4057, Switzerland

Abstract

Abstract Summary Antibody repertoires reveal insights into the biology of the adaptive immune system and empower diagnostics and therapeutics. There are currently multiple tools available for the annotation of antibody sequences. All downstream analyses such as choosing lead drug candidates depend on the correct annotation of these sequences; however, a thorough comparison of the performance of these tools has not been investigated. Here, we benchmark the performance of commonly used immunoinformatic tools, i.e. IMGT/HighV-QUEST, IgBLAST and MiXCR, in terms of reproducibility of annotation output, accuracy and speed using simulated and experimental high-throughput sequencing datasets. We analyzed changes in IMGT reference germline database in the last 10 years in order to assess the reproducibility of the annotation output. We found that only 73/183 (40%) V, D and J human genes were shared between the reference germline sets used by the tools. We found that the annotation results differed between tools. In terms of alignment accuracy, MiXCR had the highest average frequency of gene mishits, 0.02 mishit frequency and IgBLAST the lowest, 0.004 mishit frequency. Reproducibility in the output of complementarity determining three regions (CDR3 amino acids) ranged from 4.3% to 77.6% with preprocessed data. In addition, run time of the tools was assessed: MiXCR was the fastest tool for number of sequences processed per unit of time. These results indicate that immunoinformatic analyses greatly depend on the choice of bioinformatics tool. Our results support informed decision-making to immunoinformaticians based on repertoire composition and sequencing platforms. Availability and implementation All tools utilized in the paper are free for academic use. Supplementary information Supplementary data are available at Bioinformatics online.

Funder

Wellcome Trust

Publisher

Oxford University Press (OUP)

Subject

Computational Mathematics,Computational Theory and Mathematics,Computer Science Applications,Molecular Biology,Biochemistry,Statistics and Probability

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3