Efficient HLA imputation from sequential SNPs data by transformer-Reference-Cited by-同舟云学术

Efficient HLA imputation from sequential SNPs data by transformer

Published:2024-08-02 Issue: Volume: Page:
ISSN:1434-5161
Container-title:Journal of Human Genetics
language:en
Short-container-title:J Hum Genet

Author:

Tanaka Kaho,Kato Kosuke,Nonaka Naoki,Seita Jun^ORCID

Abstract

AbstractHuman leukocyte antigen (HLA) genes are associated with a variety of diseases, yet the direct typing of HLA alleles is both time-consuming and costly. Consequently, various imputation methods leveraging sequential single nucleotide polymorphisms (SNPs) data have been proposed, employing either statistical or deep learning models, such as the convolutional neural network (CNN)-based model, DEEP*HLA. However, these methods exhibit limited imputation efficiency for infrequent alleles and necessitate a large size of reference dataset. In this context, we have developed a Transformer-based model to HLA allele imputation, named “HLA Reliable IMpuatioN by Transformer (HLARIMNT)” designed to exploit the sequential nature of SNPs data. We evaluated HLARIMNT’s performance using two distinct reference panels; Pan-Asian reference panel (n = 530) and Type 1 Diabetes genetics Consortium (T1DGC) reference panel (n = 5225), alongside a combined panel (n = 1060). HLARIMNT demonstrated superior accuracy to DEEP*HLA across several indices, particularly for infrequent alleles. Furthermore, we explored the impact of varying training data sizes on imputation accuracy, finding that HLARIMNT consistently outperformed across all data size. These findings suggest that Transformer-based models can efficiently impute not only HLA types but potentially other gene types from sequential SNPs data.

Publisher

Springer Science and Business Media LLC

Link

https://www.nature.com/articles/s10038-024-01278-x.pdf

Reference35 articles.

1. Dendrou CA, Petersen J, Rossjohn J, Fugger L. HLA variation and disease. Nat Rev Immunol. 2018;18:325–39. https://doi.org/10.1038/nri.2017.143

2. Fan WL, Shiao MS, Hui RC, Su SC, Wang CW, Chang YC, et al. HLA association with drug-induced adverse reactions. J Immunol Res. 2017;2017:3186328. https://doi.org/10.1155/2017/3186328.

3. Ko TM, Tsai CY, Chen SY, Chen KS, Yu KH.Chu CS,et al. Use of HLA-B58:01 genotyping to prevent allopurinol induced severe cutaneous adverse reactions in Taiwan: National prospective cohort study. BMJ. 2015;351. https://doi.org/10.1136/bmj.h4848.

4. Hirata J, Hosomichi K, Sakaue S, Kanai M, Nakaoka H, Ishigaki K, et al. Genetic and phenotypic landscape of the major histocompatibility complex region in the Japanese population. Nat Genet. 2019;51:470–80. https://doi.org/10.1038/s41588-018-0336-0.

5. Erlich H. HLA DNA typing: past, present, and future. Tissue Antigens. 2012;80:1–11. https://doi.org/10.1111/j.1399-0039.2012.01881.x