Affiliation:
1. University of California Irvine, Irvine, California
2. ALaRI
3. EPFL
4. Intel AI Research
Abstract
The advent of the quantum computer makes current public-key infrastructure insecure. Cryptography community is addressing this problem by designing, efficiently implementing, and evaluating novel public-key algorithms capable of withstanding quantum computational power. Governmental agencies, such as NIST, are promoting standardization of quantum-resistant algorithms that is expected to run for 7 years. Several modern applications must maintain permanent data secrecy; therefore, they ultimately require the use of quantum-resistant algorithms. Because algorithms are still under scrutiny for eventual standardization, the deployment of the hardware implementation of quantum-resistant algorithms is still in early stages.
In this article, we propose a methodology to design programmable hardware accelerators for lattice-based algorithms, and we use the proposed methodology to implement flexible and energy efficient post-quantum cache-based accelerators for
NewHope
,
Kyber
,
Dilithium
, Key Consensus from Lattice (
KCL
), and
R.EMBLEM
submissions to the NIST standardization contest.
To the best of our knowledge, we propose the first efficient domain-specific, programmable cache-based accelerators for lattice-based algorithms. We design a single accelerator for a common kernel among various schemes with different kernel sizes, i.e., loop count, and data types. This is in contrast to the traditional approach of designing one special purpose accelerators for each scheme.
We validate our methodology by integrating our accelerators into an HLS-based SoC infrastructure based on the X86 processor and evaluate overall performance. Our experiments demonstrate the suitability of the approach and allow us to collect insightful information about the performance bottlenecks and the energy efficiency of the explored algorithms. Our results provide guidelines for hardware designers, highlighting the optimization points to address for achieving the highest energy minimization and performance increase. At the same time, our proposed design allows us to specify and execute new variants of lattice-based schemes with superior energy efficiency compared to the main application processor without changing the hardware acceleration platform. For example, we manage to reduce the energy consumption up to 2.1× and energy-delay product (EDP) up to 5.2× and improve the speedup up to 2.5×.
Funder
European Union Horizon 2020 research and innovation programme under SAFEcrypto project
Swiss National Science Foundation
Swiss National Science Foundation project
Qualcomm Technology Inc.
Publisher
Association for Computing Machinery (ACM)
Subject
Hardware and Architecture,Software
Reference37 articles.
1. E. Alkim etal 2016. NewHope Without Reconciliation. Cryptology ePrint Archive Report 2016/1157. E. Alkim et al. 2016. NewHope Without Reconciliation. Cryptology ePrint Archive Report 2016/1157.
2. Fast and area efficient implementation for chaotic image encryption algorithms
3. R. Avanzi etal 2017. CRYSTALS-KYBER. Technical Report. NIST. R. Avanzi et al. 2017. CRYSTALS-KYBER. Technical Report. NIST.
4. Analysis and acceleration of NTRU lattice-based cryptographic system
Cited by
9 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献