Parallel Implementations of ARIA on ARM Processors and Graphics Processing Unit
-
Published:2022-11-30
Issue:23
Volume:12
Page:12246
-
ISSN:2076-3417
-
Container-title:Applied Sciences
-
language:en
-
Short-container-title:Applied Sciences
Author:
Eum Siwoo, Kim Hyunjun, Kwon Hyeokdong, Sim Minjoo, Song Gyeongju, Seo HwajeongORCID
Abstract
The ARIA block cipher algorithm is Korean standard, IETF standard (RFC 5794), and part of the TLS/SSL protocol. In this paper, we present the parallel implementation of ARIA block cipher on ARMv8 processors and GPU. The ARMv8 processor is the latest 64-bit ARM architecture and supports ASIMD for parallel implementations. With this feature, 4 and 16 parallel encryption blocks are implemented to optimize the substitution layer of ARIA block cipher using four different Sboxes. Compared to previous works, the performance was improved by 2.76× and 8.73× at 4-plaintext and 16-plaintext cases, respectively. We also present optimal implementation on GPU architectures. GPUs are highly parallel programmable processors featuring maximum arithmetic and memory bandwidth. Optimal settings of ARIA block cipher implementation on GPU were analyzed using the Nsight Compute profiler provided by Nvidia. We found that using shared memory reduces the execution timing when performing substitution operations with Sbox tables. When using many threads with shared memory instead of global memory, it improves performance by about 1.08∼1.43×. Additionally, techniques using table expansion to minimize bank conflicts have been found to be inefficient when tables cannot be copied by the size of the bank. We measured the performance of ARIA block ciphers implemented with various settings. This represents an optimized GPU implementation of the ARIA block cipher.
Funder
Hansung University
Subject
Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science
Reference25 articles.
1. Fujii, H., Rodrigues, F.C., and López, J. (2019, January 8–9). Fast AES implementation using ARMv8 ASIMD without cryptography extension. Proceedings of the International Conference on Information Security and Cryptology, Nanjing, China. 2. AES proposal: Rijndael;Daemen;Int. J. Commun. Netw. Syst. Sci.,1999 3. Optimized Implementation of SM4 on AVR Microcontrollers, RISC-V Processors, and ARM Processors;Kwon;Cryptol. Eprint Arch.,2021 4. Kim, H., Sim, M., Jang, K., Kwon, H., Uhm, S., and Seo, H. (2021). Masked Implementation of Format Preserving Encryption on Low-End AVR Microcontrollers and High-End ARM Processors. Mathematics, 9. 5. An, S., Kim, Y., Kwon, H., Seo, H., and Seo, S.C. (2020). Parallel implementations of ARX-based block ciphers on graphic processing units. Mathematics, 8.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|