Asynchronous Automata Processing on GPUs-Reference-Cited by-同舟云学术

Asynchronous Automata Processing on GPUs

Published:2023-06-26 Issue:1 Volume:51 Page:23-24
ISSN:0163-5999
Container-title:ACM SIGMETRICS Performance Evaluation Review
language:en
Short-container-title:SIGMETRICS Perform. Eval. Rev.

Author:

Liu Hongyuan¹^ORCID,Pai Sreepathi²^ORCID,Jog Adwait³^ORCID

Affiliation:

1. William & Mary and The Hong Kong University of Science & Technology (Guangzhou), Williamsburg, VA, USA

2. University of Rochester, Rochester, NY, USA

3. William & Mary and University of Virginia, Williamsburg, VA, USA

Abstract

Finite-state automata serve as compute kernels for application domains such as pattern matching and data analytics. Existing approaches on GPUs exploit three levels of parallelism in automata processing tasks: 1) input stream level, 2) automaton-level, and 3) state-level. Among these, only state-level parallelism is intrinsic to automata while the other two levels of parallelism depend on the number of automata and input streams to be processed. As GPU resources increase, a parallelism-limited automata processing task can underutilize GPU compute resources. To overcome this, we propose AsyncAP, a low-overhead approach that optimizes scalability and throughput. Our insight is that most automata processing tasks have an additional source of parallelism originating from the input symbols which has not been leveraged before. By making the matching process asynchronous, which involves having parallel GPU threads process an input stream from different input locations instead of processing it serially, AsyncAP is able to significantly improve throughput and scale with input length. Detailed evaluation across 12 applications shows that AsyncAP achieves an average speedup of 58x speedup over the state-of-the-art GPU automata processing engine when the task does not have enough parallelism to utilize all GPU cores. When tasks have enough parallelism to utilize GPU cores, AsyncAP still achieves 2.4x speedup.

Funder

National Science Foundation

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Networks and Communications,Hardware and Architecture,Software

Link

https://dl.acm.org/doi/pdf/10.1145/3606376.3593524

Reference2 articles.

1. Why GPUs are Slow at Executing NFAs and How to Make them Faster

2. Hongyuan Liu , Sreepathi Pai , and Adwait Jog . 2023 . Asynchronous Automata Processing on GPUs. Proceedings of the ACM on Measurement and Analysis of Computing Systems , Vol. 7 , 1, Article 27 (March 2023), 27 pages. https://doi.org/ 10 .1145/3579453 10.1145/3579453 Hongyuan Liu, Sreepathi Pai, and Adwait Jog. 2023. Asynchronous Automata Processing on GPUs. Proceedings of the ACM on Measurement and Analysis of Computing Systems, Vol. 7, 1, Article 27 (March 2023), 27 pages. https://doi.org/10.1145/3579453