NAPOLY: A Non-deterministic Automata Processor OverLaY

Author:

Karakchi Rasha1ORCID,Bakos Jason D.1ORCID

Affiliation:

1. University of South Carolina

Abstract

Deterministic and Non-deterministic Finite Automata (DFA and NFA) comprise the core of many big data applications. Recent efforts to develop Domain-Specific Architectures (DSAs) for DFA/NFA have taken divergent approaches, but achieving consistent throughput for arbitrarily-large pattern sets, state activation rates, and pattern match rates remains a challenge. In this article, we present NAPOLY (Non-Deterministic Automata Processor OverLaY), an FPGA overlay and associated compiler. A common limitation of prior efforts is a limit on NFA size for achieving the advertised throughput. NAPOLY is optimized for fast re-programming to permit practical time-division multiplexing of the hardware and permit high asymptotic throughput for NFAs of unlimited size, unlimited state activation rate, and high pattern reporting rate. NAPOLY also allows for offline generation of configurations having tradeoffs between state capacity and transition capacity. In this article, we (1) evaluate NAPOLY using benchmarks packaged in the ANMLZoo benchmark suite, (2) evaluate the use of an SAT solver for allocating physical resources, and (3) compare NAPOLY’s performance against existing solutions. NAPOLY performs most favorably on larger benchmarks, benchmarks with higher state activation frequency, and benchmarks with higher reporting frequency. NAPOLY outperforms the fastest of the CPU and GPU implementations in 10 out of 12 benchmarks.

Funder

National Science Foundation

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science

Reference36 articles.

1. Kevin Angstadt Jack Wadden Vinh Dang Ted Xie Dan Kramp Westley Weimer Mircea Stan and Kevin Skadron. 2018. MNCaRT: An open-source multi-architecture automata-processing research and execution ecosystem. IEEE Computer Architecture Letters 17 1 (2018) 84–87. 10.1109/LCA.2017.2780105

2. Data Structures, Algorithms and Architectures for Efficient Regular Expression Evaluation;Becchi Michela;Washington University, St. Louis, MO,2009

3. Paul Dlugosch Dave Brown Paul Glendenning Michael Leventhal and Harold Noyes. 2014. An efficient and scalable semiconductor architecture for parallel automata processing. IEEE Transactions on Parallel and Distributed Systems 25 12 (2014) 3088–3098. 10.1109/TPDS.2014.8

4. Andrew Putnam Adrian M. Caulfield Eric S. Chung Derek Chiou Kypros Constantinides John Demme Hadi Esmaeilzadeh Jeremy Fowers Gopi Prashanth Gopal Jan Gray Michael Haselman Scott Hauck Stephen Heil Amir Hormati Joo-Young Kim Sitaram Lanka James Larus Eric Peterson Simon Pope Aaron Smith Jason Thong Phillip Yi Xiao and Doug Burger. 2015. A reconfigurable fabric for accelerating large-scale datacenter services. IEEE Micro 35 3 (2015) 10–22. 10.1109/MM.2015.42

5. Arun Subramaniyan Jingcheng Wang Ezhil R. M. Balasubramanian David Blaauw Dennis Sylvester and Reetuparna Das. 2017. Cache automaton. In Proceedings of the 50th Annual IEEE/ACM International Symposium on Microarchitecture (Cambridge Massachusetts) (MICRO-50’17) Association for Computing Machinery New York NY 259–272. 10.1145/3123939.3123986

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3