Affiliation:
1. IBM Thomas J. Watson Research Center, Yorktown Heights, NY
2. Bilkent University, Ankara, Turkey
Abstract
Big data is revolutionizing how all sectors of our economy do business, including telecommunication, transportation, medical, and finance. Big data comes in two flavors: data at rest and data in motion. Processing data in motion is
stream processing
. Stream processing for big data analytics often requires scale that can only be delivered by a distributed system, exploiting parallelism on many hosts and many cores. One such distributed stream processing system is IBM Streams. Early customer experience with IBM Streams uncovered that another core requirement is extensibility, since customers want to build high-performance domain-specific operators for use in their streaming applications. Based on these two core requirements of distribution and extensibility, we designed and implemented the Streams Processing Language (SPL). This article describes SPL with an emphasis on the language design, distributed runtime, and extensibility mechanism. SPL is now the gateway for the IBM Streams platform, used by our customers for stream processing in a broad range of application domains.
Publisher
Association for Computing Machinery (ACM)
Reference75 articles.
1. Aurora: a new model and architecture for data stream management
2. Efficient pattern matching over event streams
3. Yanif Ahmad and Christoph Koch. 2009. DBToaster: A SQL compiler for high-performance delta processing in main-memory databases. In Demonstration at Very Large Data Bases (VLDB-Demo). 1566--1569. 10.14778/1687553.1687592 Yanif Ahmad and Christoph Koch. 2009. DBToaster: A SQL compiler for high-performance delta processing in main-memory databases. In Demonstration at Very Large Data Bases (VLDB-Demo). 1566--1569. 10.14778/1687553.1687592
4. Tyler Akidau Alex Balikov Kaya Bekiroglu Slava Chernyak Josh Haberman Reuven Lax Sam McVeety Daniel Mills Paul Nordstrom and Sam Whittle. 2013. MillWheel: Fault-tolerant stream processing at internet scale. In Very Large Data Bases (VLDB) Industrial Track. 734--746. 10.14778/2536222.2536229 Tyler Akidau Alex Balikov Kaya Bekiroglu Slava Chernyak Josh Haberman Reuven Lax Sam McVeety Daniel Mills Paul Nordstrom and Sam Whittle. 2013. MillWheel: Fault-tolerant stream processing at internet scale. In Very Large Data Bases (VLDB) Industrial Track. 734--746. 10.14778/2536222.2536229
Cited by
19 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献