GLADE-Reference-Cited by-同舟云学术

GLADE

Published:2012-02-16 Issue:1 Volume:46 Page:12-18
ISSN:0163-5980
Container-title:ACM SIGOPS Operating Systems Review
language:en
Short-container-title:SIGOPS Oper. Syst. Rev.

Author:

Rusu Florin¹,Dobra Alin²

Affiliation:

1. University of California, Merced, Merced, CA

2. University of Florida, Gainesville, FL

Abstract

In this paper we introduce GLADE, a scalable distributed framework for large scale data analytics. GLADE consists of a simple user-interface to define Generalized Linear Aggregates (GLA), the fundamental abstraction at the core of GLADE, and a distributed runtime environment that executes GLAs by using parallelism extensively. GLAs are derived from User-Defined Aggregates (UDA), a relational database extension that allows the user to add specialized aggregates to be executed inside the query processor. GLAs extend the UDA interface with methods to Serialize/Deserialize the state of the aggregate required for distributed computation. As a significant departure from UDAs which can be invoked only through SQL, GLAs give the user direct access to the state of the aggregate, thus allowing for the computation of significantly more complex aggregate functions. GLADE runtime is an execution engine optimized for the GLA computation. The runtime takes the user-defined GLA code, compiles it inside the engine, and executes it right near the data by taking advantage of parallelism both inside a single machine as well as across a cluster of computers. This results in maximum possible execution time performance (all our experimental tasks are I/O-bound) and linear scaleup.

Publisher

Association for Computing Machinery (ACM)

Link

https://dl.acm.org/doi/pdf/10.1145/2146382.2146386

Reference17 articles.

1. Hadoop. http://hadoop.apache.org/. {Online; accessed July 2011}. Hadoop. http://hadoop.apache.org/. {Online; accessed July 2011}.

2. Microsoft SQL Server. http://msdn.microsoft.com/enus/library/ms131057.aspx. {Online; accessed July 2011}. Microsoft SQL Server. http://msdn.microsoft.com/enus/library/ms131057.aspx. {Online; accessed July 2011}.

3. The DataPath system

4. MAD skills

Cited by 14 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Disaggregating the Differential Impact of Healthcare IT in Complex Care Delivery: Insights from Field Research in Chronic Care;Journal of the Association for Information Systems;2021

2. Data Management in Machine Learning Systems;Synthesis Lectures on Data Management;2019-02-25

3. Moment-based quantile sketches for efficient high cardinality aggregation queries;Proceedings of the VLDB Endowment;2018-07

4. Automatic identification and classification of Palomar Transient Factory astrophysical objects in GLADE;International Journal of Computational Science and Engineering;2018

5. In-database batch and query-time inference over probabilistic graphical models using UDA–GIST;The VLDB Journal;2016-11-02