The Design of the Borealis Stream Processing Engine

Frank J Cangialosi, Yanif Ahmad, Magdalena Balazinska, Ugur Cetintemel, Mitch Cherniack, Jeong-Hyon Hwang, Wolfgang Lindner, Anurag S Maskey, Alexander Rasin, Esther Ryvkina, Nesime Tatbul, Ying Xing, Stan Zdonik
Second Biennial Conference on Innovative Data Systems Research (CIDR 2005), Asilomar, CA, January 2005

Borealis is a second-generation distributed stream processing engine that is being developed at Brandeis University, Brown University, and MIT. Borealis inherits core stream processing functionality from Aurora and distribution functionality from Medusa. Borealis modifies and extends both systems in non-trivial and critical ways to provide advanced capabilities that are commonly required by newly-emerging stream processing applications. In this paper, we outline the basic design and functionality of Borealis. Through sample real-world applications, we motivate the need for dynamically revising query results and modifying query specifications. We then describe how Borealis addresses these challenges through an innovative set of features, including revision records, time travel, and control lines. Finally, we present a highly flexible and scalable QoS-based optimization model that operates across server and sensor networks and a new fault-tolerance model with flexible consistency-availability trade-offs.

[PDF (143KB)]

Bibtex Entry:

@inproceedings{cangialosi2005design,
   author =       "Frank J Cangialosi and Yanif Ahmad and Magdalena Balazinska and Ugur Cetintemel and Mitch Cherniack and Jeong-Hyon Hwang and Wolfgang Lindner and Anurag S Maskey and Alexander Rasin and Esther Ryvkina and Nesime Tatbul and Ying Xing and Stan Zdonik",
   title =        "{The Design of the Borealis Stream Processing Engine}",
   booktitle =    {Second Biennial Conference on Innovative Data Systems Research (CIDR 2005)},
   year =         {2005},
   month =        {January},
   address =      {Asilomar, CA}
}