A geometric approach to monitoring threshold functions over distributed data streams

Izchak Sharfman, Assaf Schuster, Daniel Keren

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Monitoring data streams in a distributed system is the focus of much research in recent years. Most of the proposed schemes, however, deal with monitoring simple aggregated values, such as the frequency of appearance of items in the streams. More involved challenges, such as the important task of feature selection (e.g., by monitoring the information gain of various features), still require very high communication overhead using naive, centralized algorithms. We present a novel geometric approach by which an arbitrary global monitoring task can be split into a set of constraints applied locally on each of the streams. The constraints are used to locally filter out data increments that do not affect the monitoring outcome, thus avoiding unnecessary communication. As a result, our approach enables monitoring of arbitrary threshold functions over distributed data streams in an efficient manner. We present experimental results on real-world data which demonstrate that our algorithms are highly scalable, and considerably reduce communication load in comparison to centralized algorithms.

Original languageEnglish
Title of host publicationSIGMOD 2006 - Proceedings of the ACM SIGMOD International Conference on Management of Data
Pages301-312
Number of pages12
DOIs
StatePublished - 2006
Event2006 ACM SIGMOD International Conference on Management of Data - Chicago, IL, United States
Duration: 27 Jun 200629 Jun 2006

Publication series

NameProceedings of the ACM SIGMOD International Conference on Management of Data
ISSN (Print)0730-8078

Conference

Conference2006 ACM SIGMOD International Conference on Management of Data
Country/TerritoryUnited States
CityChicago, IL
Period27/06/0629/06/06

Keywords

  • Data streams
  • Distributed monitoring

ASJC Scopus subject areas

  • Software
  • Information Systems

Fingerprint

Dive into the research topics of 'A geometric approach to monitoring threshold functions over distributed data streams'. Together they form a unique fingerprint.

Cite this