A randomized approach for approximating the number of frequent sets

Mario Boley, Henrik Grosskreutz

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

We investigate the problem of counting the number of frequent (item)sets-a problem known to be intractable in terms of an exact polynomial time computation. In this paper, we show that it is in general also hard to approximate. Subsequently, a randomized counting algorithm is developed using the Markov chain Monte Carlo method. While for general inputs an exponential running time is needed in order to guarantee a certain approximation bound, we empirically show that the algorithm still has the desired accuracy on real-world datasets when its running time is capped polynomially.

Original languageEnglish
Title of host publicationProceedings - 8th IEEE International Conference on Data Mining, ICDM 2008
Pages43-52
Number of pages10
DOIs
StatePublished - 2008
Externally publishedYes
Event8th IEEE International Conference on Data Mining, ICDM 2008 - Pisa, Italy
Duration: 15 Dec 200819 Dec 2008

Publication series

NameProceedings - IEEE International Conference on Data Mining, ICDM
ISSN (Print)1550-4786

Conference

Conference8th IEEE International Conference on Data Mining, ICDM 2008
Country/TerritoryItaly
CityPisa
Period15/12/0819/12/08

ASJC Scopus subject areas

  • General Engineering

Fingerprint

Dive into the research topics of 'A randomized approach for approximating the number of frequent sets'. Together they form a unique fingerprint.

Cite this