Harnessing the Strengths of Anytime Algorithms for Constant Data Streams

Anytime algorithms have been proposed for many different applications e.g. in data mining. Their strengths are the ability to first provide a result after a very short initialization and second to improve their result with additional time. Therefore, anytime algorithms have so far been used when the available processing time varies, e.g. on varying data streams. In this paper we propose to employ anytime algorithms on constant data streams, i.e. for tasks with constant time allowance. We introduce two approaches that harness the strengths of anytime algorithms on constant data streams and thereby improve the over all quality of the result with respect to the corresponding budget algorithm. We derive formulas for the expected performance gain and demonstrate the effectiveness of our novel approaches using existing anytime algorithms on benchmark data sets.

Authors: Kranen P., Seidl T.
Published in: Proc. European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD 2009), Bled, Slowenia.
Publisher: Springer - Heidelberg, Germany
Sprache: EN
Jahr: 2009

The Paper has additionally been chosen for publication in the Data Mining and Knowledge Discovery Journal, ECML PKDD Special Issue (acceptance rate 3.3%).

Seiten: 31
ISBN: 978-3-642-04179-2
Konferenz: ECML PKDD
Typ: Tagungsbeiträge
Forschungsgebiet: Data Analysis and Knowledge Extraction