Distributed Weighted Clustering of Evolving Sensor Data Streams with Noise

Collecting data from sensor nodes is the ultimate goal of Wireless Sensor Networks. This is done by transmitting the sensed measurements to a data collection station. On sensor nodes, radio communication is the dominant consumer of the usually limited energy resources. Summarizing the sensed data locally on the sensor nodes and sending only the summaries saves considerable energy. Clustering is an established data mining technique for grouping objects based on similarity. For sensor networks, center clustering aims at grouping sensor measurements into groups, each containing similar measurements.


In this article, we propose a novel resource-aware center clustering algorithm called SenClu. Our algorithm immediately detects new trends in the drifting sensor data stream and follows them. SenClu uses a lightweight decay technique that gives lower influence to old data. As sensor data are usually noisy, our algorithm is also outlier-aware. In thorough experiments on drifting synthetic and real-world data sets, we show that SenClu outperforms two state-of-the-art algorithms by producing higher clustering quality and following trends in the stream, while consuming nearly the same amount of energy.
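
The idea of giving lower influence to old data while ignoring noisy readings can be illustrated with a minimal sketch. This is not the SenClu algorithm itself, whose details are not given in this abstract; it assumes a one-dimensional measurement, an exponential fading function, and a fixed outlier radius. The class name `DecayedCenter` and the parameters `decay_rate` and `outlier_radius` are hypothetical and chosen only for illustration.

```python
class DecayedCenter:
    """One cluster center whose past measurements lose weight with age.

    Illustrative only: the fading function 2^(-decay_rate * dt) and the
    fixed outlier radius are assumptions, not taken from the article.
    """

    def __init__(self, value, timestamp, decay_rate=0.1):
        self.decay_rate = decay_rate   # hypothetical decay parameter
        self.weighted_sum = value      # decayed sum of accepted measurements
        self.weight = 1.0              # decayed count of accepted measurements
        self.last_update = timestamp

    def _fade(self, timestamp):
        # Scale down the stored summary according to the elapsed time.
        factor = 2.0 ** (-self.decay_rate * (timestamp - self.last_update))
        self.weighted_sum *= factor
        self.weight *= factor
        self.last_update = timestamp

    @property
    def center(self):
        # Decay-weighted mean of all accepted measurements.
        return self.weighted_sum / self.weight

    def insert(self, value, timestamp, outlier_radius):
        """Absorb a measurement unless it lies too far from the center."""
        self._fade(timestamp)
        if abs(value - self.center) > outlier_radius:
            return False               # treated as noise, center unchanged
        self.weighted_sum += value
        self.weight += 1.0
        return True


cluster = DecayedCenter(10.0, timestamp=0, decay_rate=1.0)
accepted = cluster.insert(12.0, timestamp=1, outlier_radius=5.0)
rejected = cluster.insert(100.0, timestamp=2, outlier_radius=5.0)
```

Because only the decayed sum and the decayed weight are stored, the summary stays constant in size and cheap to update, and the newer measurement pulls the center further than an unweighted mean would.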

Authors: Hassani M., Seidl T.
Published in: Journal of Digital Information Management (JDIM) Volume 10, No. 6, December 2012
Publisher: Digital Information Research Foundation
Language: EN
Year: 2012
Pages: 410-420
ISSN: 0972-7272
Conference: JDIM
Type: Journal article
Research area: Data Analysis and Knowledge Extraction