Distributed Weighted Clustering of Evolving Sensor Data Streams with Noise
Collecting data from sensor nodes is the ultimate goal of Wireless Sensor Networks. This is performed by transmitting the sensed measurements to some data collecting station. In sensor nodes, radio communication is the dominating consumer of the energy resources which are usually limited. Summarizing the sensed data internally on sensor nodes and sending only the summaries will considerably save energy. Clustering is an established data mining technique for grouping objects based on similarity. For sensor networks, center clustering aims at grouping sensor measurements in groups, each contains similar measurements.
In this article, we propose a novel resource-aware center clustering algorithm called: SenClu. Our algorithm immediately detects new trends in the drifting sensor data stream and follows them. SenClu powerfully uses a light-weighted decaying technique that gives lower influence to old data. As sensor data are usually noisy, our algorithm is also outlier-aware. In thorough experiments on drifting synthetic and real world data sets, we show that SenClu outperforms two state-of-the-art algorithms by producing higher clustering quality and following trends in the stream, while consuming nearly the same amount of energy.
|Authors:||Hassani M., Seidl T.|
|Published in:||Journal of Digital Information Management (JDIM) Volume 10, No. 6, December 2012|
|Publisher:||Digital Information Research Foundation|
Volume 10, Issue 6, December 2012
|Forschungsgebiet:||Data Analysis and Knowledge Extraction|