MOA: Massive Online Analysis, a Framework for Stream Classification and Clustering

Massive Online Analysis (MOA) is a software environment for implementing algorithms and running experiments for online learning from evolving data streams. MOA is designed to deal with the challenging problem of scaling up the implementation of state of the art algorithms to real world dataset sizes. It contains collection of offline and online for both classification and clustering as well as tools for evaluation. In particular, for classification it implements boosting, bagging, and Hoeffding Trees, all with and without Naive Bayes classifiers at the leaves. For clustering, it implements StreamKM++, CluStream, ClusTree, DenStream, DStream and CobWeb. Researchers benefit from MOA by getting insights into workings and problems of different approaches, practitioners can easily apply and compare several algorithms to real world data set and settings. MOA supports bi-directional interaction with WEKA, the Waikato Environment for Knowledge Analysis, and is released under the GNU GPL license.

Authors: Bifet A., Holmes G., Pfahringer B., Kranen P., Kremer H., Jansen T., Seidl T.
Published in: Journal of Machine Learning Research (JMLR) Workshop and Conference Proceedings, Volume 11: Workshop on Applications of Pattern Analysis
Publisher: Journal of Machine Learning Research
Language: EN
Year: 2010
Pages: 44-50
ISSN: 1938-7228
Conference: Workshop on Applications of Pattern Analysis
JMLR Proc. Vol. 11
Type: Conference papers (peer reviewed)
Research topic: Data Analysis and Knowledge Extraction