Efficient Multi-Step Query Processing for EMD-based Similarity

Similarity search in large multimedia databases requires efficient query processing based on suitable similarity models. Similarity models consist of a feature extraction step as well as a distance defined for these features, and they demand an efficient algorithm for retrieving similar objects from large databases. In this work, we focus on the Earth Movers Distance (EMD), a recently introduced similarity model which has been successfully employed in numerous applications and has been reported as well reflecting human perceptual similarity. As its computation is complex, the direct application of the EMD to large, high-dimensional databases is not feasible. To overcome this limitation and allow users to benefit from the high quality of the model even in larger settings, we developed various lower bounds for the EMD to be used in index-supported multistep query processing algorithms. We prove that our algorithms are complete, thus producing no false drops. We also show that it is highly efficient as experiments on large image databases with high-dimensional features demonstrate.

Authors: Assent I., Seidl T.
Published in: T. Crawford, R. Veltkamp (eds.): Content-Based Retrieval. Dagstuhl Seminar 06171
Publisher: Internationales Begegnungs- und Forschungszentrum für Informatik (IBFI), Schloss Dagstuhl, Germany - Dagstuhl, Germany
Language: EN
Year: 2006
ISSN: 1862-4405
Url:Dagstuhl Seminar Proceedings
Full Text PDF
Type: Miscellaneous (not reviewed)
Research topic: Exploration of Multimedia Databases