Search and Retrieval of Multi-modal Data Associated with Image-parts

Niloufar Pourian, S. Karthikeyan, B.S. Manjunath


We present a novel framework for querying multi-modal data from a heterogeneous database containing images, textual tags, and GPS coordinates. We construct a bi-layer graph structure from localized image-parts and their associated GPS locations and textual tags. The first-layer graphs group similar data points within a single modality using a spectral clustering algorithm. The second layer of our multi-modal network integrates the relationships between clusters of different modalities. The proposed network model enables flexible multi-modal queries on the database.
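The bi-layer idea can be illustrated with a minimal Python sketch. It is not the paper's implementation. It assumes the first-layer clusters have already been computed (the abstract names spectral clustering; here the cluster IDs are hard-coded toy values), and it assumes the second layer weights an edge between two clusters of different modalities by how many database items they share. The abstract does not specify that weighting, and the names `items`, `build_second_layer`, and `query` are hypothetical.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical first-layer output: for each database item, the cluster IDs
# its data points were assigned to, per modality. These are toy values, not
# the result of an actual spectral clustering run.
items = [
    {"image_part": ["ip0"], "tag": ["t_beach"], "gps": ["g_coast"]},
    {"image_part": ["ip0", "ip1"], "tag": ["t_beach"], "gps": ["g_coast"]},
    {"image_part": ["ip1"], "tag": ["t_city"], "gps": ["g_downtown"]},
    {"image_part": ["ip2"], "tag": ["t_city"], "gps": ["g_downtown"]},
]


def build_second_layer(items):
    """Connect clusters of different modalities.

    Assumed weighting: the number of items in which both clusters occur.
    """
    weights = defaultdict(int)
    for item in items:
        nodes = [(modality, c) for modality, cs in item.items() for c in cs]
        for a, b in combinations(nodes, 2):
            if a[0] != b[0]:  # keep cross-modal edges only
                weights[(a, b)] += 1
                weights[(b, a)] += 1
    return weights


def query(weights, source, target_modality):
    """Rank clusters of target_modality by edge weight to the source cluster."""
    scores = {
        b[1]: w
        for (a, b), w in weights.items()
        if a == source and b[0] == target_modality
    }
    # Highest weight first; ties broken by cluster ID for determinism.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))


weights = build_second_layer(items)
ranked = query(weights, ("tag", "t_beach"), "image_part")
print(ranked)
```

A query in any modality can be answered the same way by changing `source` and `target_modality`, for example starting from a GPS cluster and ranking tag clusters.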

Niloufar Pourian, S. Karthikeyan, B.S. Manjunath, International Conference on Image Processing, Sep. 2015.
Node ID: 679, Lab: VRL, Target: Conference
Subject: Managing Multimedia Databases
Subject: Multimedia Database Mining