Publications

Journal:

S. Newsam, L. Wang, S. Bhagavathy, and B. S. Manjunath, "Using texture to analyze and manage large collections of remote sensed image and video data," Journal of Applied Optics: Information Processing, vol. 46, no. 2, Jan 2004. Abstract

S. Newsam, S. Bhagavathy, L.M.G. Fonseca, C. Kenney and B.S. Manjunath, "Object-based Representations of Spatial Images", Acta Astronautica Vol. 48, No. 5-12, pp. 567-577, 2001. Abstract

Conference:

B. Sumengen, S. Bhagavathy, and B. S. Manjunath, "Graph Partitioning Active Contours for Knowledge-Based Geo-Spatial Segmentation," CVPR Workshop on Perceptual Organization in Computer Vision (POCV), Washington DC, Jun 2004. Abstract

M. Zuliani, S. Bhagavathy, B. S. Manjunath, and C. S. Kenney, "Affine-Invariant Curve Matching," International Conference on Image Processing, Singapore, Oct 24-27, 2004. Abstract

S. Newsam, L. Wang, S. Bhagavathy, and B. S. Manjunath, "Using Texture to Annotate Remote Sensed Datasets," Proceedings of 3rd International Symposium on Image and Signal Processing and Analysis, Rome, Italy, Sep 2003, pp.72-77. Abstract

J. Tesic, S. Bhagavathy, and B. S. Manjunath, "Issues Concerning Dimensionality and Similarity Search," Proceedings of 3rd International Symposium on Image and Signal Processing and Analysis, Rome, Italy, Sep 2003, pp. 272-277. Abstract

S. Bhagavathy, J. Tesic, and B. S. Manjunath, "On the Rayleigh Nature of Gabor Filter Outputs," International Conference on Image Processing, Barcelona, Spain, Sep 14-17, 2003. Abstract

S. Newsam, S. Bhagavathy, and B. S. Manjunath, "Object Localization Using Texture Motifs and Markov Random Fields," International Conference on Image Processing, Barcelona, Spain, Sep 14-17, 2003. Abstract

S. Newsam, S. Bhagavathy, and B.S. Manjunath, "Modeling Object Classes in Aerial Images Using Hidden Markov Models," International Conference on Image Processing, Rochester, September 22-25, 2002. Abstract

S. Bhagavathy, S. Newsam, and B.S. Manjunath, "Modeling Object Classes in Aerial Images Using Texture Motifs," International Conference on Pattern Recognition, Quebec City, August 11-15, 2002. Abstract

S. Newsam, S. Bhagavathy, L.M.G. Fonseca, C. Kenney and B.S. Manjunath, "Object-based Representations of Spatial Images", 51st International Astronautical Congress, Rio de Janeiro, Oct 2000. Abstract

K.R. Ramakrishnan, S.H. Srinivasan and S. Bhagavathy, "The Independent Components of Characters are 'Strokes'", Proceedings of the fifth International Conference on Document Analysis and Recognition, pp. 414-417, Sep 1999. Abstract

S. Bhagavathy and M. El-Saban, "SketchIt: Basketball Video Retrieval Using Ball Motion Similarity," Pacific-Rim Conference on Multimedia, Tokyo Waterfront City, Japan, Nov 30-Dec 3, 2004. Abstract

Top

Abstracts

Using texture to analyze and manage large collections of remote sensed image and video data

S. Newsam, L. Wang, S. Bhagavathy, and B. S. Manjunath
Journal of Applied Optics: Information Processing, vol. 46, no. 2, Jan 2004

Abstract:

We describe recent research into using the visual primitive of texture to analyze and manage large collections of remote sensed image and video data. Texture is regarded as the spatial dependence of pixel intensity. It is characterized by the amount of dependence at different scales and orientations, as measured with frequency-selective filters. A homogeneous texture descriptor based on the filter outputs is shown to enable (1) content-based image retrieval in large collections of satellite imagery, (2) semantic labeling and layout retrieval in an aerial video management system, and (3) statistical object modeling in geographic digital libraries.

Top

Object-based Representations of Spatial Images

S. Newsam, S. Bhagavathy, L.M.G. Fonseca, C. Kenney and B.S. Manjunath
Acta Astronautica Vol. 48, No. 5-12, pp. 567-577, 2001

Abstract:

Object based representations of image data enable new content-related functionalities while facilitating management of large image databases. Developing such representations for multi-date and multi-spectral images is one of the objectives of the second phase of the Alexandria Digital Library (ADL) project at UCSB. Image segmentation and image registration are two of the main issues that are to be addressed in creating localized image representations. We present in this paper some of the recent and current work by the ADL’s image processing group on robust image segmentation, registration, and the use of image texture for content representation. Built upon these technologies are techniques for managing large repositories of data. A texture thesaurus assists in creating a semantic classification of image regions. An object-based representation is proposed to facilitate data storage, retrieval, analysis, and navigation. image content descriptors. Perhaps the most noticeable result is the upcoming MPEG-7 standardization effort whose objective is to provide a set of standardized tools to describe multimedia content. However, the expectations placed on the ability of these descriptors to capture meaningful information must be tempered. The human visual system requires extensive learned experience to function as well as it does. This is especially true for an expert viewing a restricted domain of images.

Top

Graph Partitioning Active Contours for Knowledge-Based Geo-Spatial Segmentation

B. Sumengen, S. Bhagavathy, and B. S. Manjunath
CVPR Workshop on Perceptual Organization in Computer Vision (POCV), Washington DC, Jun 2004

Abstract:

Our contribution in this paper is two-fold. First, we extend our previous curve evolution method based on pair-wise similarities. This curve evolution equation combines the grouping abilities of active contours and graph partitioning techniques. Connections of our method to spectral graph partitioning are investigated and comparisons are made. Second, in a model-based segmentation scenario, we propose a method to improve segmentation quality by iteratively modifying the model using feed-back from segmentation of a labeled training set. Our purpose here is to segment objects in geo-spatial images by integrating domain knowledge with the segmentation method. We achieve our goal by combining a statistical model for the object with a knowledge-guided segmentation method. Experimental results show that this framework is effective for model- based segmentation of complex geo-spatial objects.

Top

Affine-Invariant Curve Matching

M. Zuliani, S. Bhagavathy, B. S. Manjunath, and C. S. Kenney
International Conference on Image Processing, Singapore, Oct 24-27, 2004

Abstract:

In this paper, we propose an affine-invariant method for describing and matching curves. This is important since affne transformations are often used to model perspective distortions. More specically, we propose a new definition of the shape of a curve that characterizes a curve independently of the effects introduced by a±ne distortions. By combining this definition with a rotation-invariant shape descriptor, we show how it is possible to describe a curve in an intrinsically affine- invariant manner. To validate our procedure we built a database of shapes subject to perspective distortions and plotted the precision-recall curve for this dataset. Finally an application of our method is shown in the context of wide baseline matching.

Top

Using Texture to Annotate Remote Sensed Datasets

S. Newsam, L. Wang, S. Bhagavathy, and B. S. Manjunath
Proceedings of 3rd International Symposium on Image and Signal Processing and Analysis, Rome, Italy, Sep 2003, pp.72-77

Abstract:

Texture remains largely underutilized in the analysis of remote sensed datasets compared to descriptors based on the orthogonal spectral dimension. This paper describes our recent efforts towards using texture to automate the annotation of remote sensed imagery. Two applications are described that use the homogeneous texture descriptor recently standardized by MPEG-7. In the first, higher-level access to remote sensed imagery is enabled by using the texture descriptor to model geo-spatial objects. In particular, the common textures, or texture motifs, are characterized as Gaussian mixtures in the high-dimensional feature space. In the second application, the texture descriptor is used to label regions in a large collection of aerial videography in a perceptually meaningful way. Gaussian mixtures are used to model the distribution of feature vectors for a variety of semantic classes. Frame level similarity retrieval based on semantic layout and semantic histogram is enabled by modeling the spatial arrangement of the labeled regions as a Markov random field.

Top

Issues Concerning Dimensionality and Similarity Search

J. Tesic, S. Bhagavathy, and B. S. Manjunath
Proceedings of 3rd International Symposium on Image and Signal Processing and Analysis, Rome, Italy, Sep 2003, pp. 272-277

Abstract:

Effectiveness and efficiency are two important concerns in using multimedia descriptors to search and access database items. Both are affected by the dimensionality of the descriptors. While higher dimensionality generally increases effectiveness, it drastically reduces efficiency of storage and searching. With regard to effectiveness, relevance feedback is known to be a useful tool to squeeze information from a descriptor. However, not much has been done toward enabling relevance feedback computation using high-dimensional descriptors over a large multimedia dataset. In this context, we have developed new methods that enable us to a) reduce the dimensionality of Gabor texture descriptors without losing on effectiveness, and b) perform fast nearest neighbor search based on the information available during each iteration of a relevance feedback step. Experimental results are presented on real datasets.

Top

On the Rayleigh Nature of Gabor Filter Outputs

S. Bhagavathy, J. Tesic, and B. S. Manjunath
International Conference on Image Processing, Barcelona, Spain, Sep 14-17, 2003

Abstract:

Texture has been recognized as an important visual primitive in image analysis. A widely used texture descriptor, which is part of the MPEG-7 standard, is that computed using multiscale Gabor filters. The high dimensionality and computational complexity of this descriptor adversely affect the efficiency of content-based retrieval systems. We propose a modified texture descriptor that has comparable performance, but with nearly half the dimensionality and less computational expense. This gain is based on a claim that the distribution of (absolute values of) filter outputs have a strong tendency to be Rayleigh. Experimental results show that the dimensionality can be reduced by almost 50%, with a tradeoff of less than 3% on the error rate. Furthermore, it is easy to compute the new feature using the old one, without having to repeat the computationally expensive filtering step. We also propose a new normalization method that improves similarity retrieval and indexing efficiency.

Top

Object Localization Using Texture Motifs and Markov Random Fields

S. Newsam, S. Bhagavathy, and B. S. Manjunath
International Conference on Image Processing, Barcelona, Spain, Sep 14-17, 2003

Abstract:

This work presents a novel approach to object localization in complex imagery. In particular, the spatial extents of objects characterized by distinct spatial signatures at multiple scales are estimated by using statistical models to control a simple region growing process. Texture motifs are used to model the spatial signatures at the smallest, or pixel, scale. Markov random fields are used to model the spatial signatures at the larger, or motif, scale. These models are used to iteratively expand a bounding box to approximate the spatial extent of an object. The approach is applied to localizing geo-spatial objects in high-resolution panchromatic aerial imagery.

Top

Modeling Object Classes in Aerial Images Using Hidden Markov Models

S. Newsam, S. Bhagavathy, and B.S. Manjunath
International Conference on Image Processing, Rochester, September 22-25, 2002

Abstract:

We propose a canonical model for object classes in aerial images. This model is motivated by the observation that geographic regions of interest are characterized by collections of texture motifs corresponding to the geographic processes that generate them. We show that this model is effective in learning the common texture themes, or motifs, of the object classes.

Top

Modeling Object Classes in Aerial Images Using Texture Motifs

S. Bhagavathy, S. Newsam, and B.S. Manjunath
International Conference on Pattern Recognition, Quebec City, August 11-15, 2002

Abstract:

We propose a canonical model for object classes in aerial images. This model is motivated by the observation that geographic regions of interest are characterized by collections of texture motifs corresponding to the geographic processes that generate them. We show that this model is effective in learning the common texture themes, or motifs, of the object classes.

Top

The Independent Components of Characters are 'Strokes'

K.R. Ramakrishnan, S.H. Srinivasan and S. Bhagavathy
Proceedings of the fifth International Conference on Document Analysis and Recognition, pp. 414-417, Sep 1999

Abstract:

What are the natural features of hand-written characters and how to arrive at them automatically? We apply independent components analysis on hand-written characters. Independent components analysis extracts the underlying statistically independent signals from a mixture of them. We expect strokes to be the independent components of hand-written characters. Our findings show that stroke-like features emerge as a result of the analysis confirming the above intuition. This finding is significant since it gives an automatic procedures for extracting stroke-like features from multilingual character data sets. We use these features for handwritten digit recognition using a very simple classifier. The classifier is chosen to be simple so that the quality of input feature set can be evaluated. The recognition results indicate that the features arrived at by independent component analysis are useful.

Top

SketchIt: Basketball Video Retrieval Using Ball Motion Similarity

S. Bhagavathy and M. El-Saban
Pacific-Rim Conference on Multimedia, Tokyo Waterfront City, Japan, Nov 30-Dec 3, 2004.

Abstract:

A prototype basketball video retrieval system is presented in this report. Retrieval is based on the similarity of ball motion in the clip with that in the query. The system uses a query-by-sketch paradigm, where the user provides a sketch of the desired ball trajectory. The video data is pre-processed to make the ball motion invariant to camera translation. The next stage is dimensionality reduction wherein we model the ball motion as a set of parabolic trajectories. An R-tree is used to index these parabolic representations and search for similar trajectories in a low dimension parametric space. The query is processed to obtain its parametric representation, and a nearest neighbor search is performed for similar parabolas. These query results are then post- processed by assigning scores based on various similarity criteria. The system could be extended to other types of videos and moving objects. As a proof of concept, the system was tested for ball trajectories in basketball video.