Learning regional semantic concepts from incomplete annotation

Milind R. Naphade; John R. Smith

ICIP 2003

Conference paper

17 Dec 2003

Learning regional semantic concepts from incomplete annotation

Abstract

For Multimedia Retrieval to be effective, the semantic gap needs to be bridged. Statistical learning techniques provide a robust framework for learning representations of semantic concepts from visual features. The bottleneck is the need to annotate a large number of training samples to construct robust models. We present a novel approach where the annotations may be entered at coarser spatial granularity while the concept may still be learnt at finer granularity. This can speed up annotation significantly and provide bootstrapping. We show that it is possible to learn representations of concepts occurring at the regional level by using annotations for several images, where the annotations are provided only at the global level. The disambiguation can be handled by the multiple instance learning paradigm. We demonstrate this using the TREC 2001 Corpus for the concept Sky.

Conference paper