Automatically identifying known software problems
Natwar Modani, Rajeev Gupta, et al.
ICDEW 2007
Research in the field of content-based retrieval has primarily focused on image, video and audio information. In this paper, we demonstrate content-based retrieval in a new data domain called gene expression data derived from gene chip images. In particular, we consider the problem of retrieving functionally similar genes from a database based on the pattern of variation of the expression of genes over time. Specifically, we model the time-varying gene expression patterns as curves, and analyze similarity between gene profiles by the relative amounts of twists and turns produced in a higher-dimensional curve formed from the projection of the individual gene profiles. Scale-space analysis is used to detect the sharp twists and turns and their relative strength with respect to the component curves is estimated to form a shape similarity measure between gene profiles. The higher-dimensional curves also form prototypical descriptions of the individual gene profiles, serving as a way to index the database using clustering. Functionally similar genes are then identified using scale-space distance metric on the cluster prototypes.
Natwar Modani, Rajeev Gupta, et al.
ICDEW 2007
Rodrigo Bonazzola, Enzo Ferrante, et al.
Nature Machine Intelligence
Ken C. L. Wong, Hongzhi Wang, et al.
IEEE T-MI
Ritwik Kumar, Tanveer Syeda-Mahmood, et al.
AMIA ... Annual Symposium proceedings / AMIA Symposium. AMIA Symposium