Publication
HICSS 2000
Conference paper
Who's who? Identifying concepts and entities across multiple documents
Abstract
A number of research and software development groups have developed technology for identifying terms and names in documents and associating them with concepts and named entities, but few have addressed coreference of concepts and entities across multiple documents in a collection. Cross-document coreference is challenging, since a collection of documents consists of multiple discourse contexts, with a many-to-many correspondence between terms and names on one hand and the concepts and entities they refer to on the other. In this paper we describe extensions to our intra-document term and name identification for coreferencing concepts and entities across documents.