Conference paper
Self-similarity in the web
Stephen Dill, Ravi Kumar, et al.
VLDB 2001
In this paper, we describe the design, architecture, and the lessons learned from the implementation of a fast regular expression indexing engine FREE. FREE uses a pre-built index to identify the text data units which may contain a matching string and only examines these further. In this way, FREE shows orders of magnitude performance improvement in certain cases over standard regular expression matching systems, such as lex, awk and grep.
Stephen Dill, Ravi Kumar, et al.
VLDB 2001
Sridhar Rajagopalan, Leonard J. Shchulman
SIAM Journal on Computing
Junghoo Cho, Hector Garcia-Molina, et al.
ACM TOIT
Ravi Kumar, Prabhakar Raghavan, et al.
Journal of Computer and System Sciences