Redundancy elimination within large collections of filesPurushottam KulkarniFred Dougliset al.2004USENIX ATC 2004