An n-gram-based approach for detecting approximately duplicate database recordsZengping TianHongjun Luet al.2000IJDL