共 8 条
- [1] An Efficient Domain-Independent Algorithm for Detecting Approximately Duplicate Database Records. Monge A,Elkan C. Proceedings of SIGMOD Workshop on Research Issues on Data Mining and Knowledge Discovery . 1997
- [2] The Merge/Purge Problem for Large Databases. Hernandez M,Stolfo S. Proceedings of the ACM SIGMOD International Conference on Management of Data . 1995
- [3] IntelliClean: A Knowledge-based Intelligent Data Cleaner. Lee ML,Ling TW,Low WL. Proceedings of SIGMOD Workshop on Research Issues on Data Mining and Knowledge Discovery . 2000
- [4] Term-Weighting Approaches in Automatic Text Retrieval. Salton G,Buckley C. Information Processing Letters . 1988
- [5] AlphaSort: A RISC Machine Sort. Nyberg C,Barclay T,Cvetanovic Z,et al. Proceedings of the 1994 ACM- SIGMOD Conference . 1994
- [6] Binary Codes Capable of Correcting Deletions, Insertions and Reversals. Levenshtein V. Soviet Physics-Doklady10 . 1966
- [7] DynamicInvertedIndexesforaDistributedFull TextRetrievalSystem. ClarkeCLA,CormackGV. TechnicalReportMT- 95-01 .
- [8] Efficient Clustering of High-Dimensional Data Sets with Application to Reference Matching. McCallum A,Nigam K,Ungar L. Proceedings of the Sixth International Conference on Knowledge Discovery and Data Mining . 2000