全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

Token-based method of blocking records for large data warehouse

Keywords: Data Warehouse , Record Linkage , Token , Blocking Records , Record Comparisons , Duplicate Data

Full-Text   Cite this paper   Add to My Lib

Abstract:

Record linkage is a critical problem in duplicate data elimination. It is used to detect and eliminateduplicate data. The elimination of duplicate data will increase the quality of data. Record Linkage problem willtake high computational cost because of the large number of record comparisons. The comparison of records isinefficient in large data warehouses. Blocking methods are used to group the records to minimize the number ofrecord comparisons. This paper explains the existing blocking methods and its comparison and discusses theselection of token-based blocking key for record comparisons.

Full-Text

comments powered by Disqus

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133

WeChat 1538708413