An Introduction to Duplicate Detection

An Introduction to Duplicate Detection

Versandkostenfrei!
Versandfertig in 6-10 Tagen
26,74 €
inkl. MwSt.
Weitere Ausgaben:
PAYBACK Punkte
0 °P sammeln!
With the ever increasing volume of data, data quality problems abound. Multiple, yet different representations of the same real-world objects in data, duplicates, are one of the most intriguing data quality problems. The effects of such duplicates are detrimental; for instance, bank customers can obtain duplicate identities, inventory levels are monitored incorrectly, catalogs are mailed multiple times to the same household, etc. Automatically detecting duplicates is difficult: First, duplicate representations are usually not identical but slightly differ in their values. Second, in principle ...