
Efficient Duplicate Detection Over Massive Data Sets



This is the fourth presentation in the Data Quality module, and I am presenting it today.


Topics:
big data, bigdata, duplication detection

Published at DZone with permission of Pradeeban Kathiravelu, DZone MVB. See the original article here.

