Over a million developers have joined DZone.
{{announcement.body}}
{{announcement.title}}

Data Cleaning

DZone's Guide to

Data Cleaning

· Big Data Zone ·
Free Resource

Hortonworks Sandbox for HDP and HDF is your chance to get started on learning, developing, testing and trying out new features. Each download comes preconfigured with interactive tutorials, sample data and developments from the Apache community.

This is the second presentation for the Data Quality module of my PhD. Please refer to the references indicated for a more in-depth analysis, as this presentation is entirely based on them.


Hortonworks Community Connection (HCC) is an online collaboration destination for developers, DevOps, customers and partners to get answers to questions, collaborate on technical articles and share code examples from GitHub.  Join the discussion.

Topics:
bigdata ,big data ,data cleaning

Published at DZone with permission of

Opinions expressed by DZone contributors are their own.

{{ parent.title || parent.header.title}}

{{ parent.tldr }}

{{ parent.urlSource.name }}