Over a million developers have joined DZone.
{{announcement.body}}
{{announcement.title}}

Data Lakes: Managed Ingestion [Video]

DZone's Guide to

Data Lakes: Managed Ingestion [Video]

Check out this video to learn about the variation across data sources and the Hadoop distribution chosen.

· Big Data Zone
Free Resource

Learn best practices according to DataOps. Download the free O'Reilly eBook on building a modern Big Data platform.

The classic method for data ingestion in Hadoop relies on a number of different technologies each with its own configuration and scaling issues. These technologies require expertise to correctly ingest the data and ensure the ingest meets the SLAs of the organization.

The video below shows the variation across data sources and the Hadoop distribution chosen.


Managed ingestion is more than just using scripts to automate the movement of data into your Data Lake. This means not only having a defined repeatable process for data ingest but also having a way to manage the ingestion.

A managed process of ingestion gives you the tooling to programmatically address a consistent process of data movement into your system. In cases where that process has issues, it provides the means for delving into the root cause of the failure.

For more information on managed ingestion and data lakes, visit Data Lake 360° or more from our blog.

Find the perfect platform for a scalable self-service model to manage Big Data workloads in the Cloud. Download the free O'Reilly eBook to learn more.

Topics:
data lakes ,data sources ,ingestion ,big data

Published at DZone with permission of Adam Diaz, DZone MVB. See the original article here.

Opinions expressed by DZone contributors are their own.

{{ parent.title || parent.header.title}}

{{ parent.tldr }}

{{ parent.urlSource.name }}