Hadoop to Relational and Back Again: Apache Sqoop Performance Tuning

Apache Sqoop can transfer large amounts of data between Hadoop and datastores, like relational databases. Here's an overview of tuning Sqoop for optimal performance.

Apache Sqoop is a tool designed for efficiently transferring bulk data between Apache Hadoop and structured datastores such as relational databases.

This presentation takes a deep dive into tips for improving the performance of Apache Sqoop. It also covers factors that Sqoop does not directly control but that can still affect the overall performance of Sqoop flows.
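
As a rough illustration of the kind of settings such tuning usually touches, the sketch below assembles a `sqoop import` command line with explicit parallelism and fetch-size options. The helper name `build_sqoop_import`, the JDBC URL, the table, the paths, and the numeric values are all hypothetical and are not taken from the presentation; only the Sqoop options themselves (`--connect`, `--table`, `--target-dir`, `--split-by`, `--num-mappers`, `--fetch-size`) are standard Sqoop 1 import arguments. The script only prints the command and does not run Sqoop.

```python
import shlex


def build_sqoop_import(jdbc_url, table, target_dir, split_by,
                       num_mappers=4, fetch_size=1000):
    """Return the argument list for a `sqoop import` run (nothing is executed).

    Hypothetical helper for illustration; the defaults are placeholders,
    not recommended values.
    """
    return [
        "sqoop", "import",
        "--connect", jdbc_url,
        "--table", table,
        "--target-dir", target_dir,
        # Column used to partition the table across parallel map tasks.
        "--split-by", split_by,
        # Number of parallel map tasks (and database connections).
        "--num-mappers", str(num_mappers),
        # Number of rows read from the database per round trip.
        "--fetch-size", str(fetch_size),
    ]


if __name__ == "__main__":
    args = build_sqoop_import(
        jdbc_url="jdbc:mysql://db.example.com/sales",  # hypothetical database
        table="orders",                                # hypothetical table
        target_dir="/data/raw/orders",                 # hypothetical HDFS path
        split_by="order_id",                           # hypothetical key column
        num_mappers=8,
    )
    # Print a shell-quoted command line for review instead of executing it.
    print(shlex.join(args))
```

Whether more mappers or a larger fetch size actually helps depends on the source database, the network, and the cluster, so values like these are best treated as starting points to measure against.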

For a high-level overview of Apache Sqoop and other Apache Hadoop-related technologies, check out the following two-part series on DZone:

Part 1 https://dzone.com/articles/techtalk-apache-hadoop-and-related-technologies-fo

Part 2 https://dzone.com/articles/the-hadoop-ecosystem-in-30-minutes-part-2

bigdata, analytics, performance tuning
