
Deploying Progress DataDirect Hybrid Data Pipeline on Amazon Lightsail


With the explosion in the use of cloud computing platforms like Azure, AWS, etc., we put together this tutorial to help you deploy Hybrid Data Pipeline on Amazon Lightsail.


Progress DataDirect Hybrid Data Pipeline is a data access service for enterprises and cloud ISVs that is designed to simplify and streamline data integration between cloud, mobile, and on-premises sources through a secure, firewall-friendly integration. It abstracts away the complexity that has followed the recent explosion of data, which allows developers to build data-centric applications faster and manage them more easily. SaaS ISVs can drive new wins through integration with customers' legacy applications and data. IT can immediately provide a plug-and-play solution that extends the reach of BI and ETL, RESTifies any database to improve developer productivity, and accelerates the delivery of new services.

You can deploy Hybrid Data Pipeline on your servers anywhere in the world. With the explosion in the use of cloud computing platforms like Azure, AWS, etc., we put together this tutorial to help you deploy Hybrid Data Pipeline on Amazon Lightsail.

Setting Up a VM in Amazon Lightsail

  1. To get started, you need an Amazon Lightsail account. If you don't have one, register for one and log in to the portal.
  2. Once you have logged in to the portal, create a new instance that has at least 8 GB of memory and 60 GB of disk space. Choose OS-Only mode and select Amazon Linux as the operating system.
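If you prefer the command line to the portal, the sizing requirement above can be checked against the available Lightsail plans. The following is a minimal sketch, assuming the AWS CLI is installed and configured; the function names are hypothetical, and the 8 and 60 thresholds simply mirror the minimums stated above.

```shell
#!/usr/bin/env bash
# Sketch only: helper names are illustrative, not part of any Progress or AWS tooling.

# Build a JMESPath filter that keeps bundles with at least the given
# memory (GB) and disk (GB).
bundle_query() {
  local min_ram="$1" min_disk="$2"
  printf 'bundles[?ramSizeInGb >= `%s` && diskSizeInGb >= `%s`].[bundleId, ramSizeInGb, diskSizeInGb]\n' \
    "$min_ram" "$min_disk"
}

# List the matching bundles as a table (requires the AWS CLI and credentials).
list_matching_bundles() {
  aws lightsail get-bundles --query "$(bundle_query "$1" "$2")" --output table
}
```

Calling `list_matching_bundles 8 60` should print the bundle IDs you could pick from when creating the instance.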

Installing Hybrid Data Pipeline

  1. To get an evaluation copy of Hybrid Data Pipeline, visit the Progress download page and fill in your details to download the installer.
  2. Copy the installer to the Lightsail instance using scp, or a program like WinSCP if you are on Windows.
  3. Connect to your Lightsail instance using PuTTY or the SSH console on Lightsail.
  4. Once you have logged in, you should be able to find the installer that you have copied from your local machine in the home folder of the user: ec2-user.
  5. If the installer package is not executable, run the following command to make the package executable.

    chmod +x PROGRESS_DATADIRECT_HDP_SERVER_LINUX_64_INSTALL.bin

  6. To start the installation, run the following command, which starts the installer in console mode.

    ./PROGRESS_DATADIRECT_HDP_SERVER_LINUX_64_INSTALL.bin

  7. During the installation, read the License Agreement and accept it to continue.
  8. By default, the installation directory is the following. You are free to change it.

    /home/users/<username>/Progress/DataDirect/Hybrid_Data_Pipeline/Hybrid_Server

  9. Choose the type of installation when prompted. If you are trying out Hybrid Data Pipeline, choose Evaluation. If you have purchased a license, choose Licensed Installation and enter your license key to proceed.
  10. When the installer asks for the hostname of your server, enter the fully qualified domain name or the IP address of the Lightsail VM. The installer tries to validate the hostname, but the validation will fail. Ignore the validation failure and proceed with the installation.
  11. When the installer prompts for an SSL certificate file, select No to use the self-signed trust store that is included with the installation. If you have an SSL certificate file that you want to use instead, select Yes and provide its path.
  12. To use the default settings, choose Typical installation (1). To configure the installation with your own settings, choose Custom installation (2).
  13. Next, you should see the Ready to Install summary with all the settings you have chosen. Press Enter to install Hybrid Data Pipeline with those settings.
  14. After the installation is complete, you will see an Install Complete message. Press Enter to exit the installer.
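The copy and permission steps above could be scripted roughly as follows. This is a minimal sketch under stated assumptions: the key file and host passed to `print_copy_command` are hypothetical placeholders, and the helper names are illustrative. The copy command is printed rather than executed so you can review it first.

```shell
#!/usr/bin/env bash
# Sketch only: key file and host values are placeholders you supply yourself.
INSTALLER="PROGRESS_DATADIRECT_HDP_SERVER_LINUX_64_INSTALL.bin"

# Print the scp command that would copy the installer into the
# home folder of ec2-user on the Lightsail instance.
print_copy_command() {
  local key_file="$1" host="$2"
  printf 'scp -i %s %s ec2-user@%s:~/\n' "$key_file" "$INSTALLER" "$host"
}

# On the instance: make the installer executable only if it is not already.
# Returns non-zero if the file does not exist.
ensure_executable() {
  local file="$1"
  if [ ! -f "$file" ]; then
    echo "installer not found: $file" >&2
    return 1
  fi
  [ -x "$file" ] || chmod +x "$file"
}
```

On the instance, `ensure_executable "$HOME/$INSTALLER" && "$HOME/$INSTALLER"` would then launch the console installer, which still has to be answered interactively.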

Amazon Lightsail Dashboard

Next, head over to the Amazon Lightsail dashboard, open the instance, and under Networking add firewall rules that allow connections to Hybrid Data Pipeline, for example on TCP port 8443, which the login URL in the next section uses.
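The same firewall rule can also be added with the AWS CLI instead of the dashboard. A minimal sketch, assuming the AWS CLI is configured for your account; the instance name is a hypothetical placeholder, and port 8443 is taken from the login URL used in this tutorial. The helper only prints the command so it can be reviewed before running.

```shell
#!/usr/bin/env bash
# Sketch only: the instance name you pass in is a placeholder, not a value from the article.

# Print the AWS CLI call that opens a single TCP port on a Lightsail instance.
open_port_command() {
  local instance="$1" port="$2"
  printf 'aws lightsail open-instance-public-ports --instance-name %s --port-info fromPort=%s,toPort=%s,protocol=tcp\n' \
    "$instance" "$port" "$port"
}
```

For example, `open_port_command my-hdp-instance 8443` prints the command for the HTTPS port; paste it into your shell once the instance name is correct.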

Using Hybrid Data Pipeline Server

Open your browser and go to https://ip_address_or_dns_name:8443, which opens the Hybrid Data Pipeline login screen. Use d2cadmin/d2cadmin as the username and password to log in to the Hybrid Data Pipeline dashboard.
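Before opening the browser, you may want to confirm from a terminal that the server answers on port 8443. A minimal sketch, assuming curl is available; the helper names are hypothetical, and `-k` is used only because the installation in this tutorial relies on the bundled self-signed certificate.

```shell
#!/usr/bin/env bash
# Sketch only: pass your own IP address or DNS name as the argument.

# Build the login URL for a given host.
hdp_url() {
  printf 'https://%s:8443\n' "$1"
}

# Print the HTTP status code returned by the server.
# -k skips certificate verification (self-signed certificate),
# -s silences progress output, -o /dev/null discards the body.
check_login_page() {
  curl -k -s -o /dev/null -w '%{http_code}\n' "$(hdp_url "$1")"
}
```

If `check_login_page <your-host>` times out instead of printing a status code, the firewall rule on the Lightsail instance is the first thing to recheck.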

Next Steps

Now that you have successfully deployed Hybrid Data Pipeline, feel free to configure your data sources in it and integrate the data in your applications using its standards-based ODBC and JDBC connectivity or its OData-based REST API.

Note that the trial is valid for 90 days, and you will have complete access to use any of the data stores. To learn more about Progress DataDirect Hybrid Data Pipeline, you can visit this page or watch this short video overview.



Published at DZone with permission of

Opinions expressed by DZone contributors are their own.
