Over a million developers have joined DZone.

BigData Workflows Made Easy -> Glue

DZone's Guide to

BigData Workflows Made Easy -> Glue

Free Resource

Access NoSQL and Big Data through SQL using standard drivers (ODBC, JDBC, ADO.NET). Free Download 

Glue is a job execution engine, written in Java and Groovy. 

Workflows are written in Groovy DSL / Jython / Clojure / JRuby and use pre-developed modules to interact with external resources e.g. DBs, Hadoop, Netezza, FTP etc.

Glue is not XML, and is not a BI tool, but rather a tool that allows programmers to write workflows in a production environment using any of the supported languages. 

The nicest thing about glue is its modules that allows you to interact with DBs Hadoop Clusters etc using tested methods and which can be setup once and re-used in each workflow, this abstracts the configuration away from the workflows and saves tons of time spent debugging.

Another cool feature is the ability to run data-driven workflows from hadoop, i.e. you can register N workflows to a HDFS directory path and have those workflows run automatically as data arrives in that directory.



The fastest databases need the fastest drivers - learn how you can leverage CData Drivers for high performance NoSQL & Big Data Access.


Opinions expressed by DZone contributors are their own.

{{ parent.title || parent.header.title}}

{{ parent.tldr }}

{{ parent.urlSource.name }}