
Streaming SQL, Spark 2.0, Machine Learning and Deep Learning Resources [Video]


Recently updated and completed Big Data presentations and projects, with a focus on Machine Learning, Spark, NiFi, Deep Learning, and streaming data.



For corporate Sparkists: SAP HANA is a mainstay of corporate environments, and this article details accessing HANA views from Spark. For those new to Spark, this is a nice introduction to Spark. For a great way to write and run your Spark code, try Zeppelin.

Here is a very interesting talk on Streaming SQL from Flink Forward 2016 Berlin. Julian Hyde has been doing some really cool low-level SQL work.

For NiFi, this is a nice article on setting up zero-master clusters. I also wrote up this use case for NiFi: Reading Sensor Data from Raspberry Pis with Apache NiFi 1.0.0. For transmitting data once it leaves a database, try CDC (change data capture).

RapidMiner on HDP

Apache Metron Cybersecurity Meetup - Sept. 7, 2016

In the world of Machine Learning with Spark, this is an awesome introduction to XGBoost with Spark 2.0, with detailed install instructions. Here is an interesting Deep Learning framework called MXNet by the makers of HiveMall (Scalable Machine Learning on Hive/Hadoop) for Spark 2.0. For Sparkling Water 2.0, Apache Spark with H2O Machine Learning is a great presentation on the what, why, and how of using this advanced ML/DL framework with Spark.

The following Apache PredictionIO introduction is a great presentation on how to use this interesting machine learning server, an Apache Incubator project that uses Spark and HBase.

Finally, for the ultimate in hip frameworks, there is TensorFrames: Apache Spark with Google TensorFlow.

