For corporate Sparkists, SAP HANA is a mainstay for corporate environments. This article details accessing Hana Views From Spark. For those new to Spark, this is a nice Introduction to Spark. For a great way to write and run your Spark code, try Zeppelin.
A very interesting talk on Streaming SQL from FlinkForward 2016 Berlin, Julian Hyde has been doing some really cool low-level SQL .
For NiFi, this is a nice article for setting up zero master clusters. I wrote this nice use case for NiFi: Reading Sensor Data from Raspberry Pis with Apache NiFi 1.0.0. For transmitting data once it leaves a database, try CDC.
RapidMiner on HDP
Apache Metron Cybersecurity Meetup - Sept. 7, 2016
In the world of Machine Learning with Spark, this is an awesome introduction to XGBoost with Spark 2.0, with detailed install instructions. Here is an interesting Deep Learning framework called MXNet by the makers of HiveMall (Scalable Machine Learning on Hive/Hadoop) for Spark 2.0. For Sparking Water 2.0, Apache Spark with H2O Machine Learning is a great presentation on the what, why and how of using this advanced ML/DL framework with Spark.
The following Apache PredictionIO Introduction is a great presentation on how to use this interesting Apache incubated Machine Learning Server that uses Spark and HBase.
Finally, for the ultimate in hip frameworks, TensorFrames: Apache Spark with Google TensorFlow.