The presentations from the recent Hadoop Summit are up online. Hadoop Summit 2016 Dublin had a ton of great presentations on Hadoop, Apache Atlas, Spark, HDFS, TensorFlow, Machine Learning, NLP, Apache Zeppelin, HBase, Phoenix, Apache Hive, YARN and Docker. Running HDP in Containers.
The Spark-HBase Connector lets Spark access HBase tables as external data sources or sinks. You can then run Spark SQL against it with Data Frame support and Catalyst optimization. With it, users can operate HBase with Spark-SQL on a data frame level.
There is also recordings of many of the excellent talks, just like being there without the Guiness!
Apache Phoenix and HBase: Past, Present, and Future of SQL over HBase
Using Natural Language Processing On Non Textual Data
Zeppelin Livy Bringing Multi Tenancy to Interactive Data Analysis
Apache Zeppelin: Helium and Beyond
Why Big Data Management Requires Hierarchical Taxonomies
Apache Hive 2 0 SQL Speed Scale
Evolving HDFS to a Generalized Distributed Storage Subsystem
Apache Hadoop YARN and the Docker Container Runtime
TensorFlow Large Scale Deep Learning For Intelligent Computer Systems