The presentations from the recent Hadoop Summit are now online. Hadoop Summit 2016 Dublin had a ton of great presentations on Hadoop, Apache Atlas, Spark, HDFS, TensorFlow, machine learning, NLP, Apache Zeppelin, HBase, Phoenix, Apache Hive, YARN, Docker, and running HDP in containers.
The Spark-HBase Connector lets Spark use HBase tables as external data sources or sinks. You can then run Spark SQL against those tables with DataFrame support and Catalyst optimization, so users can work with HBase at the DataFrame level.
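To make the DataFrame-level access a bit more concrete, here is a rough PySpark sketch. It assumes the Hortonworks connector (SHC), which describes each HBase table with a JSON catalog and registers the data source name `org.apache.spark.sql.execution.datasources.hbase`. The table name `sensor_readings`, its column families, and the helpers `build_catalog` and `query_hot_sensors` are made up for illustration, and the query function assumes Spark 2.x or later with the connector jar on the classpath. Treat it as a sketch, not a reference setup.

```python
import json

# Data source name used by the Hortonworks Spark-HBase Connector (SHC).
SHC_FORMAT = "org.apache.spark.sql.execution.datasources.hbase"


def build_catalog(table, row_key, columns, namespace="default"):
    """Return an SHC-style JSON catalog mapping DataFrame columns to HBase cells.

    `columns` maps DataFrame column name -> (column family, qualifier, type).
    The row key column uses the special column family name "rowkey".
    """
    return json.dumps({
        "table": {"namespace": namespace, "name": table},
        "rowkey": row_key,
        "columns": {
            name: {"cf": cf, "col": col, "type": dtype}
            for name, (cf, col, dtype) in columns.items()
        },
    })


# Hypothetical table layout, for illustration only.
catalog = build_catalog(
    table="sensor_readings",
    row_key="key",
    columns={
        "sensor_id": ("rowkey", "key", "string"),
        "temperature": ("metrics", "temp", "double"),
        "location": ("info", "loc", "string"),
    },
)


def query_hot_sensors(spark, catalog, threshold=30.0):
    """Load the HBase table as a DataFrame and query it with Spark SQL.

    Needs a live SparkSession with the connector available and a
    reachable HBase cluster, so it is defined here but not called.
    """
    df = spark.read.options(catalog=catalog).format(SHC_FORMAT).load()
    df.createOrReplaceTempView("readings")
    return spark.sql(
        "SELECT sensor_id, temperature FROM readings "
        f"WHERE temperature > {threshold}"
    )
```

With a running cluster you would call `query_hot_sensors(spark, catalog).show()`. The catalog is the piece that lets Spark SQL treat HBase column families and qualifiers as ordinary DataFrame columns.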