Big Data Spain Talk About KSQL, the Streaming SQL Engine for Apache Kafka

An expert gives an overview of the talk he gave at the 2018 Big Data Spain conference about why KSQL is so useful and how it pairs with Kafka.

In November 2018, I was back in Madrid to speak at Big Data Spain, a great event all about big data, analytics, and machine learning, and one of the largest tech conferences in Spain. It was a perfect event to talk about KSQL, the streaming SQL engine for Apache Kafka.

Big Data Spain is held in Kinepolis, a big cinema. It is one of my favorite locations for a tech conference, for speakers and the audience alike.

All the talks at Big Data Spain are recorded. The video recording and slides are given below.

KSQL: The Open Source SQL Streaming Engine for Apache Kafka

My talk was an update on KSQL. The slide deck describes various use cases for KSQL. I also included some advanced topics such as user-defined functions (UDFs). Here is the abstract:

The rapidly expanding world of stream processing can be daunting, with new concepts such as various types of time semantics, windowed aggregates, changelogs, and programming frameworks to master.

KSQL is an open-source, Apache 2.0 licensed streaming SQL engine on top of Apache Kafka which aims to simplify all this and make stream processing available to everyone. Even though it is simple to use, KSQL is built for mission-critical and scalable production deployments (using Kafka Streams under the hood).

Benefits of using KSQL include: no coding required; no additional analytics cluster needed; streams and tables as first-class constructs; access to the rich Kafka ecosystem. This session introduces the concepts and architecture of KSQL. Use cases such as streaming ETL, real-time stream monitoring, and anomaly detection are discussed. A live demo shows how to set up and use KSQL quickly and easily on top of your Kafka ecosystem.

Key takeaways:

  • KSQL includes access to the rich Apache Kafka ecosystem and is suitable for various use cases, including streaming ETL, real-time stream monitoring, and anomaly detection.
  • KSQL allows us to use stream processing without coding and without an additional analytics cluster.
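
To make the windowed-aggregate and anomaly-detection use cases from the abstract a bit more concrete, here is a minimal plain-Python sketch of the underlying idea. It is not KSQL and it is not taken from the talk: the event layout, the function names `tumbling_window_counts` and `flag_anomalies`, and the threshold are all assumptions chosen for illustration. It counts events per key in fixed, non-overlapping (tumbling) windows and flags any key that exceeds a threshold within one window.

```python
from collections import defaultdict


def tumbling_window_counts(events, window_ms):
    """Count events per (key, window_start) for fixed, non-overlapping windows.

    `events` is an iterable of (timestamp_ms, key) pairs (hypothetical layout).
    """
    counts = defaultdict(int)
    for timestamp_ms, key in events:
        # Align each event to the start of the window that contains it.
        window_start = timestamp_ms - (timestamp_ms % window_ms)
        counts[(key, window_start)] += 1
    return dict(counts)


def flag_anomalies(counts, threshold):
    """Return the (key, window_start) pairs whose count exceeds the threshold."""
    return sorted(pair for pair, n in counts.items() if n > threshold)


# Hypothetical payment events: (event time in ms, card id).
events = [
    (1_000, "card-1"),
    (2_000, "card-1"),
    (3_000, "card-1"),
    (4_000, "card-1"),
    (2_500, "card-2"),
    (65_000, "card-1"),
]

counts = tumbling_window_counts(events, window_ms=60_000)
suspicious = flag_anomalies(counts, threshold=3)
print(suspicious)
```

A streaming SQL engine expresses this kind of logic declaratively and runs it continuously over an unbounded stream, whereas the sketch above processes a small, finite list in one pass.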

Slide Deck and Video Recording: KSQL Introduction

Here is the slide deck:


Here is the video recording:


Have fun watching it, and please contact me with any feedback or questions.

Topics:
big data, ksql, apache kafka, real-time data streaming, anomaly detection

Published at DZone with permission of

Opinions expressed by DZone contributors are their own.
