Workshop: Apache Kakfa and KSQL in Action: Let's Build a Streaming Data Pipeline

Location: St James, 4th flr.

Duration: 1:00pm - 4:00pm

Day of week: Thursday

Level: Beginner

Key Takeaways

  • Brief introduction to Apache Kafka, and streaming concepts

  • Integrating systems with Kafka using Kafka Connect

  • Exploring data with KSQL

  • Using KSQL to filter, join, and aggregate data in Kafka

Prerequisites

Participants must have their own laptop and complete the prerequisite steps detailed here: https://github.com/confluentinc/examples/blob/5.1.0-post/ksql-workshop/pre-requisites.adoc.
For reasons of compatibility we strongly recommend the use of Mac or Linux for the workshop.
Knowledge of SQL is assumed. Participants will benefit from existing knowledge of the concepts of Kafka (topics, streaming, etc). 

Have you ever thought that you needed to be a programmer to do stream processing and build streaming data pipelines? Think again! Apache Kafka is a distributed, scalable, and fault-tolerant streaming platform, providing low-latency pub-sub messaging coupled with native storage and stream processing capabilities. Integrating Kafka with RDBMS, NoSQL, and object stores is simple with Kafka Connect, which is part of Apache Kafka. KSQL is the open-source SQL streaming engine for Apache Kafka, and makes it possible to build stream processing applications at scale, written using a familiar SQL interface. 

In this workshop you will learn the architectural reasoning for Apache Kafka and the benefits of real-time integration, and then build a streaming data pipeline using nothing but your bare hands, Kafka Connect, and KSQL. 

Gasp as we filter events in real time! Be amazed at how we can enrich streams of data with data from RDBMS! Be astonished at the power of streaming aggregates for anomaly detection!

Speaker: Robin Moffatt

Developer Advocate @confluentinc

Robin is a Developer Advocate at Confluent, the company founded by the original creators of Apache Kafka, as well as an Oracle Groundbreaker Ambassador and ACE Director (Alumnus). His career has always involved data, from the old worlds of COBOL and DB2, through the worlds of Oracle and Hadoop, and into the current world with Kafka. His particular interests are analytics, systems architecture, performance testing and optimization. He blogs at http://cnfl.io/rmoff and http://rmoff.net/ (and previously http://ritt.md/rmoff) and can be found tweeting grumpy geek thoughts as @rmoff. Outside of work he enjoys drinking good beer and eating fried breakfasts, although generally not at the same time.

Find Robin Moffatt at

Speaker: Sven Erik Knop

Solutions Architect @ConfluentInc

Sven Erik Knop is a Solutions Architect for Confluent, helping customers to design and implement distributed architectures with Apache Kafka. Prior to Confluent, Sven Erik worked at Perforce Software as a Senior Consultant, deploying and optimising hundreds of Perforce installations, presenting at many conferences and regularly contributing to blogs and industry publications on topics such as versioning and best practices for development and deployment. Sven Erik holds a physics degree from the University of Essen, Germany.

Find Sven Erik Knop at

Tracks

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.