Workshop: Apache Kafka and ksqlDB in Action: Let's Build a Streaming Data Pipeline!

Location: Westminster, 4th flr.

Duration: 1:00pm - 4:00pm

Day of week: Thursday

Level: Intermediate

Prerequisites

 
  • Participants must have their own laptop
  • Participants can either 
  1. use a provided cloud environment but MUST have a SSH client. 
    or
  2. run the full environment locally (Mac/Linux only)
    Please complete the 'prerequisites' section at https://rmoff.dev/qcon01 PRIOR to the workshop (the conference wifi won't cope with trying to do it on the day!)
  • Knowledge of SQL is assumed.
    Participants will benefit from existing knowledge of the concepts of Kafka (topics, streaming, etc).
Any questions, please get in touch with me, either robin@confluent.io or @rmoff on http://cnfl.io/slack

Have you ever thought that you needed to be a programmer to do stream processing and build streaming data pipelines? Think again! Apache Kafka is a distributed, scalable, and fault-tolerant streaming platform, providing low-latency pub-sub messaging coupled with native storage and stream processing capabilities. Integrating Kafka with RDBMS, NoSQL, and object stores is simple with Kafka Connect, which is part of Apache Kafka. ksqlDB is a SQL streaming engine for Apache Kafka, and makes it possible to build stream processing applications at scale, written using a familiar SQL interface.

In this workshop you will learn the architectural reasoning for Apache Kafka and the benefits of real-time integration, and then build a streaming data pipeline using nothing but your bare hands, Kafka Connect, and KSQL.

Gasp as we filter events in real time! Be amazed at how we can enrich streams of data with data from RDBMS! Be astonished at the power of streaming aggregates for anomaly detection!

Key Takeaways

  • An introduction to the Apache Kafka ecosystem
  • Key streaming concepts and design patterns
  • Integrating systems with Kafka using Kafka Connect
  • Exploring data with ksqlDB
  • Using ksqlDB to filter, join, and aggregate data in Kafka
  • Querying state from ksqlDB

Speaker: Robin Moffatt

Developer Advocate @confluentinc

Robin is a Developer Advocate at Confluent, the company founded by the original creators of Apache Kafka, as well as an Oracle Groundbreaker Ambassador and ACE Director (Alumnus). His career has always involved data, from the old worlds of COBOL and DB2, through the worlds of Oracle and Hadoop, and into the current world with Kafka. His particular interests are analytics, systems architecture, performance testing and optimization. He blogs at http://cnfl.io/rmoff and http://rmoff.net/ (and previously http://ritt.md/rmoff) and can be found tweeting grumpy geek thoughts as @rmoff. Outside of work he enjoys drinking good beer and eating fried breakfasts, although generally not at the same time.

Find Robin Moffatt at

Speaker: Sven Erik Knop

Solutions Architect @ConfluentInc

Sven Erik Knop is a Solutions Architect for Confluent, helping customers to design and implement distributed architectures with Apache Kafka. Prior to Confluent, Sven Erik worked at Perforce Software as a Senior Consultant, deploying and optimising hundreds of Perforce installations, presenting at many conferences and regularly contributing to blogs and industry publications on topics such as versioning and best practices for development and deployment. Sven Erik holds a physics degree from the University of Essen, Germany.

Find Sven Erik Knop at

Tracks

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.