Session + Live Q&A

Connecting Modern Data Pipelines and Data Products

The complexity of tools, distributed systems, and the CAP theorem introduce tradeoffs that practitioners cannot avoid or ignore as they embrace the world of modern data pipelines. What strategies can you employ? This is where data products come into play. Understanding the business objectives of data products helps us make informed decisions about tools, architecture, and services. Join this panel to learn from data thought leaders!

Speaker

Dr. Einat Orr

Co-creator of @lakeFS, Co-founder & CEO of Treeverse

Einat Orr has 20+ years of experience building R&D organizations and leading the technology vision at multiple companies, the latest being Similarweb, that IPO in NYSE last May. Currently she serves as Co-founder and CEO of Treeverse, the company behind lakeFS, an open source platform...

Speaker

Roksolana Diachuk

Big Data Engineer @Captify

Roksolana works as a Big Data Engineer at Captify. She is a speaker at technical conferences and meetups, one of the Women Who Code Kyiv leads. She is passionate about Big Data, Scala, and Kubernetes. Her hobbies include building technical topics around fairytales and discovering new cities.

Speaker

Ricardo Sueiras

Principal Advocate in Open Source @AWS

Over 30 years spent working in the technology industry, helping customers solve business problems with open source and cloud. Currently I am a Developer Advocate at AWS focusing on open source, where I help raise awareness of AWS and our customers open source projects and technology, and work...

Ismaël Mejía

Senior Cloud Advocate @Microsoft

Ismaël Mejía is a Senior Cloud Advocate at Microsoft working on the Azure Data and AI team. He has more than a decade of experience architecting systems for startups and financial companies. He has been recently focused on distributed data frameworks, he is an active contributor of Apache Beam...

Speaker

Dr. Einat Orr

Co-creator of @lakeFS, Co-founder & CEO of Treeverse

Speaker

Roksolana Diachuk

Big Data Engineer @Captify

Speaker

Ricardo Sueiras

Principal Advocate in Open Source @AWS

Speaker

Ismaël Mejía

Senior Cloud Advocate @Microsoft

From the same track

Session + Live Q&A Data Engineering

Modern Data Pipelines in AdTech—Life in the Trenches

Wednesday Apr 6 / 01:40PM BST

There are various tasks that the modern data pipelines approach helps us solve in different domains, including advertising. Modern data pipelines allow us to process data in a more efficient manner with a diverse set of data transformation tools for both batch and streaming data processing....

Roksolana Diachuk

Big Data Engineer @Captify

Session + Live Q&A Data Engineering

Taming the Data Mess, How Not to Be Overwhelmed by the Data Landscape

Wednesday Apr 6 / 10:35AM BST

The data engineering field has evolved at a tremendous pace in the last decade, new systems that enable the processing of huge amounts of data generated enormous opportunities, as well as challenges for software practitioners. All these new tools and methodologies created a new set of...

Ismaël Mejía

Senior Cloud Advocate @Microsoft

Session + Live Q&A Data Engineering

Orchestrating Hybrid Workflows with Apache Airflow

Wednesday Apr 6 / 02:55PM BST

According to analysts, 87 percent of enterprises have already adopted hybrid cloud strategies. Customers have many reasons why they need to support hybrid environments, from maximizing the value from heritage systems to meeting local compliance and data processing regulations. As they build...

Ricardo Sueiras

Principal Advocate in Open Source @AWS

Session + Live Q&A Data Engineering

Data Versioning at Scale: Chaos and Chaos Management

Wednesday Apr 6 / 04:10PM BST

Version control is fundamental when managing code, but what about data? Our data changes over time, first since it accumulates, we have new data points for new points in time. But this is not the only reason. We also have additional data added to past time, since we were able to get additional...

Dr. Einat Orr

Co-creator of @lakeFS, Co-founder & CEO of Treeverse

View full Schedule

Session + Live Q&A

Connecting Modern Data Pipelines and Data Products

Speaker

Dr. Einat Orr

Find Dr. Einat Orr at:

Speaker

Roksolana Diachuk

Find Roksolana Diachuk at:

Speaker

Ricardo Sueiras

Speaker

Ismaël Mejía

Find Ismaël Mejía at:

Speaker

Dr. Einat Orr

Speaker

Roksolana Diachuk

Speaker

Ricardo Sueiras

Speaker

Ismaël Mejía

Date

Location

Track

Topics

Video

Slides

Add to Calendar

Share

From the same track

Modern Data Pipelines in AdTech—Life in the Trenches

Roksolana Diachuk

Taming the Data Mess, How Not to Be Overwhelmed by the Data Landscape

Ismaël Mejía

Orchestrating Hybrid Workflows with Apache Airflow

Ricardo Sueiras

Data Versioning at Scale: Chaos and Chaos Management

Dr. Einat Orr

Follow QCon

Contact

Menu

QCons around the World