Track Overview
Modern Data Pipelines & DataMesh
As data systems become increasingly complicated software solutions, how do we ensure that the architecture we are building is sustainable, reliable, and highly available? What best practices can we adopt for building modern yet complex data pipelines?
And what is our role, as individuals who design architectures and data-intensive applications, in qualifying the data? How do we deliver on our promises to data users while making sure the data is secure? What about enforcing data governance regulations and other requirements?
In this track, you will learn about processes and best practices for complex data systems: the good, the bad, and the hopefully bright future. Join us as we explore the data mesh, data lifecycle management, and modern data pipeline paradigms, and walk away with actionable best practices to improve both your data product and the lives of your colleagues.
Adi Polak
VP of DevEx @Treeverse
From this track
Taming the Data Mess, How Not to Be Overwhelmed by the Data Landscape
Wednesday Apr 6 / 10:35AM BST
The data engineering field has evolved at a tremendous pace in the last decade. New systems that enable the processing of huge amounts of data have generated enormous opportunities, as well as challenges, for software practitioners. All these new tools and methodologies created a new set of...
Ismaël Mejía
Senior Cloud Advocate @Microsoft
Connecting Modern Data Pipelines and Data Products
Wednesday Apr 6 / 11:50AM BST
The complexity of tools, distributed systems, and the CAP theorem introduce tradeoffs that practitioners cannot avoid or ignore as they embrace the world of modern data pipelines. What strategies can you employ? This is where data products come into play. Understanding the business objectives of...
Dr. Einat Orr
Co-creator of @lakeFS, Co-founder & CEO of Treeverse
Roksolana Diachuk
Big Data Engineer @Captify
Ricardo Sueiras
Principal Advocate in Open Source @AWS
Ismaël Mejía
Senior Cloud Advocate @Microsoft
Modern Data Pipelines in AdTech—Life in the Trenches
Wednesday Apr 6 / 01:40PM BST
There are various tasks that the modern data pipelines approach helps us solve in different domains, including advertising. Modern data pipelines allow us to process data in a more efficient manner with a diverse set of data transformation tools for both batch and streaming data processing....
Roksolana Diachuk
Big Data Engineer @Captify
Orchestrating Hybrid Workflows with Apache Airflow
Wednesday Apr 6 / 02:55PM BST
According to analysts, 87 percent of enterprises have already adopted hybrid cloud strategies. Customers have many reasons why they need to support hybrid environments, from maximizing the value from heritage systems to meeting local compliance and data processing regulations. As they build...
Ricardo Sueiras
Principal Advocate in Open Source @AWS
Data Versioning at Scale: Chaos and Chaos Management
Wednesday Apr 6 / 04:10PM BST
Version control is fundamental when managing code, but what about data? Our data changes over time. First, it accumulates: we have new data points for new points in time. But this is not the only reason. We also have additional data added to past time, since we were able to get additional...
Dr. Einat Orr
Co-creator of @lakeFS, Co-founder & CEO of Treeverse
Speakers from this track
Ismaël Mejía
Senior Cloud Advocate @Microsoft
Ismaël Mejía is a Senior Cloud Advocate at Microsoft working on the Azure Data and AI team. He has more than a decade of experience architecting systems for startups and financial companies. He has recently focused on distributed data frameworks and is an active contributor to Apache Beam...
Dr. Einat Orr
Co-creator of @lakeFS, Co-founder & CEO of Treeverse
Einat Orr has 20+ years of experience building R&D organizations and leading the technology vision at multiple companies, the latest being Similarweb, which went public on the NYSE last May. Currently she serves as Co-founder and CEO of Treeverse, the company behind lakeFS, an open source platform...
Roksolana Diachuk
Big Data Engineer @Captify
Roksolana works as a Big Data Engineer at Captify. She is a speaker at technical conferences and meetups, and one of the Women Who Code Kyiv leads. She is passionate about Big Data, Scala, and Kubernetes. Her hobbies include building technical topics around fairytales and discovering new cities.
Ricardo Sueiras
Principal Advocate in Open Source @AWS
Over 30 years spent working in the technology industry, helping customers solve business problems with open source and cloud. Currently I am a Developer Advocate at AWS focusing on open source, where I help raise awareness of AWS and our customers' open source projects and technology, and work...
Track Host
Adi Polak
VP of DevEx @Treeverse
As Vice President of Developer Experience at Treeverse, Adi helps build lakeFS, git-like capabilities for data lakes. In her work, she brings her vast industry research and engineering experience to bear in educating and helping teams design, architect, and build cost-effective data systems and...