SESSION + Live Q&A

Real-Time Decisions Using ML on the Google Cloud Platform

Ocado Technology is providing a full solution to put the world’s retailers online using the cloud, robotics, AI and IoT. Processing tens of thousands of orders every day, we generate millions of events every minute, leading to huge amount of data to be managed. We will present how this Big Data is handled in Google Cloud Platform to build a end-to-end machine learning pipeline: how data is stored and processed in BigQuery, post-processed and copied with Dataflow, then used to train Deep Neural Network models with TensorFlow, how all this is orchestrated using our in-house scheduling software called Query Manager, and how predictions are finally run in real-time using Cloud ML Engine and Datastore.



Speaker

Carlos Garcia

Ocado Smart Platform Fraud Team Lead

Carlos has eight years of experience developing software, most of it in high-traffic applications in the travel industry. Carlos joined Ocado Technology one year ago, as team leader of the Fraud Detection team. He has participated in the definition and implementation of a production-ready...

Read more

Speaker

Przemyslaw Pastuszka

ML Engineer @Ocado

Przemek is a software engineer with six years of experience in the Big Data space. He started his career working for a US-based startup called Hadapt (now acquired by Teradata), building a distributed in-house file system which was designed as a highly-performant replacement for HDFS. After...

Read more
Find Przemyslaw Pastuszka at:

Location

Mountbatten, 6th flr.

Track

Distributed Stateful Systems

Topics

Machine LearningGCPReal TimeNeural NetworksTensorflowInterview AvailableOcado

Share

From the same track

SESSION + Live Q&A Kubernetes

Taming Distributed Stateful Pets With Kubernetes

So you've mastered Kubernetes for scheduling and scaling your stateless applications. Your pager has been quieter, life is good. But what about the carefully configured database clusters running on expensive dedicated infrastructure? (And the expensive sysadmin you're paying to maintain it!). In...

Matthew Bates

Co-founder at UK Kubernetes Company Jetstack

James Munnelly

Solutions Engineer @Jetstack

SESSION + Live Q&A Cloud Computing

Cloud-Native and Scalable Kafka Architecture

Kafka as a distributed stateful service faces serious stability and scalability challenges in cloud environment which favors stateless services. As cluster size grows with traffic, it faces issues of data balancing, high consumer data fan out and time consuming process to scale up or update....

Allen Wang

Senior Software Engineer - Cloud Platform @Netflix

SESSION + Live Q&A Ingestion

Scaling Uber's Elasticsearch Clusters

Uber's Marketplace is the algorithmic brain behind Uber's ride-sharing services, and the brain needs immense amount of real-time data to make timely and sound decisions. Uber's Marketplace Intelligence team has been using Elasticsearch as a real-time OLAP database to serve thousands of internal...

Danny Yuan

Real-time Streaming Lead @Uber

SESSION + Live Q&A Distributed Systems

The Future of Distributed Databases Is Relational

Years ago when working at Amazon on shopping cart infrastructure and the precursor to DynamoDB, my co-founder and I realized that while distributed key value stores were useful for a few use-cases, we missed many of the benefits of relational databases: transactions, joins, and the power of the...

Sumedh Pathak

VP Engineering & Co-Founder @CitusData

View full Schedule