SESSION + Live Q&A

Osiris: When Big Data Is Too Big for HBase

As the #1 job site in the world, Indeed delivers hundreds of millions of searches per day to job seekers. To give our users the best experience possible, we analyze petabytes of data per day for machine learning, A/B testing, and reporting.

Learn how the Search Quality team at Indeed developed Osiris, a horizontally scalable key-value store built on Hadoop. Osiris is flexible enough to be used in everything from big data analysis to latency-sensitive, user-facing applications.

This talk will cover the requirements and scaling challenges we faced that led to the development of Osiris. We'll also discuss the details of how we built Osiris, including its unique key design and highly configurable storage engine -- both of which allow for use in a wide variety of applications. We'll end this talk with specific examples of how Osiris is used at Indeed.


Speaker

Josh Slocum

Software Engineer @Indeed

Josh Slocum is a Software Engineer at Indeed on the Search Quality team. As the principal developer on Osiris, he oversees the development of Indeed's highly flexible and scalable key-value store.  He received a Bachelor of Science in Computer Science degree from The University of Texas at...

Read more

Location

Westminster, 4th flr.

Track

Solutions Track I

Topics

Big Data

Video

Video is not available

Share

From the same track

SESSION + Live Q&A Chatbot

Democratizing Serverless

Thom Leggett, Director Oracle Cloud, will do a 50-minute deep dive on  the recently announced open source Fn Project  a cloud-agnostic serverless functions platform that allows developers to run their own serverless infrastructure anywhere, even on their laptops. Learn how it’s...

Thom Leggett

Director @OracleCloud

SESSION + Live Q&A Scale

Understanding Geospatial Processing

Eighty-three percent of the world’s business to business transaction revenue touches an SAP system. Our customers collect and analyze huge volumes of data about everything from products and customers to assets, operations, and transactions.More and more of this data is becoming...

Vitaliy Rudnytskiy

Principal Solutions Architect @SAP

SESSION + Live Q&A Artificial Intelligence

Explaining Artificial Intelligence to Schoolchildren

In this talk, Dale will give an industry perspective on why introducing machine learning to kids is so essential, and share some of IBM’s experiences for how it can be done effectively - with some of the projects that school kids have created, and the lessons learned from these efforts.

Dale Lane

Software Developer @IBM

SESSION + Live Q&A DevOps

Data Driven DevOps

Devops is usually viewed from a traditional perspective of a collaboration of Dev, Ops, and QA, driven by the change in Culture, People, and Process. But how do you know where you stand and where to move? As in almost any field, data and metrics give you the gauges and instruments. In this talk,...

Baruch Sadogursky

Developer Advocate @JFrog

SESSION + Live Q&A London

Serverless Spring

This live coding session will introduce Spring Cloud Function, from the basic programming model all the way to multicloud deployments. Along the way, we'll explore the current state of Java across Function-as-a-Service providers and demonstrate what role Spring can play in the Serverless...

Dave Syer

Senior Consulting Engineer @Pivotal

View full Schedule