Data Infra Meetup | Uber’s Data Storage Evolution
January 29, 2024
By Jing Zhao

Uber operates one of the largest data lakes in the industry, storing exabytes of data. In this talk, we introduce the evolution of our data storage architecture and delve into multiple key initiatives from the past several years.

Specifically, we will introduce:

  • Our on-prem HDFS cluster scalability challenges and how we solved them
  • Our efficiency optimizations that significantly reduced storage overhead and unit cost without compromising reliability or performance
  • The challenges we face in the ongoing cloud migration and our solutions
