Tech Talk: Interactive Analytics with the Starburst Presto + Alluxio stack for the Cloud
March 12, 2019
By Bin Fan and Matt Fuller

As data analytics needs have grown with the explosion of data, the speed of analytics and the interactivity of queries have become dramatically more important.

In this tech talk, we will introduce the stack of Starburst Presto, Alluxio, and a cloud object store for building a highly concurrent, low-latency analytics platform. This stack runs fast SQL across multiple storage systems, including HDFS, S3, and others, in public cloud, hybrid cloud, and multi-cloud environments.

You’ll learn about:

  • The architecture of Presto, an open source distributed SQL engine, as well as innovations by Starburst such as its cost-based optimizer
  • How Presto, paired with Alluxio, can query data in cloud object storage such as S3 with high performance and at low cost
  • How to achieve data locality and cross-job caching with Alluxio, no matter where the data is persisted, while reducing egress costs

In addition, we’ll present real-world architectures and use cases from internet companies such as JD.com and NetEase.com, which run the Presto and Alluxio stack at the scale of hundreds of nodes.
