Bay Area Meetup: Interactive Analytics in the Cloud with Presto and Alluxio
August 22, 2019

ALLUXIO BAY AREA MEETUP

This talk describes a stack that combines Presto, Alluxio, and cloud object storage systems (e.g., AWS S3) to serve high-concurrency, low-latency SQL queries on big data in the cloud. Presto, an open-source distributed SQL engine, is widely recognized for its low-latency queries, high concurrency, and native ability to query multiple data sources. Alluxio is an open-source data orchestration system that brings data closer to compute and provides a unified data access layer at in-memory speeds. Presto can use Alluxio as a distributed caching tier on top of S3 for frequently queried (hot) data, which avoids repeatedly reading the same data from the cloud.
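
To make the caching-tier idea concrete, here is a minimal toy sketch of a read-through cache in Python. It is not the Alluxio or Presto API: the class names `ObjectStore` and `ReadThroughCache` and the object key are hypothetical, and the "remote store" is just an in-memory dictionary standing in for S3. It only illustrates why repeated queries over hot data stop hitting the object store after the first read.

```python
class ObjectStore:
    """Hypothetical stand-in for a remote object store such as S3."""

    def __init__(self, objects):
        self._objects = dict(objects)
        self.reads = 0  # number of times the remote store was actually read

    def get(self, key):
        self.reads += 1
        return self._objects[key]


class ReadThroughCache:
    """Hypothetical caching tier sitting between the query engine and the store."""

    def __init__(self, store):
        self._store = store
        self._cache = {}

    def get(self, key):
        # On a miss, fetch from the remote store once and keep a local copy.
        if key not in self._cache:
            self._cache[key] = self._store.get(key)
        return self._cache[key]


store = ObjectStore({"warehouse/orders/part-0": b"row data"})
cache = ReadThroughCache(store)

# Three "queries" touch the same hot object; only the first reaches the store.
for _ in range(3):
    data = cache.get("warehouse/orders/part-0")

print(store.reads)
```

A real deployment adds what this sketch leaves out, such as distributing cached data across workers, eviction, and consistency with the underlying store.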

This talk covers:

  • The architecture of Presto, its separation of compute and storage, its cloud-readiness, and recent advancements in the project such as the Cost-Based Optimizer and Kubernetes support.
  • An overview of Alluxio’s key concepts, architecture, and data flow.
  • Presto and Alluxio production use cases running on hundreds of nodes, including ING Bank, JD.com, and NetEase Games.
