StorageQuery: federated querying on object stores, powered by Alluxio and Presto
August 25, 2020
By 
Abner Ferreira
Caio Pavanelli

Over the last few years, organizations have worked towards the separation of storage and compute for a number of benefits in the areas of cost, data duplication and data latency. Cloud resolves most of these issues but comes to the expense of needing a way to query data on remote storages. Alluxio and Presto are a powerful combination to address the compute problem, which is part of the strategy used by Simbiose Ventures to create a product called StorageQuery – A platform to query files in cloud storages with SQL.

This talk will focus on:

  • How Alluxio fits StorageQuery’s tech stack;
  • Advantages of using Alluxio as a cache layer and its unified filesystem
  • Development of new under file system for Backblaze B2 and fine-grained code documentation;
  • ShannonDB remote storage mode.

Over the last few years, organizations have worked towards the separation of storage and compute for a number of benefits in the areas of cost, data duplication and data latency. Cloud resolves most of these issues but comes to the expense of needing a way to query data on remote storages. Alluxio and Presto are a powerful combination to address the compute problem, which is part of the strategy used by Simbiose Ventures to create a product called StorageQuery – A platform to query files in cloud storages with SQL.

This talk will focus on:

  • How Alluxio fits StorageQuery’s tech stack;
  • Advantages of using Alluxio as a cache layer and its unified filesystem
  • Development of new under file system for Backblaze B2 and fine-grained code documentation;
  • ShannonDB remote storage mode.

Video:

Slides:

StorageQuery: federated querying on object stores, powered by Alluxio and Presto from Alluxio, Inc.

Complete the form below to access the full overview:

Videos

Sign-up for a Live Demo or Book a Meeting with a Solutions Engineer