Online Meetup: Powering Data Science and AI with Apache Spark, Alluxio, and IBM

October 29, 2019

Bin Fan

VP of Technology

Alluxio

Yonggang Hu

Chief Architect at Spectrum Computing

IBM

Spark is a widely adopted open source framework that provides a unified interface for analytics and machine learning workloads. Alluxio, which originated in the UC Berkeley AMPLab (the same lab as Spark), is an open source data orchestration platform for compute frameworks like Spark. It provides stateful caching that enables efficient data sharing between jobs and improves resilience against job failures, and it brings together data from many different sources, whether remote HDFS or cloud object stores.

Alluxio partnered with IBM to deliver a Spark-based solution for fast data analytics. By integrating IBM Spectrum Conductor, an advanced workload and resource management platform that maximizes hardware utilization to speed results and cut infrastructure costs, Alluxio and IBM delivered a solution that powers a leading telecom company’s applications to support 320 million subscribers. In this online meetup, we will present the benefits of the fast analytics stack of Spark on Alluxio and IBM, and dive into a leading telecom company’s use case of leveraging Spark and Alluxio to process massive amounts of mobile data.

In this online meetup, you will learn about:

Why leading companies are moving towards a decoupled compute and storage architecture, and the associated challenges and requirements.
Why Spark and Alluxio together can solve these challenges and fulfill these requirements.
How a leading telecom company leverages Spark with Alluxio for fast data processing at scale on top of object stores and HDFS.
