On Demand Video

What’s new in Alluxio Enterprise AI 3.2: Leverage GPU Anywhere, Pythonic Filesystem API, Write Checkpointing and more

In today’s AI-driven world, organizations face unprecedented demands for powerful AI infrastructure to fuel their model training and serving workloads. Performance bottlenecks, cost inefficiencies, and management complexity pose significant challenges for the AI platform teams that support these workloads at scale. On July 9, 2024, we introduced Alluxio Enterprise AI 3.2, a release designed to address these issues.

In this webinar, Shouwei Chen introduced exciting new features of Alluxio Enterprise AI 3.2:

  • Leveraging GPU resources anywhere, with access to remote data at the same performance as local data
  • Enhanced I/O performance with 97%+ GPU utilization for popular language model training benchmarks
  • Achieving the same performance as HPC storage on an existing data lake, without additional HPC storage infrastructure
  • New Python FileSystem API to seamlessly integrate with Python applications like Ray
  • Other new features, including advanced cache management, rolling upgrades, and CSI failover

Speaker:

Dr. Shouwei Chen is a core maintainer and product manager of open-source Alluxio. Before joining Alluxio, Shouwei received a Ph.D. from Rutgers University. Shouwei’s research focuses on the co-design of memory-centric computing frameworks with in-memory distributed file systems in large-scale environments.