On-Demand Videos

video

AI/ML Infra Meetup | Building Production Platform for Large-Scale Recommendation Applications

Watch now

video

AI/ML Infra Meetup | How Uber Optimizes LLM Training and Finetune

Watch now

video

AI/ML Infra Meetup | Optimizing ML Data Access with Alluxio: Preprocessing, Pretraining, & Inference at Scale

Watch now

video

AI/ML Infra Meetup | Deployment, Discovery and Serving of LLMs at Uber Scale

Watch now

video

What’s New in Alluxio AI: 3X Faster Checkpoint File Creation, New Cache Eviction Policies, Python SDK enhancements, and more

Join us to learn about the latest release of Alluxio Enterprise AI. In this webinar, we’ll provide an overview of the new features and capabilities of Alluxio Enterprise AI, built to accelerate AI workloads and maximize GPU utilization.

Key highlights include:

New caching mode accelerates AI checkpoints
Advanced cache eviction policies provide fine-grained control
Python SDK integrations enhance AI framework compatibility
A demo of Alluxio accelerating AI training workloads in AWS

Watch now

video

AI/ML Infra Meetup | Balancing Cost, Performance, and Scale - Running GPU/CPU Workloads in the Cloud

Watch now

video

AI/ML Infra Meetup | A Faster and More Cost Efficient LLM Inference Stack

Watch now

video

AI/ML Infra Meetup | Three Developments in AI Infra

Watch now

video

Accelerate AI: Alluxio 101

In the rapidly evolving landscape of AI and machine learning, Platform and Data Infrastructure Teams face critical challenges in building and managing large-scale AI platforms. Performance bottlenecks, scalability of the platform, and scarcity of GPUs pose significant challenges in supporting large-scale model training and serving.

In this talk, we introduce how Alluxio helps Platform and Data Infrastructure teams deliver faster, more scalable platforms to ML Engineering teams developing and training AI models. Alluxio’s highly-distributed cache accelerates AI workloads by eliminating data loading bottlenecks and maximizing GPU utilization. Customers report up to 4x faster training performance with high-speed access to petabytes of data spread across billions of files regardless of persistent storage type or proximity to GPU clusters. Alluxio’s architecture lowers data infrastructure costs, increases GPU utilization, and enables workload portability for navigating GPU scarcity challenges.

‍

Watch now

video

AI/ML Infra Meetup | The power of Ray in the era of LLM and multi-modality AI

Watch now

video

AI/ML Infra Meetup | Exploring Distributed Caching for Faster GPU Training with NVMe GDS and RDMA

Watch now

video

AI/ML Infra Meetup | Big Data and AI

Watch now

Alluxio Enterprise AI

Alluxio Enterprise Data

On-Demand Videos

Sign-up for a Live Demo or Book a Meeting with a Solutions Engineer