Speeding up TensorFlow and PyTorch with Alluxio
September 9, 2021
By 
Lu Qiu

Driven by strong interests from our open source community, the Alluxio core engineering team re-designed things to come up with a more efficient and transparent way for users to leverage data orchestration through the POSIX interface. This enables much better performance for ML workloads where data is accessed via the POSIX interface.

In this 20 minute community session, you’ll hear from Lu Qiu, one of Alluxio’s lead engineers on the POSIX implementation project.

In this session, you’ll learn:

  • How Alluxio’s new JNI-based FUSE implementation supports more efficient POSIX data access
  • How improvements to multiple data operations, including distributedLoad, optimizations on listing or calculating directories with a massive amounts of files, etc., improve performance. In model training
  • How these latest enhancements improve performance on TensorFlow and PyTorch training workloads, even with GPU-based training and compute
ALLUXIO WEBINAR

Driven by strong interests from our open source community, the Alluxio core engineering team re-designed things to come up with a more efficient and transparent way for users to leverage data orchestration through the POSIX interface. This enables much better performance for ML workloads where data is accessed via the POSIX interface.

In this 20 minute community session, you’ll hear from Lu Qiu, one of Alluxio’s lead engineers on the POSIX implementation project.

In this session, you’ll learn:

  • How Alluxio’s new JNI-based FUSE implementation supports more efficient POSIX data access
  • How improvements to multiple data operations, including distributedLoad, optimizations on listing or calculating directories with a massive amounts of files, etc., improve performance. In model training
  • How these latest enhancements improve performance on TensorFlow and PyTorch training workloads, even with GPU-based training and compute

Video:

Slack with speakers, experts, and community members.
Join the Alluxio Global Online Meetup Group.

ALLUXIO WEBINAR

Driven by strong interests from our open source community, the Alluxio core engineering team re-designed things to come up with a more efficient and transparent way for users to leverage data orchestration through the POSIX interface. This enables much better performance for ML workloads where data is accessed via the POSIX interface.

In this 20 minute community session, you’ll hear from Lu Qiu, one of Alluxio’s lead engineers on the POSIX implementation project.

In this session, you’ll learn:

  • How Alluxio’s new JNI-based FUSE implementation supports more efficient POSIX data access
  • How improvements to multiple data operations, including distributedLoad, optimizations on listing or calculating directories with a massive amounts of files, etc., improve performance. In model training
  • How these latest enhancements improve performance on TensorFlow and PyTorch training workloads, even with GPU-based training and compute

Video:

Slack with speakers, experts, and community members.
Join the Alluxio Global Online Meetup Group.

Videos:
Presentation Slides:

Complete the form below to access the full overview:

Videos

Sign-up for a Live Demo or Book a Meeting with a Solutions Engineer