We are delighted by the success of the inaugural Data Orchestration Summit on Nov. 7, 2019! Organized by Alluxio, this one-day event sold out with nearly 400 attendees! Data engineers, cloud engineers, and data scientists joined talks by 24 industry leaders from all over the globe, who shared their experiences building cloud-native data and AI platforms. All session recordings and slides are now available.
Key Announcements
Haoyuan Li, founder and CTO of Alluxio, opened the summit with his talk, Orchestrate a Data Symphony, in which he discusses the key challenges and trends shaping data engineering for modern data and AI platforms and explores the concept of Data Orchestration.
In the Alluxio tech talks, founding engineers Calvin Jia, Bin Fan, and Gene Pang dive into Alluxio 2 Series' key features in open source, community updates, and the latest innovations bringing Alluxio open source into the world of structured data.
Session highlights
The featured talks for the Summit highlighted how leading companies architect their data and AI platforms through the data orchestration approach, leveraging open source technologies such as Alluxio, Apache Spark, Presto, and more. Some session highlights include:
- Orchestrate a Data Symphony - Haoyuan Li, Alluxio
- Enterprise Distributed Query Service powered by Presto & Alluxio across clouds at WalmartLabs - Ashish Tadose, Walmart
- How to Run Fast Presto Analytics with Alluxio in Cloud - a Production Experience - Danny Linden, Ryte
- Alluxio tech talks: What’s New in Alluxio 2 - Calvin Jia & Bin Fan, and Alluxio Innovations for Structured Data - Gene Pang
- Open Source Panel: how to create an open source project - Ben Lorica, O’Reilly; Tobi Knaup, D2iQ; Maxime Beauchemin, Preset; Haoyuan Li, Alluxio
- Data Orchestration for Analytics and AI workloads at DBS Bank - Carlos Queiroz, Development Bank of Singapore (recording will soon be available here)
What's next?
- Join the conversation on the community Slack channel!
- Given the strong interest, we’re bringing back the hands-on lab, so stay tuned!
Cheers!
Amelia and Bin
Data Orchestration Summit Co-Chairs