Data orchestration for AI, big data, and the cloud - The Journey from Academia Research to Commercialization

Speaker Name: 
Haoyuan Li
Speaker Title: 
Founder, Chairman, & CTO
Speaker Organization: 
Alluxio
Start Time: 
Wednesday, October 2, 2019 - 9:45am
End Time: 
Wednesday, October 2, 2019 - 10:30am


The data ecosystem has evolved substantially over the past two decades. There has been an explosion of data-driven frameworks: Presto, Hive, and Spark for running analytics and ETL queries, and TensorFlow and PyTorch for training and serving models. On the data side, the approach to managing and storing data has shifted from HDFS to cheaper, more scalable, and separate storage services typified by cloud stores such as AWS S3. As a result, data engineering has become increasingly complex and inefficient, particularly in hybrid and cloud environments. In this presentation, Haoyuan Li offers an overview of data orchestration, as well as his journey from academic research to commercialization.


Haoyuan (H.Y.) Li is the founder, chairman, and CTO of Alluxio. He holds a PhD in computer science from UC Berkeley, where he worked in the AMPLab, created the Alluxio (formerly Tachyon) open source data orchestration system, co-created Apache Spark Streaming, and became an Apache Spark founding committer. He also holds an MS from Cornell University and a BS from Peking University, both in computer science.
