Agenda

A FREE ONE-DAY EVENT FOR DATA PRACTITIONERS

Join us for a free, instructor-led workshop to build an open data lake. Choose from the sessions below on some of the hottest topics in the Big Data space and walk away with the knowledge needed to guide your team to data-driven solutions.

Data Engineering on Data Lakes
Managing Machine Learning Lifecycle on Data Lakes
10/12/202010/12/2020
11:00 - 11:45 PDT13:00 - 13:45 PDT
In this workshop, experts will guide you through:

  • Common Challenges faced by data engineering teams managing a data platform and ETL pipelines

  • Leveraging an open data lake platform for building a modern data architecture

  • Developing auto-scaling data pipelines

  • Best practices for deploying data engineering pipelines



This workshop will help you learn how to:

  • Build predictive Machine Learning models on the data lake using Apache Spark, including data prep, model training, and hyperparameter tuning

  • Provide meaningful and convenient ways to manage packages for Python and R dialects on distributed Spark clusters

  • Enable experiment tracking and model registry for iterative analysis

  • Accelerate end-to-end machine learning lifecycle by automating the process of deploying models to production


THE DATA LAKE SUMMIT

THE DATA LAKE SUMMIT

Attend the definitive virtual conference for all things Data Lake