Data Lake Summit Preview: Take a deep dive into the future of analytics

October 13, 2020 | Updated March 20, 2024

The wait is finally over! The Data Lake Summit, the definitive virtual conference for all things data lake, kicks off today. Hosted by Qubole, in collaboration with AWS and Google Cloud, the summit aims to take a deeper dive into the latest market trends around data analytics, powerful use cases, and best practices to help you get the most out of data lakes in your organization.

The virtual event, spread across two days (October 13 and 14), will have five tracks: Thought Leadership, Data Lakes in the Real World, Fundamentals and Best Practices, Data Lake Technology, and Lightning Talks.

The summit will host about 55 speakers, 4,000 attendees, and 30+ organizations, and will bring the experience of our in-person conference to the comfort of your home.

Join us to gain valuable insights from four interactive keynote sessions featuring our Co-founder and CEO, Ashish Thusoo; Debanjan Saha, VP/GM, Data Analytics Services, Google; Kirk Borne, Principal Data Scientist, Data Science Fellow, and Executive Advisor at Booz Allen Hamilton; and Chris Casey, worldwide head of business development for AWS Data Exchange.

The virtual summit will feature discussions on a number of thought-provoking ideas brought forward by industry leaders. Session topics include: Cost Optimization and Self-service reporting for a Data Lake Ecosystem; Powering Real-time decisions with Big Data and Microservices; Building Data Lake on AWS and GCP; Data Minimization for Data Governance Strategy in GDPR; Scaling Data Science with Spark and R; Modern Data Analytics and the Modern Data Lake; The Cloudscape of Data Lakes and Data Warehouse; and Building an Open, data first and machine learning forward platform, among others.

Our event platform offers attendees a chance to connect one-to-one with experts to discuss integrations, use cases, and best practices on topics such as modern data lake architecture for streaming analytics, data discovery, and machine learning, as well as open source technologies including Apache Spark, Hive, Presto, and Airflow. The full content is available live and on demand, exclusively to attendees.

As part of the summit, we are also offering free pre-summit workshops on ‘Data Engineering on Data Lakes’ and ‘Managing Machine Learning Lifecycle on Data Lakes’, where instructors will walk you through how to build an open data lake.

This is a jam-packed event that will give you the technical depth and business insight you need to succeed with data lakes. To check out the full schedule for the event and sessions in detail, please visit –
