Data Lakes and Data Warehouses – The Data Lake Summit Speaker Lineup

Start Free Trial
September 29, 2020 by and Updated March 20th, 2024

Today’s applications for machine learning and real-time predictive analytics require a robust set of capabilities from the underlying data platform. These must meet the growing needs of the modern enterprise to integrate and analyze a wide variety of data generated from social, mobile, and sensor sources, among others.

So the key is to evolve and implement a data platform according to the organizational needs of today and the future. It’s about finding the synergies across various data storage, management, and orchestration technologies and focusing on a needs-based and domain-driven architecture.

We present the list of speakers of the Data Lake Summit who will shine a light on the advantages of a modern data platform and the synergies of data lakes and data warehouses, as well as how data architectures must be evaluated and adapted to an organization’s environment and needs in order for it to reap the benefits of its data assets.

Check out the full agenda of the Data Lake Summit

Also Read: Data Lakes for Artificial Intelligence and Machine Learning – The Data Lake Summit Speaker Lineup

Caleb Jones, The Walt Disney Company

Caleb Jones, Senior Staff Software Architect at The Walt Disney Company (TWDC), will speak at the Data Lake Summit on Domain-driven Data Architecture. He will discuss the concepts and theory of what a domain-driven data platform looks like and illustrate how TWDC has architected and built a domain-driven platform in the cloud.

Caleb has been at TWDC for over nine years, working with large-scale data both as an engineer and architect. He is currently a Lead Architect for the Data Platform team in TWDC’s Direct to Consumer and International segment (DTCI). Caleb leads efforts to combine and unify data platforms that have powered several consumer brands. He has focused on creating a modern architecture that avoids monolithic platforms’ pitfalls, strongly unifies data, and provides autonomy to diverse teams. Besides data architecture, Caleb has a passion for analyzing connected data using graphs and networks and has helped build some of the foundations for ID Graphs in TWDC.

Pravanjan Choudhury, Capillary Technologies

Pravanjan Choudhury, CTO at Capillary Technologies, will be speaking on Five Reasons Why a Multi-tenant Data Lakes is a Different Ballgame on Day 2 at the Data Lake Summit. His talk will stress the key aspects of modeling multi-tenant data lakes as a system around modern data lake concepts. Capillary’s Multi-Tenant Data Lake provides the ‘Consumer Data Platform’ for our customers of various verticals like retail, grocery, FnB, jewelry, etc. This data platform generates the essential insights that form the base for effective marketing engagement and eventually lead to consumer retention and loyalty. Catch him live on October 14 at 1 p.m. PDT to learn more about Capillary’s Multi-Tenant Data Lake.

Pravanjan has been responsible for taking Capillary’s product vision forward and ensuring partner brands always stay consumer-ready through the solutions the company provides. In his time at Capillary, he acted as the chief architect of the company’s data-driven technology stack. With a doctorate and bachelor’s from IIT – Kharagpur, Pravanjan was the principal architect at Silicon Valley startup Minekey where he developed products around content personalization, recommendation, search, social collaboration, and microblogging Pravan.png. He has also worked with several research groups and acted as a consultant for National Semiconductors, where he helped them in the domain of Embedded Systems design and verification.

Eddie White, Google Cloud

Eddie White, Partner Development Manager at Google Cloud, will join us in the session titled – The Business Value of Qubole’s Open Data Lake on Google Cloud. In this informative session on Day 1, Eddie will discuss how organizations can best leverage Qubole on Google Cloud.

He will also speak on Day 2 of the summit on The Cloudscape of Data Lakes and Data Warehouses, highlighting how Google Cloud and Qubole’s analytics partners play a critical role in helping enterprises through their digital transformation journey.

Eddie focuses on developing innovative technology and go-to-market motions with some of Google Cloud’s key Data Analytics Platform partners at Google Cloud. He is a seasoned executive with recent stints at Pentaho, Hitachi Vantara, and Dremio.

Ivan Peng, Nextdoor

In his session – Doubling Down: Why Nextdoor Ditched a Data Warehouse for a Centralized Data Lake, Ivan Peng, Software Engineer of the Data Platform at Nextdoor, will speak about the concession of processing performance in decoupling storage and compute pales in comparison to the data discoverability and the analytics velocity gained from an open data lake, while highlighting the journey to get it.

At Nextdoor, Ivan is responsible for developing and maintaining the data infrastructure used to process the various ETL pipelines. He is particularly passionate about functional programming. Ivan believes that everything can and should be written in Scala.

Hugo Sosa, BigData4ALL

Hugo Sosa, Managing Director, BigData4ALL, who comes with 20 years of business consulting and advising experience, will be sharing a business point of view on Migrating from a Legacy Data Warehouse to a Data Lake on Cloud. He will discuss how migration from the data warehouse to the data lake could reduce cost, lead to faster time to market, lower TCO, and augment final user satisfaction.

Hugo is the founding Partner of BigData4All; he also specializes in supporting companies in using statistical and analytical models to improve business performance and uncover actionable information. Before BigData4All, he was a Partner at Deloitte Mexico, responsible for Big Data and analytics practices and different practices in Consulting firms, such as KPMG and Capgemini, and leading different business lines in Teradata and IBM.

He has been responsible for the design, development, implementation, and operation of the information exploitation strategy for various companies in the consumer products, telecommunications, retail, and finance sectors. Hugo has a degree in Business Administration from the Universidad de las Americas, Puebla, where he also studied for a Master’s Degree in Business Administration with an International Management specialization. He has participated in executive programs for senior management, strategy, and innovation.

Grab your spot today

You don’t want to miss this boat, folks! The Data Lake Summit is your vessel to connect with the thought leaders and get hands-on training on all aspects of Data Lakes.

Need even more reasons to join the virtual summit? Here are the ten reasons to attend the Data Lake Virtual Summit.

Hurry! Register Now

Start Free Trial
Read Part 2: Tuning the Data Ingestion process