How Facebook built a self-service data infrastructure
Data is everywhere. And to thrive, even to survive, in this next economic cycle, companies need to not only re-tool their infrastructure, but they also need to remodel their approach to data to create a truly self-service and on-demand culture. It is no longer an option for business leaders to wait a month, a week, or even a day for analysis that will help drive decisions. And back of the napkin models with gut decisions do not enable predictable success…so businesses need to change.Please join Ashish Thusoo, Co-Founder and CEO of Qubole—a big data as a service company—and former head of Facebook’s Data Infrastructure team that pioneered the self-service data infrastructure model as he shares just how he did this at Facebook.
After this webcast, you will have learned:
- How Facebook pioneered the self-service data infrastructure model
- The complexities the Facebook team faced as they made the move
- The impact a self-service data model had on all aspects of the business
- Why leveraging public clouds is the right choice for a self-service data platform
About Ashish Thusoo, Co-Founder and Chief Executive Officer – Qubole
Before co-founding Qubole, Ashish ran Facebook’s Data Infrastructure team; under his leadership, the team built one of the largest data processing and analytics platforms in the world. This platform achieved not just the bold aim of making data accessible to analysts, engineers, and data scientists, but drove the “big data” revolution. In the process of scaling Facebook’s Big Data infrastructure, he helped drive the creation of a host of tools, technologies, and templates that are used industry wide today.
About Ravi Murthy, Director of Engineering at Facebook
Ravi Murthy is the Director of Engineering for Analytics Infrastructure at Facebook. He leads development of the big data platform for one of the largest data warehouses in the world. It is responsible for crunching petabytes of data, to enable better products and services for Facebook’s 1-billion+ active users. His teams develop popular open source software such as Presto, Hive and Hadoop(Corona), as well as systems for real-time stream processing (Scribe and Puma), in-memory analytics (Scuba), and graph analytics (Giraph).