The Rise and Rise of Apache Spark
Is Spark the answer to all questions posed for Big Data? In a few short years the Apache Spark in-memory data processing engine has risen from nowhere to become one of the most important projects in the Hadoop ecosystem and – for some – the anointed successor to MapReduce as Hadoop’s primary data processing engine.
In this stimulating webinar, Matt Aslett, Research Director at 451 Research, will lead a discussion around the impact of the rise of Apache Spark on the Big Data ecosystem. He will be joined by Steve Gotlieb, Big Data Guru at Autodesk, who will dive into how developers and data scientists are using Spark Notebooks to prototype data transformations that can be deployed through an automated ETL pipeline, and delivered to data analysts to enable faster time-to-insights. And finally, Dharmesh (Dash) Desai, Technology Evangelist for Qubole, will round out the discussion with a look at the real value of a self-service analytics platform and how this value is realized when both business users and data team members have access to raw and aggregated data from a range of sources.
Join us for insights into:
Spark use cases – when is the right time for Spark? The value of a self-service platform. How to choose the right engine for the job, when Spark is not enough.