Apache Spark

Apache Spark is a fast, in-memory data processing engine that allows data teams to run a range of workload types, such as streaming, machine learning or interactive data exploration, that require fast iterative access to datasets.

Apache Spark

A self-managing and self-optimizing implementation of Spark

Qubole offers the first Autonomous Data Platform implementation of the Apache Spark open source project.

Runs on your choice of popular public Cloud infrastructure

Leverages the platform’s AIR (Alerts, Insights, Recommendations) capabilities to help data teams focus on the outcome, instead of the platform


Microsoft Azure

Oracle Cloud

Supported Versions

AWS: 1.6.2, 2.0.0, 2.0.2

Azure, Oracle: 2.1.0, 2.0.2

Spark in Qubole Documentation