Welcome to the Qubole Education landing page – our mission is to ensure your success with the available Cloud Technologies as well as the QDS Product. We recommend bookmarking this page so you can quickly return to register for Instructor Led Free Public Courses and stay up to date with the new material we are continually developing. Select a tab below to browse our services and begin your cloud education!

Self Service Education

Get started with Qubole’s Self Service Education offering by selecting the link below to navigate and enroll in the available courses which features videos, exercises and quizzes.

Please be aware that currently the Self Service Education environment is tied to the api.qubole.com environment therefore you may use the same login for the Self Service Education portal. If you are using a different production URL it will be necessary to create a new account with the Self Service portal.

All Available Courses

Qubole provides access to In Application Tutorials which provide high level overviews of the functionality within the application via a lightweight walkthrough widget. The structure of the environment as well as how to navigate are reviewed in the In Application Tutorials. These are available via the Help Center inside of Qubole – we recommend beginning with the below:

“Getting Started” Walk Through

Free Public Courses

Qubole provides free instructor led public courses which include access to a training environment for exercises. Please scroll down to view the current schedule and access registration links. Please register with the email address you use to log into the Qubole product. Please keep in mind that all of the below are available for Self Service Education. All times stated in Eastern Standard Time.

Course TitleDate & Registration LinkDuration
Hive for Data Analysts4/23 @2pm ET120 minutes
Spark for Data Ops5/2 @2pm ET90 minutes
Qubole Enterprise User5/7 @2pm ET120 minutes
Qubole Enterprise Admin5/9 @2pm ET120 minutes
Presto for Data Analysts5/14 @2pm ET90 minutes
Presto for Data Ops5/16 @2pm ET90 minutes
Hive for Data Analysts5/21 @2pm ET90 minutes
Hive for Data Engineers5/23 @2pm ET90 minutes
Hive for Data Ops5/30 @2pm ET90 minutes

QDS Product Courses

The following is our Product Course Catalogue focusing on the Qubole Product. Please coordinate with your Customer Success Manager or Sales Representative to schedule private training sessions of any of the courses listed below.

DescriptionDurationQuestions Answered
Qubole Enterprise User 120 minWhat is Qubole, what are the features available to me as a user, how does it interact with the cloud on my behalf and how do I pick the appropriate SQL Engine for my need?
Qubole Enterprise Admin120 minLearn how Qubole clusters work, how to administer Qubole cluster and how to decide which cluster is appropriate for a given scenario.

Qubole Agent Admin (AWS Only)

60 minWhat are Agents, how can I use them to my advantage and what Agents are available in Qubole?
Qubole Airflow User60 minWhat is Airflow, how do I use the Airflow options in Qubole and how do I trigger DAGS that exist in Airflow from within Qubole. (NOTE: course does not teach how to build DAGS in Airflow, for that instruction please reach out to your CSM to set up a session with a Solutions Architect).

Persona Engine / Cluster Courses

The following are our Persona Based Engine / Cluster Courses which focus on the Open Source technologies that Qubole makes available through the product. Please coordinate with your Customer Success Manager or Sales Representative to schedule private training sessions of any of the courses listed below.

DescriptionDuration w/ LabsAgenda
Spark for Data Analysts120 minPreRequisite: knowledge of SQL & familiarity with java, scala or python

Spark Commands , Resilient Distributed Datasets, Data Frames, Scala vs Python, Spark Notebooks, Spark Tuning, Executor AutoScaling, Notebook Interpreters

Spark for Data Engineers120 minPreRequisite: Spark for Data Analysts

Spark Execution Model, Actions & Transformations, Stages & Shuffle, Spark Parallelism Management, Executors, Cores & Tasks, Memory Settings, Executor AutoScaling, Job Server

Spark for Data Scientists90 minPreRequisite: Spark for Data Analysts

Spark Notebooks, Spark Functionality, Qubole Features, Notebook API Execution, Notebook Dashboards, Notebook Tuning, Interpreter Configuration, Executor Management Troubleshooting

Spark for Data Ops60 minPreRequisite: Spark for Data Engineers

Spark Cluster Architecture, Yarn Cluster Behavior, Spark Cluster, Spark Job Submission, Spark Notebook Administration, Notebook Submission, Notebook Logs

Hive for Data Analysts90 minPreRequisite: knowledge of SQL

Hive Commands,  Hive SQL, Hive SQL Syntax, By Clauses, Transitioning from Database, Hive Tuning, Query Level Settings, Map Joins, Tez

Hive for Data Engineers90 minPreRequisite: Hive for Data Analysts

Hive Dynamic Partitioning, Syntax & Best Practices, Too Many Small Files, Entire System Scan, Hive Commands, Improving Performance, Advanced Join Options

Hive for Data Ops60 minPreRequisite: Hive for Data Analysts

Hive Data Preparation, Columnar Optimizations, HDFS Split Size, Common Failure Scenarios, Hive Environment Management, Controlling Environment Behavior, Common Failure Scenarios

Presto for Data Analysts60 minPreRequisite: knowledge of SQL

Presto Commands, Use Case, Comparison to Hive Comparison to RDBMS, Hive Metadata, Presto Tuning, Syntax Best Practices, Job Lag, Job Failure

Presto for Data Ops60 minPreRequisite: Presto for Data Analysts

Presto Data Preparation, Columnar Data Format, Ordering Data, Snappy Compression, Split Slots, Presto Execution Tuning, Memory Pools, Managing Memory

Coming Soon!

Stay tuned for future updates!