- Self Service Education
- Free Public Courses
- QDS Product Courses
- Persona Engine / Cluster Courses
- Coming Soon!
Self Service Education
Get started with Qubole’s Self Service Education offering by selecting the link below to navigate and enroll in the available courses which features videos, exercises and quizzes.
Please be aware that currently the Self Service Education environment is tied to the api.qubole.com environment therefore you may use the same login for the Self Service Education portal. If you are using a different production URL it will be necessary to create a new account with the Self Service portal.
Qubole provides access to In Application Tutorials which provide high level overviews of the functionality within the application via a lightweight walkthrough widget. The structure of the environment as well as how to navigate are reviewed in the In Application Tutorials. These are available via the Help Center inside of Qubole – we recommend beginning with the below:
Free Public Courses
Qubole provides free instructor led public courses which include access to a training environment for exercises. Please scroll down to view the current schedule and access registration links. Please register with the email address you use to log into the Qubole product. Please keep in mind that all of the below are available for Self Service Education. All times stated in Eastern Standard Time.
|Course Title||Date & Registration Link||Duration|
|Hive for Data Analysts||4/23 @2pm ET||120 minutes|
|Spark for Data Ops||5/2 @2pm ET||90 minutes|
|Qubole Enterprise User||5/7 @2pm ET||120 minutes|
|Qubole Enterprise Admin||5/9 @2pm ET||120 minutes|
|Presto for Data Analysts||5/14 @2pm ET||90 minutes|
|Presto for Data Ops||5/16 @2pm ET||90 minutes|
|Hive for Data Analysts||5/21 @2pm ET||90 minutes|
|Hive for Data Engineers||5/23 @2pm ET||90 minutes|
|Hive for Data Ops||5/30 @2pm ET||90 minutes|
QDS Product Courses
The following is our Product Course Catalogue focusing on the Qubole Product. Please coordinate with your Customer Success Manager or Sales Representative to schedule private training sessions of any of the courses listed below.
|Qubole Enterprise User||120 min||What is Qubole, what are the features available to me as a user, how does it interact with the cloud on my behalf and how do I pick the appropriate SQL Engine for my need?|
|Qubole Enterprise Admin||120 min||Learn how Qubole clusters work, how to administer Qubole cluster and how to decide which cluster is appropriate for a given scenario.|
Qubole Agent Admin (AWS Only)
|60 min||What are Agents, how can I use them to my advantage and what Agents are available in Qubole?|
|Qubole Airflow User||60 min||What is Airflow, how do I use the Airflow options in Qubole and how do I trigger DAGS that exist in Airflow from within Qubole. (NOTE: course does not teach how to build DAGS in Airflow, for that instruction please reach out to your CSM to set up a session with a Solutions Architect).|
Persona Engine / Cluster Courses
The following are our Persona Based Engine / Cluster Courses which focus on the Open Source technologies that Qubole makes available through the product. Please coordinate with your Customer Success Manager or Sales Representative to schedule private training sessions of any of the courses listed below.
|Description||Duration w/ Labs||Agenda|
|Spark for Data Analysts||120 min||PreRequisite: knowledge of SQL & familiarity with java, scala or python|
Spark Commands , Resilient Distributed Datasets, Data Frames, Scala vs Python, Spark Notebooks, Spark Tuning, Executor AutoScaling, Notebook Interpreters
|Spark for Data Engineers||120 min||PreRequisite: Spark for Data Analysts|
Spark Execution Model, Actions & Transformations, Stages & Shuffle, Spark Parallelism Management, Executors, Cores & Tasks, Memory Settings, Executor AutoScaling, Job Server
|Spark for Data Scientists||90 min||PreRequisite: Spark for Data Analysts|
Spark Notebooks, Spark Functionality, Qubole Features, Notebook API Execution, Notebook Dashboards, Notebook Tuning, Interpreter Configuration, Executor Management Troubleshooting
|Spark for Data Ops||60 min||PreRequisite: Spark for Data Engineers|
Spark Cluster Architecture, Yarn Cluster Behavior, Spark Cluster, Spark Job Submission, Spark Notebook Administration, Notebook Submission, Notebook Logs
|Hive for Data Analysts||90 min||PreRequisite: knowledge of SQL|
Hive Commands, Hive SQL, Hive SQL Syntax, By Clauses, Transitioning from Database, Hive Tuning, Query Level Settings, Map Joins, Tez
|Hive for Data Engineers||90 min||PreRequisite: Hive for Data Analysts|
Hive Dynamic Partitioning, Syntax & Best Practices, Too Many Small Files, Entire System Scan, Hive Commands, Improving Performance, Advanced Join Options
|Hive for Data Ops||60 min||PreRequisite: Hive for Data Analysts|
Hive Data Preparation, Columnar Optimizations, HDFS Split Size, Common Failure Scenarios, Hive Environment Management, Controlling Environment Behavior, Common Failure Scenarios
|Presto for Data Analysts||60 min||PreRequisite: knowledge of SQL|
Presto Commands, Use Case, Comparison to Hive Comparison to RDBMS, Hive Metadata, Presto Tuning, Syntax Best Practices, Job Lag, Job Failure
|Presto for Data Ops||60 min||PreRequisite: Presto for Data Analysts|
Presto Data Preparation, Columnar Data Format, Ordering Data, Snappy Compression, Split Slots, Presto Execution Tuning, Memory Pools, Managing Memory
Stay tuned for future updates!