Qubole Education

Welcome to the Qubole Education landing page – our mission is to ensure your success with the available Cloud Technologies as well as the QDS Product. We recommend bookmarking this page so you can quickly return to register for Instructor Led Free Public Courses and stay up to date with the new material we are continually developing. Select a tab below to browse our services and begin your cloud education!

Self Service Education

Get started with Qubole’s Self Service Education offering by selecting the link below to navigate and enroll in the available courses which features videos, exercises and quizzes. Please be aware that currently the Self Service Education environment is not yet tied to the Qubole product therefore you will need to create an additional login. We do recommend signing up with the email address used for QDS when this integration occurs.

All Available Courses

“Getting Started” Courses

QDS Walk Throughs

Qubole provides access to In Application Tutorials which provide high level overviews of the functionality within the application via a lightweight walkthrough widget. The structure of the environment as well as how to navigate are reviewed in the In Application Tutorials. These are available via the Help Center inside of Qubole – we recommend beginning with the below:

“Getting Started” Walk Through

Free Public Courses

Qubole provides free instructor led public courses which include demonstrations and access to a training space for exercises. Please scroll down to view the current schedule and access registration links. Please register with the email address you use to log into the Qubole product. Registration is manually accepted and upon approval a confirmation email will be sent. All times stated in Eastern Standard Time.

Course Title Date & Registration Link
Presto 101 1/23 @ 11am
Spark 101 1/23 @ 2pm
Admin 201 1/25 @ 11am
Hive 101 1/30 @ 11am
Tez 101 1/30 @ 2pm
User 101 2/1 @ 2pm
Admin 101 2/6 @ 11am
Spark 101 2/13 @ 11am
Presto 101 2/15 @ 2pm
Admin 201 2/22 @ 11am
Hive 101 2/27 @ 11am
Tez 101 2/27 @ 2pm

Private Course Delivery

Qubole has over 30 hours of training content and instructors are available for private remote or in person delivery of materials. Please contact [email protected] to schedule private training sessions of any of the courses listed below.

Course Number Duration Exercises Questions Answered
User 101  120 min yes What is Qubole, what are the features available to me as a user and how does it interact with AWS on my behalf?
Admin 101 120 min yes What is a Cluster, what are the features available to me as an administrator and how do I manage the Users in my environment?
Admin 201 120 min  yes There are several Clusters available in Qubole, what are the associated use cases and how do I select the right Cluster?
Admin 202 120 min  yes How can Task Focused Analysis help me achieve my desired parallelism and select the best Instance Type while estimating the expected cost of running my Cluster for an hour?
Hive 101 120 min yes What is Hive, how does MapReduce work and how should I think about writing SQL in Hive since it gets converted to MapReduce?
Hive 201 120 min yes I understand the basics of writing SQL in Hive, how can I control the MapReduce produced by Hive in response to the SQL I write?
Hive 202 120 min  yes How can I be more efficient with my MapReduce memory usage and what are some best practices for Dynamic Partitioning?
Tez 101  120 min yes What is Tez, how does it differ from traditional MapReduce, what is the use case for Tez and how can I use it in Qubole?
Presto 101 120 min  yes What is Presto, what are the features available to me as a user and how can I optimize my Presto performance as an analyst?
Presto 201 120 min yes How does Presto manage memory across queries, how can I prevent failure and how can I optimize Presto as an administrator?
Spark 101 120 min  yes What is Spark and the associated use cases, what are RDDs, Data Frames and Executors and how do I use Spark and Notebooks?
Spark 201 120 min yes How does Spark process data, why is Spark considered lazy and how does the structure of my code affect the Data Shuffle?
Spark 202 90 min  yes How does the Spark Application UI support analysis of the Spark jobs as well as the behavior of the Stages, Tasks and Data Shuffle?
Spark 301 120 min  yes How does Spark allocate memory across and within Executors and how can I anticipate and manage the parallelism in my Cluster?
Spark 302 90 min yes What additional considerations can affect Spark performance and how do I know when I need to increase the Cluster power?
Spark MLlib  90 min yes What is the Spark Machine Learning Library, how can I use it in Qubole and how do I create a Notebook Recommendation Engine?
Spark Streaming  60 min  no What is Spark Streaming, what is the associated architecture and how do we conceptualize Streaming development? What types of fault tolerance is available and how do we think about tuning Streaming?
HBase 101 120 min  yes What is HBase and the associated use cases, how does HBase structure and store data and what are some documented Best Practices?
ODBC JDBC 30 min  no What are the ODBC and JDBC drivers and how are these used for connectivity to Qubole for reporting purposes?

Meet Our Trainer

Our Training lead – Alexander Aidun – is a Cornell Engineering Graduate who has spent his career working with technology and his free time working with people. Alex is passionate about course creation and interviews colleagues and customers when designing courseware to ensure the highest possible utility for students. Feel free to review Alex’s career and volunteer work via his LinkedIn page. Alex is available for customized course creation – please contact your account manager or email [email protected] to begin discussions about the work and costs associated with this option.