Spark 101 – Intro to Spark (AWS)

Free membership level allows access to this content.


NOTE: As of the first week of July 2017 we will archive this Version 1 Engine Courses and will release our Persona Based Engine Courses which are focused more on the functionality you will need depending on your specific role. Please check back during the first week of July for the new course list. 




Qubole Education Spark 101 will explore popular use cases and the components of the Spark toolkit including:

  • Resilient Distributed Datasets
  • Data Frames
  • Spark Commands
  • Spark Notebooks
  • Spark Configuration

Try It

Qubole Education Spark 101 contains “Try It” sections which contain instructions that can be followed inside of a personal account if you have access to the ‘paid-qubole/default-datasets’ bucket on S3. Please contact your administrator if you have issues accessing the sample datasets from your personal accounts.

Quiz Questions

Qubole Education Spark 101 contains quiz questions after several of the lessons – it will be necessary to answer the quiz questions to complete the lessons and the course.

Recommended PreRequisites:

  • User 101

Estimated Time: 60 to 75 minutes

Difficulty: Beginner