Spark 301 – Parallelism Management (AWS)



NOTE: As of the first week of July 2017, we will archive these Version 1 Engine Courses and release our Persona Based Engine Courses, which focus more on the functionality you will need for your specific role. Please check back during the first week of July for the new course list.




Qubole Education Spark 301 will explore the relationship between Cores, Executors and Tasks and how to manage memory allocation in a Qubole Spark cluster. The course will continue with a deeper dive into Executor analysis and how to manage Qubole Executor AutoScaling.  

Try It

Qubole Education Spark 301 includes “Try It” sections with instructions that you can follow in a personal account, provided you have access to the ‘paid-qubole/default-datasets’ bucket on S3. Please contact your administrator if you have trouble accessing the sample datasets from your personal account.

Quiz Questions 

Qubole Education Spark 301 includes quiz questions after several of the lessons. You must answer them to complete the lessons and the course.

Recommended Prerequisites:

  • User 101
  • Spark 101
  • Spark 201

Estimated Time: 30 to 45 minutes

Difficulty: Advanced