Hive 202 – Advanced Hive (Cloud Agnostic)

 
 
 
Free membership level allows access to this content.

Placeholder

NOTE: As of the first week of July 2017 we will archive this Version 1 Engine Courses and will release our Persona Based Engine Courses which are focused more on the functionality you will need depending on your specific role. Please check back during the first week of July for the new course list. 

Clouds

AWS, Azure

Overview

Hive 202 will take students deeper into the advanced joins available in HiveQL . The course will also dive deeper into the discussion surrounding Dynamic Partitioning and the common cases that users may run into. 

Try It

Qubole Education Hive 202 contains “Try It” sections which contain instructions that can be followed inside of a personal account if you have access to the flight data. The data can be downloaded from http://stat-computing.org/dataexpo/2009/the-data.html – it will be necessary to download 2004 through 2008. The file for each year should be placed in an individual year directory under a higher level flights directory in the cloud. Therefore the structure should be as follows:

cloud_path/flights/2004

cloud_path/flights/2005

cloud_path/flight/2008

Quiz Questions 

Qubole Education Hive 202 contains quiz questions after several of the lessons – it will be necessary to answer the quiz questions to complete the lessons and the course.

Recommended PreRequisites:

  • User 101
  • Hive 101

Estimated Time: 30 to 45 minutes

Difficulty: Intermediate