Tutorial on Apache Hive at Big Data TechCon 2013
- Introduction
- Hadoop
- Load Data
- Partitioning in Hive
- Dynamic Partitioning
- Joins
- Sampling and Bucketing
- Custom UDFs in Hive
- Custom MapReduce Scripts in Hive
- Miscellaneous
- Grover Hive presentation.pdf: Copy of the slides used in the presentation
- 2008.tar.gz: Flight delay dataset from 2008. More details in Introduction section.
- airports.csv: Dataset linking airport codes to their full names. More details in Introduction section.
- README.md: This file.