Stars
Simple repo to demonstrate how to submit a spark job to EMR from Airflow
Collection of Pig scripts that I use for my talks and workshops
This repository contains the Pig Latin scripts, UDFs and datasets used in the book Pig Design Patterns by Pradeep Pasupuleti, published by Packt.
Data and example code for Programming Pig, by Alan F. Gates
Few scripts to automate daily data loads from RDBMS to Partitioned Avro Hive table
End-to-End Hive : HQL, Partitioning, Bucketing, UDFs, Windowing, Optimization, Map Joins, Indexes