Data Engineering in Hadoop using Cloudera Performed the principle tasks involved in managing, loading, extracting, and transforming data. This respository holds the scripst that I wrote during the whole project. The project was done in Cloudera using Hadoop.
- We create a staging environment
- Upload the Sales Data
- Form the audit database where we track the changes made in the back_office database
- Identify the blank values
- Find the missing locations