Repository containing the activities carried out for the "Big Data Analytics and Machine Learning" UNIVPM course exercises year 2023-2024
This task revolves around the PySpark framework and consists of implementing 3 queries and executing them in as little time as possible on a large dataset.
The task is to reverse engineer a given SQL database.