Generate big test data files for ECE 454: Distributed Computing - Spring 2023 Assignment 2 (Hadoop & Spark with Scala)
Make sure you have python/python3 installed
git clone
the repository locallycd
into the cloned directory- Run the command below:
python3 generator.py <NUMBER OF MOVIES> <NUMBER OF USERS>
i.e.
python3 generator.py 3000 1000
will generate 3000movies_1000users.txt
in the same directory