The following steps outline a typical workflow for starting a data science project:
- Import the data
- Clean the data
- Split the data into a training set and a test set
- Create a model
- Check the model's output on the test set
- Improve the model and repeat
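
The steps above can be sketched end to end in a few lines of Python. This is a minimal illustration, not a recommended toolchain: it uses only the standard library so it runs without any installed packages, whereas real projects commonly rely on libraries such as pandas and scikit-learn. The inline dataset (`RAW_CSV`), the column names `hours` and `score`, and all function names are hypothetical and chosen only for the example.

```python
import csv
import io
import random

# Hypothetical inline dataset standing in for a real CSV file.
# Two rows are deliberately incomplete so the cleaning step has work to do.
RAW_CSV = """hours,score
1,52
2,54
3,
4,58
5,60
6,62
,70
8,66
9,68
10,70
"""


def import_data(text):
    """Import: parse CSV text into a list of dict rows."""
    return list(csv.DictReader(io.StringIO(text)))


def clean_data(rows):
    """Clean: drop rows with missing values and convert fields to float."""
    cleaned = []
    for row in rows:
        if row["hours"] and row["score"]:
            cleaned.append((float(row["hours"]), float(row["score"])))
    return cleaned


def split_data(pairs, test_fraction=0.25, seed=0):
    """Split: shuffle reproducibly, then hold out a fraction as the test set."""
    shuffled = pairs[:]
    random.Random(seed).shuffle(shuffled)
    n_test = max(1, int(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


def fit_line(train):
    """Model: least-squares fit of score = slope * hours + intercept."""
    n = len(train)
    mean_x = sum(x for x, _ in train) / n
    mean_y = sum(y for _, y in train) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in train)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in train)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def mean_absolute_error(model, test):
    """Check: average absolute gap between predictions and true values."""
    slope, intercept = model
    return sum(abs(slope * x + intercept - y) for x, y in test) / len(test)


rows = import_data(RAW_CSV)
pairs = clean_data(rows)
train, test = split_data(pairs)
model = fit_line(train)
error = mean_absolute_error(model, test)

print(f"rows={len(rows)} clean={len(pairs)} train={len(train)} test={len(test)}")
print(f"slope={model[0]:.2f} intercept={model[1]:.2f} test MAE={error:.2f}")
```

The "improve" step would then mean changing something (more data, better cleaning, a different model) and comparing the test error again. Keeping the test set separate from the data used for fitting is what makes that comparison meaningful.
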

Setting up a virtual environment is good practice: it isolates the project's dependencies and avoids conflicts between libraries used by different projects. To create a virtual environment named venv, run the following command:
python3 -m venv venv
To activate the virtual environment on Linux or macOS, run the following command (on Windows, run venv\Scripts\activate instead):
source venv/bin/activate
To install the libraries listed in requirements.txt into the active environment, run the following command:
pip install -r requirements.txt
To deactivate the virtual environment, run the following command:
deactivate
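
To check whether a virtual environment is currently active, for example right after activating or deactivating it, one option is to ask the interpreter itself. The sketch below relies on the documented behavior that `sys.prefix` differs from `sys.base_prefix` inside an environment created with `venv`; the helper name `in_virtual_environment` is hypothetical.

```python
import sys


def in_virtual_environment():
    """Return True when the interpreter runs inside a venv-style environment."""
    # Inside a venv, sys.prefix points at the environment directory while
    # sys.base_prefix still points at the base Python installation.
    return sys.prefix != sys.base_prefix


print("virtual environment active:", in_virtual_environment())
print("interpreter prefix:", sys.prefix)
```

Running this with the environment activated should report the venv directory as the prefix; after deactivating, the prefix should be the system Python installation.
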