Project applying PCA to grocery transaction data. The data set is from https://koalaverse.github.io/homlr/data/my_basket.csv
To run PCA, the data must meet the following requirements:
- Data are in tidy format per Wickham and others (2014);
- Any missing values in the data must be removed or imputed;
- Typically, the data must all be numeric values (e.g., categorical features converted with one-hot, label, or ordinal encoding);
- Numeric data should be standardized (e.g., centered and scaled) to make features comparable.
The my_basket data used in this project already fulfills these requirements.
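
As a rough illustration of the preparation steps listed above, the following is a minimal sketch in Python with NumPy. It is not the code used in this project: the function names `standardize` and `pca` are hypothetical, and the small `toy` matrix is made-up data standing in for a basket-by-item table, not values from my_basket. It assumes the data are already numeric, drops rows with missing values rather than imputing them, and computes the principal components with a singular value decomposition.

```python
import numpy as np

def standardize(X):
    """Drop rows with missing values, then center and scale each column."""
    X = np.asarray(X, dtype=float)
    X = X[~np.isnan(X).any(axis=1)]      # remove incomplete rows (no imputation)
    mean = X.mean(axis=0)
    std = X.std(axis=0, ddof=1)          # sample standard deviation per column
    if np.any(std == 0):
        raise ValueError("a constant column cannot be scaled")
    return (X - mean) / std

def pca(Z):
    """PCA of standardized data via SVD.

    Returns the scores, the loadings, and the share of variance
    explained by each component.
    """
    U, s, Vt = np.linalg.svd(Z, full_matrices=False)
    variances = s**2 / (Z.shape[0] - 1)
    return U * s, Vt.T, variances / variances.sum()

# Hypothetical toy data: rows are baskets, columns are item counts.
toy = np.array([
    [2.0, 0.0, 1.0],
    [0.0, 1.0, 3.0],
    [np.nan, 2.0, 0.0],   # incomplete row, dropped by standardize()
    [1.0, 1.0, 1.0],
    [3.0, 0.0, 2.0],
])

Z = standardize(toy)
scores, loadings, ratio = pca(Z)
```

Here `scores` holds each remaining basket's coordinates on the principal components, `loadings` holds the weight of each item on each component, and `ratio` gives the proportion of total variance each component accounts for.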