Skip to content

Project to improve cluster and metric pruning in OtterTune for CMPSCIS645 Database Design and Implementation project

License

Notifications You must be signed in to change notification settings

treble-maker123/ottertune

Repository files navigation

Ottertune Clustering

Project to improve cluster and metric pruning in OtterTune for CS645 Database Design and Implementation. The Github repository for this project can be found here.

To run the pipeline end-to-end, simply run bash run_pipeline.sh. Note that the script contains various hyperparamteres.

The test.csv in the root directory is generated using the GaussianRandomProjection and K-Means clustering as the metric-pruning method.

Setup

  1. To setup test data, unzip the project3_dataset.tar.gz file, e.g. tar zxf project3_dataset.tar.gz. You should have a folder named project3_dataset in the root directory of this project,
  2. To setup the conda environment (see guide on Miniconda installation here), run conda env create -f environment.yml,

NOTE: If you add packages to conda, delete the original environment.yml file and run conda env export --from-history | grep -v "^prefix: " > environment.yml to generate a new one with the addition you have made.

About

Project to improve cluster and metric pruning in OtterTune for CMPSCIS645 Database Design and Implementation project

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published