This is a Computer Vision (CV) project that solves the problem of segmenting out swimming pools from satellite images.
We use a slightly modified version of the BH-Pools dataset: we split the original 4k images into tiles of 1024x1024 pixels.
Python 3.8+ is required to run code from this repo.
-
Send the presenter the e-mail address associated with your GitHub account. You will be invited to collaborate on the workshop repository.
-
Clone the repository locally
$ git clone https://github.com/tibor-mach/workshop-e2e-gitops-ml $ cd workshop-e2e-gitops-ml
-
Now let's install the requirements and create a virual environment. You can use the included makefile and simply call
$ make venv
-
(Optional) Connect your account to DVC Studio for live experiment sharing
The worskhop host should invite you to a team in DVC Studio using the GitHub-associated e-mail address. You can then simply use your GitHub account to log in to Studio.
Once you are signed-in, click on the icon in the top-right corner of the Studio UI and select Profile. Scroll down to Studio Access Token and copy it
Then call the following command with
***
replaced by your tokendvc config --global studio.token ***
This will allow dvc to communicate with Studio and share progress of live (ongoing) experiments with others.
Import the raw data with dvc from our public dataset registry:
$ dvc import https://github.com/iterative/dataset-registry use-cases/pool_data -o data/pool_data
- Open the
params.yaml
file in the root of the repository and edit some values - Run
dvc exp run
- Observe the results in Studio, in the VS code extension or in the command line
- Persist the experiment and add it to the (local and remote) git history by calling
with
dvc exp push origin exp-name
exp-name
replaced with your experiment name.