Skip to content
forked from depot/depot.ai

Embed machine learning models in your Dockerfile

License

Notifications You must be signed in to change notification settings

Reboot-TV/depot.ai

 
 

Repository files navigation

🔮 depot.ai

depot.ai is a free, open-source Docker registry for public machine learning models that makes it easy to include those models in your own Dockerfile.

The registry serves the top 100 models on Hugging Face, as defined in models/models.yaml. You can see a full list of models, or open a PR to add a new model.

Table of Contents

Usage

Each of model is published as a Docker image, named after its Hugging Face repository. For example, the stable-diffusion-v1-5 model is published as depot.ai/runwayml/stable-diffusion-v1-5.

You can then use the COPY --from command in your Dockerfile to copy the model contents into your own image:

# Copy all files from the model repo to the current WORKDIR
COPY --from=depot.ai/runwayml/stable-diffusion-v1-5 / .

# COPY just one file from the model repo to the current WORKDIR
COPY --from=depot.ai/runwayml/stable-diffusion-v1-5 /v1-5-pruned.ckpt .

👉 If you build images with Depot, this is all you need to do! Depot is preconfigured to use BuildKit and eStargz to optimially build your image with the COPY command. If you COPY specific files from a model repo, Depot will pull just those files from the model image, rather than the entire repo contents, speeding up your build.

Otherwise, if you are not using Depot and would like the same lazy-loading support, you will need to do two things:

  1. You will need to use BuildKit as your Docker build engine. If you are using Docker Desktop or Docker Engine v23.0 or newer, BuildKit is the default build engine. If you are using Docker Buildx, you are using BuildKit (see below about enabling support for lazy-pulling with eStargz). And if you are using an older version of Docker Engine, you can enable BuildKit by setting the DOCKER_BUILDKIT=1 environment variable.

  2. To enable lazy-pulling of just the files you need, you will also need to enable support for eStargz. This means that BuildKit will only fetch the files you need from the image, rather than downloading the entire repo contents. To enable eStargz. For this, you will need to use BuildKit with Docker Buildx, and create a new builder with the following command:

    docker buildx create --use --buildkitd-flags '--oci-worker-snapshotter=stargz'

How it works

Each model is published as a Docker image containing the model contents as a single layer. Conceptually, the image is constructed like:

FROM scratch
COPY model /

Finally, the image layer is built with two specific techniques:

  1. We set SOURCE_DATE_EPOCH to 0, which sets the file created time to the Unix epoch. This ensures that the image layer is reproducible, meaning if the model file contents inside have not changed, the build produces the same layer.
  2. The image is compressed with eStargz, which creates an index of the files inside the layer and enables lazy-pulling of just the files requested by the COPY command. This means that if you want to include just one file from an otherwise large model repo, BuildKit will only copy that one file into your image.

See the Dockerfile for the full implementation.

We publish images to a private Artifact Registry as a temporary storage location, then a Cloudflare Worker imports the image from Artifact Registry, stores it in R2, and serves it as a public Docker registry.

Contributing

Contributions are welcome!

Add a Model

The models that depot.ai serves are defined in models/models.yaml — you can fork this repo, add an additional model entry, and submit a PR to add another model. Once the PR merges, GitHub Actions will automatically build and publish it:

- name: username/model
  sha: 1234567890abcdef1234567890abcdef12345678
  tagAs: latest

License

MIT License, see LICENSE.

About

Embed machine learning models in your Dockerfile

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • TypeScript 49.8%
  • JavaScript 35.0%
  • Astro 11.3%
  • Dockerfile 2.0%
  • Shell 1.9%