Diffusers Core ML

Tested on

Device	macOS	Python	coremltools
MacBook Air M3 16G	14.5	3.9	7.2

Performance

Model	Quantization	Compute Unit	Latency(s)
SDXL Lightning 4step	6bits	CPU_AND_GPU	15

Supported Pipelines

StableDiffusionXLPipeline
StableDiffusionXLImg2ImgPipeline
StableDiffusionXLInpaintPipeline
StableDiffusionXLControlNetPipeline

Not Supported

LoRA

Installation

pip3 install git+https://github.com/digitalbrain79/transformers-coreml.git
pip3 install git+https://github.com/digitalbrain79/diffusers-coreml.git

Examples

Only supports 1024x1024 resolution

Text to Image

from diffusers import (
    EulerAncestralDiscreteScheduler,
    StableDiffusionXLPipeline
)

# Dreamshaper XL: digitalbrain79/dreamshaper-xl-lightning-4step-coreml-6bits-compiled
# Juggernaut XL: digitalbrain79/juggernaut-xl-lightning-4step-coreml-6bits-compiled
pipeline = StableDiffusionXLPipeline.from_pretrained(
    "digitalbrain79/sdxl-lightning-4step-coreml-6bits-compiled",
    use_safetensors=False,
    low_cpu_mem_usage=False
)

pipeline.scheduler = EulerAncestralDiscreteScheduler.from_config(
    pipeline.scheduler.config, timestep_spacing="trailing"
)

image = pipeline(
    prompt="a photo of an astronaut riding a horse on mars",
    num_inference_steps=4,
    guidance_scale=0
).images[0]

Image to Image

from diffusers import (
    EulerAncestralDiscreteScheduler,
    StableDiffusionXLImg2ImgPipeline
)
from diffusers.utils import load_image

# Dreamshaper XL: digitalbrain79/dreamshaper-xl-lightning-4step-coreml-6bits-compiled
# Juggernaut XL: digitalbrain79/juggernaut-xl-lightning-4step-coreml-6bits-compiled
pipeline = StableDiffusionXLImg2ImgPipeline.from_pretrained(
    "digitalbrain79/sdxl-lightning-4step-coreml-6bits-compiled",
    use_safetensors=False,
    low_cpu_mem_usage=False
)

pipeline.scheduler = EulerAncestralDiscreteScheduler.from_config(
    pipeline.scheduler.config, timestep_spacing="trailing"
)

url = "https://huggingface.co/datasets/patrickvonplaten/images/resolve/main/aa_xl/000000009.png"
init_image = load_image(url).convert("RGB")
image = pipeline(
    prompt="an astronaut riding a horse on mars, anime style",
    image=init_image,
    strength=0.9,
    num_inference_steps=4,
    guidance_scale=0
).images[0]

Inpainting

from diffusers import (
    EulerAncestralDiscreteScheduler,
    StableDiffusionXLInpaintPipeline
)
from diffusers.utils import load_image

# Dreamshaper XL: digitalbrain79/dreamshaper-xl-lightning-4step-coreml-6bits-compiled
# Juggernaut XL: digitalbrain79/juggernaut-xl-lightning-4step-coreml-6bits-compiled
pipeline = StableDiffusionXLInpaintPipeline.from_pretrained(
    "digitalbrain79/sdxl-lightning-4step-coreml-6bits-compiled",
    use_safetensors=False,
    low_cpu_mem_usage=False
)

pipeline.scheduler = EulerAncestralDiscreteScheduler.from_config(
    pipeline.scheduler.config, timestep_spacing="trailing"
)

img_url = "https://raw.githubusercontent.com/CompVis/latent-diffusion/main/data/inpainting_examples/overture-creations-5sI6fQgYIuo.png"
mask_url = "https://raw.githubusercontent.com/CompVis/latent-diffusion/main/data/inpainting_examples/overture-creations-5sI6fQgYIuo_mask.png"

init_image = load_image(img_url).convert("RGB")
mask_image = load_image(mask_url).convert("RGB")

image = pipeline(
    prompt="A cat sitting on a bench",
    image=init_image,
    mask_image=mask_image,
    strength=0.9,
    num_inference_steps=4,
    guidance_scale=0
).images[0]

ControlNet

MistoLine

import numpy as np
import cv2
from PIL import Image
from diffusers import (
    EulerAncestralDiscreteScheduler,
    StableDiffusionXLControlNetPipeline,
    ControlNetModel
)
from diffusers.utils import load_image

image = load_image(
    "https://huggingface.co/datasets/patrickvonplaten/images/resolve/main/aa_xl/000000009.png"
)

# Download manually "https://huggingface.co/digitalbrain79/mistoline-coreml-6bits-compiled"
controlnet_path = "" # Downloaded path

controlnet = ControlNetModel.from_pretrained(
    controlnet_path,
    use_safetensors=False,
    low_cpu_mem_usage=False
)

# Dreamshaper XL: digitalbrain79/dreamshaper-xl-lightning-4step-controlnet-coreml-6bits-compiled
# Juggernaut XL: digitalbrain79/juggernaut-xl-lightning-4step-controlnet-coreml-6bits-compiled
pipeline = StableDiffusionXLControlNetPipeline.from_pretrained(
    "digitalbrain79/sdxl-lightning-4step-controlnet-coreml-6bits-compiled",
    use_safetensors=False,
    low_cpu_mem_usage=False,
    controlnet=controlnet
)

pipeline.scheduler = EulerAncestralDiscreteScheduler.from_config(
    pipeline.scheduler.config, timestep_spacing="trailing"
)

image = np.array(image)
image = cv2.Canny(image, 100, 200)
image = image[:, :, None]
image = np.concatenate([image, image, image], axis=2)
canny_image = Image.fromarray(image)

image = pipeline(
prompt="a photo of an astronaut riding a horse on mars",
    controlnet_conditioning_scale=0.5,
    image=canny_image,
    num_inference_steps=4,
    guidance_scale=0
).images[0]

OpenPose

from diffusers import (
    EulerAncestralDiscreteScheduler,
    StableDiffusionXLControlNetPipeline,
    ControlNetModel
)
from diffusers.utils import load_image
from controlnet_aux import OpenposeDetector

# Download manually "https://huggingface.co/digitalbrain79/controlnet-openpose-coreml-6bits-compiled"
controlnet_path = "" # Downloaded path

controlnet = ControlNetModel.from_pretrained(
    controlnet_path,
    use_safetensors=False,
    low_cpu_mem_usage=False
)
openpose = OpenposeDetector.from_pretrained("lllyasviel/ControlNet")

image = load_image(
    "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/diffusers/person.png"
)
openpose_image = openpose(image).resize((1024, 1024))

# Dreamshaper XL: digitalbrain79/dreamshaper-xl-lightning-4step-controlnet-coreml-6bits-compiled
# Juggernaut XL: digitalbrain79/juggernaut-xl-lightning-4step-controlnet-coreml-6bits-compiled
pipeline = StableDiffusionXLControlNetPipeline.from_pretrained(
    "digitalbrain79/sdxl-lightning-4step-controlnet-coreml-6bits-compiled",
    use_safetensors=False,
    low_cpu_mem_usage=False,
    controlnet=controlnet
)
pipeline.scheduler = EulerAncestralDiscreteScheduler.from_config(
    pipeline.scheduler.config, timestep_spacing="trailing"
)

image = pipeline(
    prompt="Darth vader dancing in a desert, high quality",
    negative_prompt="low quality, bad quality",
    image=openpose_image,
    num_inference_steps=4,
    guidance_scale=0
).images[0]

Depth

import numpy as np
import cv2
from PIL import Image
from diffusers import (
    EulerAncestralDiscreteScheduler,
    StableDiffusionXLControlNetPipeline,
    ControlNetModel
)
from diffusers.utils import load_image
from controlnet_aux import MidasDetector

# Download manually "https://huggingface.co/digitalbrain79/controlnet-depth-coreml-6bits-compiled"
controlnet_path = "" # Downloaded path

controlnet = ControlNetModel.from_pretrained(
    controlnet_path,
    use_safetensors=False,
    low_cpu_mem_usage=False
)
processor_midas = MidasDetector.from_pretrained("lllyasviel/Annotators")

image = load_image(
    "https://huggingface.co/lllyasviel/sd-controlnet-depth/resolve/main/images/stormtrooper.png"
)

depth_image = processor_midas(image, output_type='cv2')
height, width, _ = depth_image.shape
ratio = np.sqrt(1024. * 1024. / (width * height))
new_width, new_height = int(width * ratio), int(height * ratio)
depth_image = cv2.resize(depth_image, (new_width, new_height))
depth_image = Image.fromarray(depth_image)

# Dreamshaper XL: digitalbrain79/dreamshaper-xl-lightning-4step-controlnet-coreml-6bits-compiled
# Juggernaut XL: digitalbrain79/juggernaut-xl-lightning-4step-controlnet-coreml-6bits-compiled
pipeline = StableDiffusionXLControlNetPipeline.from_pretrained(
    "digitalbrain79/sdxl-lightning-4step-controlnet-coreml-6bits-compiled",
    use_safetensors=False,
    low_cpu_mem_usage=False,
    controlnet=controlnet
)
pipeline.scheduler = EulerAncestralDiscreteScheduler.from_config(
    pipeline.scheduler.config, timestep_spacing="trailing"
)

image = pipeline(
    prompt="stormtrooper lecture, photorealistic",
    image=depth_image,
    num_inference_steps=4,
    guidance_scale=0,
    controlnet_conditioning_scale=0.5
).images[0]

References

https://github.com/apple/ml-stable-diffusion
https://huggingface.co/ByteDance/SDXL-Lightning
https://huggingface.co/TheMistoAI/MistoLine
https://huggingface.co/thibaud/controlnet-openpose-sdxl-1.0
https://huggingface.co/diffusers/controlnet-depth-sdxl-1.0

Name	Name	Last commit message	Last commit date
Latest commit digitalbrain79 Modified setup.py Sep 10, 2024 4ce5565 · Sep 10, 2024 History 4,485 Commits
.github	.github	Errata: Fix typos & `\s+$` (huggingface#9008 )	Aug 3, 2024
assets	assets	Sync upstream	Aug 16, 2024
benchmarks	benchmarks	shift cache in benchmarking. (huggingface#8740 )	Jul 1, 2024
docker	docker	[CI] Slow Test Updates (huggingface#8870 )	Jul 25, 2024
docs	docs	[refactor] CogVideoX followups + tiled decoding support (huggingface#…	Aug 13, 2024
examples	examples	Bump jinja2 from 3.1.3 to 3.1.4 in /examples/research_projects/realfi…	Aug 14, 2024
python_coreml_stable_diffusion	python_coreml_stable_diffusion	python_coreml_stable_diffusion	Jul 29, 2024
scripts	scripts	Add CogVideoX text-to-video generation model (huggingface#9082 )	Aug 7, 2024
src/diffusers	src/diffusers	Sync upstream	Aug 16, 2024
tests	tests	feat: allow flux transformer to be sharded during inference (huggingf…	Aug 16, 2024
utils	utils	Errata: Fix typos & `\s+$` (huggingface#9008 )	Aug 3, 2024
.gitignore	.gitignore	Latte: Latent Diffusion Transformer for Video Generation (huggingface…	Jul 11, 2024
CITATION.cff	CITATION.cff	[Chore] add: fives names to citations. (huggingface#7395 )	Mar 20, 2024
CODE_OF_CONDUCT.md	CODE_OF_CONDUCT.md	Fix typos, improve, update at main-page files and .github files (hugg…	Nov 14, 2023
CONTRIBUTING.md	CONTRIBUTING.md	Errata: Fix typos & `\s+$` (huggingface#9008 )	Aug 3, 2024
LICENSE	LICENSE	init upload	May 30, 2022
MANIFEST.in	MANIFEST.in	Minor package fixes (huggingface#809 )	Oct 12, 2022
Makefile	Makefile	add: utility to format our docs too 📜 (huggingface#7314 )	Apr 2, 2024
PHILOSOPHY.md	PHILOSOPHY.md	Errata - Fix typos and improve style (huggingface#8571 )	Jun 24, 2024
README.md	README.md	README.md	Aug 27, 2024
_typos.toml	_typos.toml	Fix typos (huggingface#568 )	Sep 19, 2022
googlec3e7ea31e8e0a4a2.html	googlec3e7ea31e8e0a4a2.html	Add files via upload	Aug 12, 2024
pyproject.toml	pyproject.toml	fix: Updated `ruff` configuration to avoid deprecated configuration w…	Apr 17, 2024
setup.py	setup.py	Modified setup.py	Sep 10, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Diffusers Core ML

Tested on

Performance

Supported Pipelines

Not Supported

Installation

Examples

Text to Image

Image to Image

Inpainting

ControlNet

MistoLine

OpenPose

Depth

References

About

Releases

Packages

Languages

License

digitalbrain79/diffusers-coreml

Folders and files

Latest commit

History

Repository files navigation

Diffusers Core ML

Tested on

Performance

Supported Pipelines

Not Supported

Installation

Examples

Text to Image

Image to Image

Inpainting

ControlNet

MistoLine

OpenPose

Depth

References

About

Topics

Resources

License

Code of conduct

Citation

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages