Stable Diffusion-NCNN


Stable Diffusion implemented in C++ with the NCNN framework (Shit Mountain + Blind Box ver.)

Zhihu: https://zhuanlan.zhihu.com/p/582552276

Usages

To use the model, refer to the official Stable Diffusion model license and abide by it; its terms are not repeated here.
The code runs on CPU only and, after tuning, needs only 8 GB of RAM.
Thanks to the PR from nihui, the output quality is now stable, provided the prompt is well written (you can refer to The Code of Quintessence). You are welcome to try it.

Some Results

Implementation Details

Stable Diffusion has three main steps:
1. CLIP: text embedding
2. Iterative sampling with the sampler
3. Decoding the sampler result to obtain the output image

Model details:
1. Weights: Naifu (u know where to find)
2. Sampler: Euler ancestral (k-diffusion version)
3. Resolution: 512×512
4. Denoiser: CFGDenoiser, CompVisDenoiser
5. Prompt: positive & negative, both supported :)
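
To make the sampler and denoiser entries above more concrete, here is a minimal sketch of one Euler ancestral step and of the classifier-free guidance mix that a CFGDenoiser performs, following the formulas used in k-diffusion. It is written in plain Python over flat lists for readability and is not the repo's C++ code; the function names `cfg_combine`, `get_ancestral_step` and `euler_ancestral_step` and the `eta` default are illustrative assumptions.

```python
import math
import random


def cfg_combine(uncond, cond, cond_scale):
    """Classifier-free guidance: move from the negative-prompt prediction
    toward the positive-prompt prediction by a factor of cond_scale."""
    return [u + (c - u) * cond_scale for u, c in zip(uncond, cond)]


def get_ancestral_step(sigma_from, sigma_to, eta=1.0):
    """Split the step to sigma_to into a deterministic part (sigma_down)
    and a fresh-noise part (sigma_up)."""
    if not eta:
        return sigma_to, 0.0
    variance = sigma_to ** 2 * (sigma_from ** 2 - sigma_to ** 2) / sigma_from ** 2
    sigma_up = min(sigma_to, eta * math.sqrt(variance))
    sigma_down = math.sqrt(sigma_to ** 2 - sigma_up ** 2)
    return sigma_down, sigma_up


def euler_ancestral_step(x, denoised, sigma, sigma_next, rng, eta=1.0):
    """One Euler ancestral update of the latent x.

    denoised is the denoiser output for x at noise level sigma;
    rng is a random.Random instance used for the re-injected noise."""
    sigma_down, sigma_up = get_ancestral_step(sigma, sigma_next, eta)
    # Derivative estimate, then an Euler step down to sigma_down.
    d = [(xi - di) / sigma for xi, di in zip(x, denoised)]
    dt = sigma_down - sigma
    x = [xi + di * dt for xi, di in zip(x, d)]
    # Add fresh Gaussian noise unless this is the final step.
    if sigma_next > 0:
        x = [xi + rng.gauss(0.0, 1.0) * sigma_up for xi in x]
    return x
```

In a full sampling loop, `denoised` would come from running the UNet once with the positive prompt and once with the negative prompt and passing both results through `cfg_combine`.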

Code Details

Because the program currently runs slowly, no prebuilt exe is provided; please compile it yourself.
Download the three bin files from Baidu Netdisk or Google Drive and put them in the corresponding assets directory before compiling.
A simple test prompt is included in this repo.

Some Issues

Very sensitive to prompts: a high-quality picture requires a well-written prompt.
Slow: one sampling iteration takes about 5-10 seconds.

ONNX Model

I've uploaded the three ONNX models used by Stable Diffusion so that you can do some interesting work with them.

You can find them via the link above.

Statements

Please abide by the Stable Diffusion model license, and DO NOT use the model for illegal purposes!
If you use these ONNX models in an open source project, please let me know; I'll follow it and look forward to your next great work :)

Instructions

FrozenCLIPEmbedder

ncnn (input & output): token, multiplier, cond, conds
onnx (input & output): onnx::Reshape_0, 2271

z = onnx(onnx::Reshape_0=token)
origin_mean = z.mean()
z *= multiplier
new_mean = z.mean()
z *= origin_mean / new_mean
conds = torch.concat([cond,z], dim=-2)
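
The pseudocode above scales the embedding by the prompt weights and then rescales it so that its overall mean is unchanged. A minimal dependency-free sketch of the same arithmetic follows. It assumes `z` is a list of token rows and `multiplier` holds one weight per token; that shape, and the helper names `reweight_embeddings` and `append_chunk`, are assumptions made for illustration and are not taken from the repo.

```python
def reweight_embeddings(z, multiplier):
    """Scale each token row of z by its weight, then rescale the whole
    tensor so its mean matches the mean before weighting."""
    count = sum(len(row) for row in z)
    origin_mean = sum(sum(row) for row in z) / count
    scaled = [[v * m for v in row] for row, m in zip(z, multiplier)]
    new_mean = sum(sum(row) for row in scaled) / count
    # Like the pseudocode, this divides by new_mean and so assumes it is nonzero.
    factor = origin_mean / new_mean
    return [[v * factor for v in row] for row in scaled]


def append_chunk(cond, z):
    """Concatenate along the token axis (dim=-2 in the pseudocode)."""
    return cond + z


z = [[1.0, 2.0], [3.0, 4.0]]
conds = append_chunk([], reweight_embeddings(z, [1.0, 1.1]))
```

Here the second token is emphasized by 1.1 relative to the first, while the mean over all values of `conds` stays at the original 2.5.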

UNetModel

ncnn (input & output): in0, in1, in2, c_in, c_out, outout
onnx (input & output): x, t, cc, out

outout = in0 + onnx(x=in0 * c_in, t=in1, cc=in2) * c_out
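
The formula above matches the wrapper that k-diffusion's CompVisDenoiser puts around a noise-predicting UNet. The sketch below assumes the scalings follow that convention (`c_in = 1 / sqrt(sigma^2 + 1)`, `c_out = -sigma`); the README itself does not state how `c_in` and `c_out` are computed, so treat those values as an assumption. `eps_model` is a hypothetical stand-in for the ONNX/NCNN UNet call with `t` and `cc` already bound.

```python
import math


def compvis_scalings(sigma, sigma_data=1.0):
    """Input/output scalings for an eps-prediction model at noise level sigma
    (assumed to follow k-diffusion's CompVisDenoiser)."""
    c_in = 1.0 / math.sqrt(sigma ** 2 + sigma_data ** 2)
    c_out = -sigma
    return c_in, c_out


def denoise(in0, eps_model, c_in, c_out):
    """outout = in0 + eps_model(in0 * c_in) * c_out"""
    eps = eps_model([v * c_in for v in in0])
    return [v + e * c_out for v, e in zip(in0, eps)]


# Toy check: if the model returns the exact noise, the clean latent is recovered.
sigma = 2.0
clean = [1.0, 2.0]
noise = [0.5, -0.5]
noisy = [c + sigma * n for c, n in zip(clean, noise)]
c_in, c_out = compvis_scalings(sigma)
recovered = denoise(noisy, lambda scaled_x: noise, c_in, c_out)
```

Folding `c_in` and `c_out` into the network inputs, as the NCNN graph does, lets the sampler treat the model as a plain denoiser that maps a noisy latent to an estimate of the clean one.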
