Unofficial implementation of "Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator" ([arXiv:2411.15466](https://arxiv.org/abs/2411.15466))
Clone this project and install dependencies to set up the environment (Python 3.11 is recommended):
```bash
cd Diptych
pip install -r requirements.txt
```
Prepare GroundingDINO:
```bash
git clone https://github.com/IDEA-Research/GroundingDINO.git
cd GroundingDINO/
pip install -e .
mkdir weights
cd weights
wget -q https://github.com/IDEA-Research/GroundingDINO/releases/download/v0.1.0-alpha/groundingdino_swint_ogc.pth
cd ..
```
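The snippet below is a minimal sanity-check sketch (not part of this repo) to confirm the GroundingDINO install and checkpoint by grounding the subject in a reference image; the image path and text query are placeholders.

```python
# Minimal sanity check for the GroundingDINO setup (paths and query are illustrative).
from groundingdino.util.inference import load_model, load_image, predict

model = load_model(
    "GroundingDINO/groundingdino/config/GroundingDINO_SwinT_OGC.py",
    "GroundingDINO/weights/groundingdino_swint_ogc.pth",
)
image_source, image = load_image("reference.jpg")  # your reference subject image

# Boxes are returned normalized in (cx, cy, w, h) format.
boxes, logits, phrases = predict(
    model=model,
    image=image,
    caption="a dog",          # text query describing the subject
    box_threshold=0.35,
    text_threshold=0.25,
)
print(boxes, phrases)
```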
Prepare SAM:
```bash
mkdir SAM_checkpoints
```
Then download the required checkpoints from [facebookresearch/segment-anything](https://github.com/facebookresearch/segment-anything) and place them under `./SAM_checkpoints/`. A rough sketch of how a detection box is turned into a subject mask with SAM follows below.
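This is only an illustrative sketch (assuming the ViT-H checkpoint `sam_vit_h_4b8939.pth`); the image path and box coordinates are placeholders.

```python
# Sketch: turn a detection box (e.g. from GroundingDINO) into a subject mask with SAM.
import numpy as np
from PIL import Image
from segment_anything import sam_model_registry, SamPredictor

sam = sam_model_registry["vit_h"](checkpoint="SAM_checkpoints/sam_vit_h_4b8939.pth")
predictor = SamPredictor(sam)

image = np.array(Image.open("reference.jpg").convert("RGB"))
predictor.set_image(image)

# Box in absolute (x0, y0, x1, y1) pixel coordinates; GroundingDINO boxes must first
# be converted from their normalized (cx, cy, w, h) format.
box = np.array([100, 50, 400, 480])
masks, scores, _ = predictor.predict(box=box, multimask_output=False)
Image.fromarray((masks[0] * 255).astype(np.uint8)).save("subject_mask.png")
```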
Finally, create an output directory and run inference:

```bash
mkdir output
python inference_diptych.py --arg1 * --arg2 *
```
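For reference, the sketch below illustrates (independently of `inference_diptych.py`, whose arguments are left as placeholders above) the diptych inputs described in the paper: the segmented reference subject fills the left panel, and the right panel is masked so the inpainting model generates the subject in the new context. File names and the panel size are illustrative.

```python
# Sketch of assembling diptych-prompting inputs for an inpainting model.
from PIL import Image

panel_w, panel_h = 768, 768                      # assumed per-panel resolution

subject = Image.open("subject_segmented.png").convert("RGB").resize((panel_w, panel_h))

# Diptych canvas: left = reference subject, right = empty panel to be inpainted.
diptych = Image.new("RGB", (2 * panel_w, panel_h), color=(255, 255, 255))
diptych.paste(subject, (0, 0))

# Inpainting mask: white (255) marks the region to generate, i.e. the right panel.
mask = Image.new("L", (2 * panel_w, panel_h), color=0)
mask.paste(255, (panel_w, 0, 2 * panel_w, panel_h))

# The accompanying prompt describes both panels, e.g. "A diptych with two side-by-side
# images of the same <subject>. On the left, a photo of <subject>. On the right, a
# photo of <subject> <target text prompt>."
diptych.save("diptych_input.png")
mask.save("diptych_mask.png")
```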
If you find this work useful, please cite the original paper:

```bibtex
@article{shin2024large,
  title={Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator},
  author={Shin, Chaehun and Choi, Jooyoung and Kim, Heeseung and Yoon, Sungroh},
  journal={arXiv preprint arXiv:2411.15466},
  year={2024}
}
```
The code is mainly based on diffusers and FLUX-Controlnet-Inpainting.