Skip to content

Commit

Permalink
Merge pull request fudan-generative-vision#9 from crystallee-ai/main
Browse files Browse the repository at this point in the history
docs & fix: update docs and modify requirements
  • Loading branch information
siyuzhu-fudan authored Jun 15, 2024
2 parents a47b8d7 + 4402d57 commit f55b40e
Show file tree
Hide file tree
Showing 2 changed files with 7 additions and 6 deletions.
12 changes: 6 additions & 6 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -63,6 +63,11 @@ Install packages with `pip`
pip install .
```

Besides, ffmpeg is also need:
```bash
apt-get install ffmpeg
```

# Inference

The inference entrypoint script is `scripts/inference.py`. Before testing your cases, there are two preparations need to be completed:
Expand Down Expand Up @@ -110,29 +115,24 @@ Finally, these pretrained models should be organized as follows:
| `-- mm_sd_v15_v2.ckpt
|-- sd-vae-ft-mse/
| |-- config.json
| |-- diffusion_pytorch_model.bin
| `-- diffusion_pytorch_model.safetensors
|-- stable-diffusion-v1-5/
| |-- feature_extractor/
| | `-- preprocessor_config.json
| |-- model_index.json
| |-- unet/
| | |-- config.json
| | `-- diffusion_pytorch_model.bin
| | `-- diffusion_pytorch_model.safetensors
| `-- v1-inference.yaml
`-- wav2vec/
|-- wav2vec2-base-960h/
| |-- config.json
| |-- feature_extractor_config.json
| |-- model.safetensors
| |-- preprocessor_config.json
| |-- pytorch_model.bin
| |-- special_tokens_map.json
| |-- tf_model.h5
| |-- tokenizer_config.json
| `-- vocab.json
`-- wav2vec2-large-960h/
`-- ...
```

## Run inference
Expand Down
1 change: 1 addition & 0 deletions requirements.txt
Original file line number Diff line number Diff line change
Expand Up @@ -14,6 +14,7 @@ numpy==1.26.4
omegaconf==2.3.0
onnx2torch==1.5.14
onnx==1.16.1
onnxruntime==1.18.0
opencv-contrib-python==4.9.0.80
opencv-python-headless==4.9.0.80
opencv-python==4.9.0.80
Expand Down

0 comments on commit f55b40e

Please sign in to comment.