Merge pull request fudan-generative-vision#9 from crystallee-ai/main

docs & fix: update docs and modify requirements
ahmed-khlifi · Jun 15, 2024 · f55b40e
2 parents a47b8d7 + 4402d57
commit f55b40e

Showing 2 changed files with 7 additions and 6 deletions.
diff --git a/README.md b/README.md
@@ -63,6 +63,11 @@ Install packages with `pip`
   pip install .
 ```
 
+In addition, ffmpeg is required:
+```bash
+  apt-get install ffmpeg
+```
+
 # Inference
 
 The inference entrypoint script is `scripts/inference.py`. Before testing your cases, two preparations need to be completed:
@@ -110,29 +115,24 @@ Finally, these pretrained models should be organized as follows:
 |   `-- mm_sd_v15_v2.ckpt
 |-- sd-vae-ft-mse/
 |   |-- config.json
-|   |-- diffusion_pytorch_model.bin
 |   `-- diffusion_pytorch_model.safetensors
 |-- stable-diffusion-v1-5/
 |   |-- feature_extractor/
 |   |   `-- preprocessor_config.json
 |   |-- model_index.json
 |   |-- unet/
 |   |   |-- config.json
-|   |   `-- diffusion_pytorch_model.bin
+|   |   `-- diffusion_pytorch_model.safetensors
 |   `-- v1-inference.yaml
 `-- wav2vec/
     |-- wav2vec2-base-960h/
     |   |-- config.json
     |   |-- feature_extractor_config.json
     |   |-- model.safetensors
     |   |-- preprocessor_config.json
-    |   |-- pytorch_model.bin
     |   |-- special_tokens_map.json
-    |   |-- tf_model.h5
     |   |-- tokenizer_config.json
     |   `-- vocab.json
-    `-- wav2vec2-large-960h/
-        `-- ...
 ```
 
 ## Run inference
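
Review note (a sketch, not part of this commit): the README hunks above add an ffmpeg requirement and trim the expected model tree to the `.safetensors` files. A small preflight check along the following lines could confirm both before running inference. The names `EXPECTED_FILES`, `missing_files`, and `ffmpeg_available` are hypothetical, and the model root directory is passed in as an argument because the hunk does not show it. Entries whose parent directory is cut off by the hunk (such as `mm_sd_v15_v2.ckpt`) are left out on purpose.

```python
import shutil
from pathlib import Path

# Relative paths copied from the post-change tree in the README hunk.
# Files whose parent directory is not visible in the hunk are omitted.
EXPECTED_FILES = [
    "sd-vae-ft-mse/config.json",
    "sd-vae-ft-mse/diffusion_pytorch_model.safetensors",
    "stable-diffusion-v1-5/feature_extractor/preprocessor_config.json",
    "stable-diffusion-v1-5/model_index.json",
    "stable-diffusion-v1-5/unet/config.json",
    "stable-diffusion-v1-5/unet/diffusion_pytorch_model.safetensors",
    "stable-diffusion-v1-5/v1-inference.yaml",
    "wav2vec/wav2vec2-base-960h/config.json",
    "wav2vec/wav2vec2-base-960h/feature_extractor_config.json",
    "wav2vec/wav2vec2-base-960h/model.safetensors",
    "wav2vec/wav2vec2-base-960h/preprocessor_config.json",
    "wav2vec/wav2vec2-base-960h/special_tokens_map.json",
    "wav2vec/wav2vec2-base-960h/tokenizer_config.json",
    "wav2vec/wav2vec2-base-960h/vocab.json",
]


def missing_files(root, expected=EXPECTED_FILES):
    """Return the expected relative paths that are not regular files under root."""
    root = Path(root)
    return [rel for rel in expected if not (root / rel).is_file()]


def ffmpeg_available():
    """Return True if an `ffmpeg` executable is found on PATH."""
    return shutil.which("ffmpeg") is not None
```

Calling `missing_files(<model root>)` returns an empty list when every listed file is present, and `ffmpeg_available()` only checks that the binary can be found, not that it works with the project's audio pipeline.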

diff --git a/requirements.txt b/requirements.txt
@@ -14,6 +14,7 @@ numpy==1.26.4
 omegaconf==2.3.0
 onnx2torch==1.5.14
 onnx==1.16.1
+onnxruntime==1.18.0
 opencv-contrib-python==4.9.0.80
 opencv-python-headless==4.9.0.80
 opencv-python==4.9.0.80
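
Review note (a sketch, not part of this commit): since this hunk adds an exact pin for `onnxruntime`, one way to confirm that an environment matches the pins is to compare them against installed package metadata. The helpers `parse_pins` and `pin_status` below are hypothetical names, and the parser is an assumption that only handles plain `name==version` lines, not the full requirements file syntax.

```python
from importlib.metadata import PackageNotFoundError, version


def parse_pins(text):
    """Collect `name==version` pins from requirements-style text.

    Comments, blank lines, and non-exact specifiers are skipped.
    """
    pins = {}
    for raw in text.splitlines():
        # Drop trailing comments and environment markers, then whitespace.
        line = raw.split("#", 1)[0].split(";", 1)[0].strip()
        if "==" in line:
            name, pinned = line.split("==", 1)
            pins[name.strip()] = pinned.strip()
    return pins


def pin_status(package, pinned):
    """Return 'ok', 'missing', or 'mismatch: <installed>' for one pin."""
    try:
        installed = version(package)
    except PackageNotFoundError:
        return "missing"
    return "ok" if installed == pinned else f"mismatch: {installed}"
```

For example, `pin_status("onnxruntime", "1.18.0")` reports whether the version added in this hunk is the one installed in the current environment.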