
GPU requirements for training and inference #1

Closed
treya-lin opened this issue Dec 18, 2023 · 1 comment

treya-lin commented Dec 18, 2023

Hi, it would be very helpful if GPU-related info could be added to the documentation, so that we know whether we have enough VRAM for training or inference. Thanks!

xuyaoxun (Collaborator) commented

We've trained and run inference on a 40G V100, using 16-mixed precision for training and float32 for inference. During training, LLaMA was not loaded with float32 parameters, which allowed us to reach a batch size of 16. For inference, we process one audio clip at a time; running the entire 600-sentence test set took about 10 minutes on eight 40G V100s. If you're working with a different GPU, you may want to experiment with other batch sizes and inference strategies.
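For reference, here is a minimal sketch of what that setup might look like, assuming PyTorch Lightning and Hugging Face transformers; the checkpoint path, module names, and batch size argument are placeholders rather than this repo's actual training script:

```python
import torch
import lightning as L
from transformers import LlamaForCausalLM

# Load LLaMA in half precision instead of float32 so a batch size of 16
# fits on a single 40G V100 (checkpoint path is a placeholder).
llama = LlamaForCausalLM.from_pretrained(
    "path/to/llama-checkpoint",
    torch_dtype=torch.float16,
)

# Train with 16-bit mixed precision on one GPU.
trainer = L.Trainer(
    accelerator="gpu",
    devices=1,
    precision="16-mixed",
)
# trainer.fit(lit_module, datamodule=dm)  # hypothetical LightningModule / DataModule

# For inference, the comment above suggests keeping weights in float32 and
# feeding one audio clip at a time rather than batching.
```

Adjusting `precision`, the model dtype, and the batch size is the usual way to trade VRAM for speed on smaller GPUs.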
