Modalities param in LLaVA-NeXT documentation #391

gmongaras · 2025-01-13T05:02:46Z

The llava model requires the modalities parameter to be broadcast to the batch size. Otherwise, the zip statement on line 442 of llava/model/llava_arch.py truncates the batch to the length of this parameter, which is 1 by default. If modalities is not passed to the .generate call, only a single output is generated even when a larger batch is passed in.
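The truncation comes from how Python's `zip` behaves, which can be shown without the model. The snippet below is only an illustration: `batch` and `modalities` are stand-in lists, not the actual variables in llava_arch.py, and the one-element `["image"]` default is an assumption.

```python
# Stand-ins for a batch of four inputs and a one-element modalities list.
batch = ["sample_0", "sample_1", "sample_2", "sample_3"]
modalities = ["image"]  # assumed default: a list of length 1

# zip stops at the shortest iterable, so only one sample survives.
truncated = list(zip(batch, modalities))

# Repeating the single entry once per sample keeps the whole batch.
broadcast = list(zip(batch, modalities * len(batch)))

print(len(truncated), len(broadcast))
```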

I think adding this to the documentation would make it clearer. Another option would be to broadcast the modalities parameter automatically when the input is a batch and the user has not changed this parameter.
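A minimal sketch of what the auto-broadcasting could look like. `broadcast_modalities` is a hypothetical helper, not an existing LLaVA-NeXT function, and the `["image"]` fallback is an assumed default.

```python
from typing import List, Optional, Sequence, Union


def broadcast_modalities(
    modalities: Optional[Union[str, Sequence[str]]],
    batch_size: int,
) -> List[str]:
    """Hypothetical helper: return one modality string per batch element."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    if modalities is None:
        modalities = ["image"]  # assumed default
    if isinstance(modalities, str):
        modalities = [modalities]  # accept a bare string as a single entry
    modalities = list(modalities)

    if len(modalities) == batch_size:
        return modalities  # already one entry per sample
    if len(modalities) == 1:
        return modalities * batch_size  # repeat the single entry
    raise ValueError(
        f"got {len(modalities)} modalities for a batch of size {batch_size}"
    )
```

The returned list could then be passed as the modalities argument of the .generate call, so a caller who leaves the parameter at its default still gets one output per input.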

Added modalities param to docs/LLaVA-NeXT.md

8496336

gmongaras mentioned this pull request Jan 18, 2025

Consider changing llama3 configs to left padding #398

Open
