Long context rewards #1
Comments
Which base model are you referring to? If you're interested in models that accept longer prompts, I can train one on top of jina-embeddings-v3, which supports up to 8192 tokens. Also, if you have suggestions for good backbones for multimodal reward models, I'm happy to train one and support those too.
Hey, it would indeed be super useful to have a long-context version of these models, but I think the recent jina-clip-v2 is a better base model (jina-embeddings don't have an image tower, I believe): https://huggingface.co/jinaai/jina-clip-v2. This would be a great resource to have, as it would help with scoring synthetic prompts!
I actually haven't added ImageReward yet. But since it's a standard model used in image generation benchmarks, I will be adding it in the coming weeks. I will probably train
@nicolas-dufour quick update: I briefly tried training one on jina-clip-v2 sometime last week, but accuracy is around 55% to 60% for some reason. It was trained using the same recipe as the standard CLIP / SigLIP based scorers, so this is still very strange. I will try again in the next few days and let you know.
Closing since #5 is added.
Hey,
Of these models, do any support longer texts? Some of the base reward models support fewer than 40 text tokens!