Skip to content

RifleZhang/LLaVA-Reasoner-DPO

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Unofficial Repo for LLaVA-Reasoner-DPO

This is an unofficial repo for the paper: Improve Vision Language Model Chain-of-thought Reasoning

Release

  • [10.22] we will provide third party implementation for arxiv paper

setup

# setup environment, need to fill in the required fields
source setup/setup_env.sh

# data
source setup/setup_train_data.sh 

sft

cd llava_reasoner
bash scripts_sft/sft_direct+cot_preview.sh \
$SAVE_DIR/sft/llava_reasoner_sft_preview

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published