evaluation

The evaluation component covers three metrics: BERTScore, GPT-4 ranking, and GPT-4 preference win rate.
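As an illustration of the last metric, a preference win rate is typically the fraction of pairwise comparisons in which the judge prefers one system's output over a baseline's. The sketch below is a minimal example under assumptions, not this repository's implementation: the function name `win_rate`, the `"win"`/`"loss"`/`"tie"` labels, and the choice to count a tie as half a win are all hypothetical.

```python
from collections import Counter

# Hypothetical label set; the repository's judge output format may differ.
VALID_LABELS = {"win", "loss", "tie"}


def win_rate(judgments, count_ties_as_half=True):
    """Return the share of pairwise comparisons won by the evaluated system.

    `judgments` is an iterable of "win", "loss", or "tie", each recorded from
    the evaluated system's point of view. Counting a tie as half a win is an
    assumption made for this sketch.
    """
    counts = Counter(judgments)
    unknown = set(counts) - VALID_LABELS
    if unknown:
        raise ValueError(f"unexpected labels: {sorted(unknown)}")
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no judgments provided")
    wins = counts["win"]
    if count_ties_as_half:
        wins += 0.5 * counts["tie"]
    return wins / total


if __name__ == "__main__":
    sample = ["win", "win", "loss", "tie"]
    print(win_rate(sample))
    print(win_rate(sample, count_ties_as_half=False))
```

Whether ties count as half a win or are excluded changes the reported number, so the convention should be stated alongside any win rate.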