Skip to content

Improving llm reasoning on GSM and other datasets with structured templates

Notifications You must be signed in to change notification settings

yashrahmed/llm-reasoning-experiments

Repository files navigation

llm-reasoning-experiments

Goal:

Improving llm reasoning on GSM8K, GSM Symbolic and other datasets using reasoning.

Idea:

  1. Learning how to fine tune llama.
  2. Use techniques like self taught reasoners (Rejection sampling?).
  3. Use techniques like MCTS.
  4. Try RL techniques.
  5. Figure out how to integrate interpreters into reasoning.
  6. Train verifiers on program of thought.

About

Improving llm reasoning on GSM and other datasets with structured templates

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published