yashrahmed / llm-reasoning-experiments Public


Improving LLM reasoning on GSM and other datasets with structured templates


Files:

.gitignore
Introduction_to_LLM_Reasoning.ipynb
README.md
heuristics_idea.txt
prompt.txt


llm-reasoning-experiments

Goal:

Improve LLM reasoning on GSM8K, GSM-Symbolic, and other datasets using structured reasoning templates.

Ideas:

Learn how to fine-tune Llama.
Use techniques like the Self-Taught Reasoner (STaR) (rejection sampling?).
Use search techniques like Monte Carlo Tree Search (MCTS).
Try reinforcement learning (RL) techniques.
Figure out how to integrate interpreters into reasoning.
Train verifiers on program-of-thought (PoT) outputs.
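
As an illustration of the rejection-sampling idea above, here is a minimal sketch, not code from this repository. It assumes GSM8K-style solutions whose final line has the form `#### <number>`, and it keeps only sampled rationales whose final answer matches the reference. The names `extract_answer`, `rejection_sample`, and `sample_fn` are hypothetical; `sample_fn` stands in for whatever model call produces a candidate rationale.

```python
import re
from typing import Callable, List, Optional

# Assumption: solutions end with a line like "#### 12", as in GSM8K references.
ANSWER_RE = re.compile(r"####\s*(-?[\d,]*\.?\d+)")


def extract_answer(text: str) -> Optional[str]:
    """Return the last number that follows '####', or None if there is none."""
    matches = ANSWER_RE.findall(text)
    if not matches:
        return None
    # Drop thousands separators so "1,000" compares equal to "1000".
    return matches[-1].replace(",", "")


def same_number(a: str, b: str) -> bool:
    """Compare two extracted answers numerically, so "12" equals "12.0"."""
    return abs(float(a) - float(b)) < 1e-6


def rejection_sample(
    question: str,
    gold_solution: str,
    sample_fn: Callable[[str], str],  # hypothetical: question -> rationale text
    num_samples: int = 4,
) -> List[str]:
    """Sample rationales and keep those whose final answer matches the gold one."""
    gold = extract_answer(gold_solution)
    if gold is None:
        return []
    kept = []
    for _ in range(num_samples):
        rationale = sample_fn(question)
        predicted = extract_answer(rationale)
        if predicted is not None and same_number(predicted, gold):
            kept.append(rationale)
    return kept


# Toy usage with canned outputs in place of a real model.
canned = iter([
    "3 + 4 = 7\n#### 7",
    "3 boxes times 4 apples is 12.\n#### 12",
    "No idea.",
    "4 * 3 = 12.0\n#### 12.0",
])
kept = rejection_sample(
    "Tom has 3 boxes of 4 apples each. How many apples does he have?",
    "3 * 4 = 12\n#### 12",
    sample_fn=lambda q: next(canned),
    num_samples=4,
)
print(len(kept))  # 2 rationales survive the filter
```

The surviving rationales could then serve as fine-tuning data. Filtering on the final answer alone does not check that the intermediate reasoning is sound, which is one motivation for the verifier idea in the list above.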


Languages

Jupyter Notebook 100.0%