Skip to content

Analyzing the spatial reasoning skills of language models

Notifications You must be signed in to change notification settings

josephcappadona/spatialQA

Repository files navigation

spatialQA

Setup

python3 -m pip install -r requirements.txt

Usage

# generate data
python scripts/generate.py

# test model
python scripts/test.py t5-small

# analyze results
python scripts/analyze.py t5-small

generate.py outputs data.tsv.

test.py outputs a results TSV to results/results-MODEL-NAME.tsv.

analyze.py outputs a summary TSV to summary/summary-MODEL-NAME.tsv and an analysis TSV to anlysis/analysis-MODEL-NAME.tsv.

Computing graphs

# Combines result of different model
python scripts/make_dataframes.py analysis/ summary/

# Generate figures
python scripts/make_figures.py analysis/df_analysis.csv summary/df_summary.csv 

make_dataframes.py outputs a dataframe of combined analysis results to analysis/df_analysis.csv and a dataframe of combined summary results to summary/df_summary.csv

make_figures.py output the figures to figures

TODO

  • Refactor generation to make it simpler and more scalable

About

Analyzing the spatial reasoning skills of language models

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •