meta-models-public

Files:

base_classifier.py experiments with comparing meta-models to just feeding the text to a meta-model and asking the question
data.py all the data. lots of duplicated code here
elicit_activations.py get activations from a finetuned input-model
finetune2.py finetune an input-model LoRA
hftrain.py train a meta-model
incontext.py short experiment to create a meta-model fron in-context examples (unsuccessful so far)
make_main_figure.py makes the main figure
make_question_ablations.py makes the question ablations ablation figure
phi2_meta_model.py the meta-model code

Provide feedback

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
README.md		README.md
base_classifier.py		base_classifier.py
data.py		data.py
elicit_activations.py		elicit_activations.py
finetune2.py		finetune2.py
hftrain.py		hftrain.py
incontext.py		incontext.py
make_main_figure.py		make_main_figure.py
make_question_ablations.py		make_question_ablations.py
phi2_meta_model.py		phi2_meta_model.py