HyDE: Precise Zero-Shot Dense Retrieval without Relevance Labels

This is code repository for the paper: HyDE: Precise Zero-Shot Dense Retrieval without Relevance Labels.

HyDE zero-shot instructs GPT3 to generate a fictional document and re-encodes it with unsupervised retriever Contriever to search in its embedding space. HyDE significantly outperforms Contriever across tasks and languages and it does not require any human labeled relevance judgement.

Steps to run the code

Install pyserini by following the guide. We use pyserini to conduct dense retrieval and evaluation.
Download the prebuilt Contrever faiss index

wget  https://www.dropbox.com/s/dytqaqngaupp884/contriever_msmarco_index.tar.gz
tar -xvf contriever_msmarco_index.tar.gz

Setup GPT3 API key

export OPENAI = <your key>

Run hyde-dl19.ipynb, it will run the experiment on the TREC DL19 dataset. Run hyde-demo.ipynb, it will go through HyDE pipeline with an example query.

Citation

@article{hyde,
  title = {Precise Zero-Shot Dense Retrieval without Relevance Labels},
  author = {Luyu Gao and Xueguang Ma and Jimmy Lin and Jamie Callan},
  journal={arXiv preprint arXiv:2212.10496},
  year = {2022}
}

The Internal State of an LLM Knows When It’s Lying

The corresponded code resource is from https://github.com/balevinstein/probes

About prober

The optional models currently including facebook/opt-350m/1.3b/opt-2.7b/6.7b, meta-llama/Llama-3.2-1B-Instruct (3B), meta-llama/Llama-3.1-8B-Instruct
Later the embeddings should be stored when generated the statements to avoid secondary calls.
For HyDE, you can use the prediction_function.py to get the token-level score.

Files:

config.josn: configuretion file. (You need supply huggingface token here)
threshold.json: this is the threshold for binary classification.
generate_embedding.py: This is used for get the hidden states of training dataset from true-and-false dataset, prepared to train the probe model.
model.py: This is for the architecture of prober, currently it is SAPLMAClassifier (MLP from the cited paper)
train_unite.py: This is to train the prober.
prediction.py: This is for the token score prediction from the statements and to write the result into csv files.
prediction_function.py: This is for the token score prediction from the statements with list of input, and output the list of result.
- Input:
  - statement list [stentence1, sentence2, …]
  - model_name: 350m, 1,3b, 2,7b or 6.7b (for opt model)
  - layer: The used layer for prober. (For now, it is -4)
- Output: [[word_list1], [word_list2], ...]
Folder probes: This the trained probers.
prediction_results_xx.csv: This is the output sample outputed by prediction.py.

Result

Dataset ["cities", "facts", "elements", "animals", "companies", "inventions"] Layer: The fourth to last layer of the model (layer = -4)

Model	Acc	AUC	Threshold
opt-350m	0.7867	0.8931	0.5342
opt-1.3b	0.8018	0.9161	0.6019
opt-2.7b	0.8189	0.9197	0.4521
opt-6.7b	0.8230	0.9303	0.4796
llama-3.2-1b-Instruct	0.8243	0.9284	0.5017
llama-3.2-3b-Instruct	0.8708	0.9606	0.5345
llama-3.1-8b-Instruct	0.9064	0.9755	0.5003

Name		Name	Last commit message	Last commit date
Latest commit History 83 Commits
hyde-base		hyde-base
.DS_Store		.DS_Store
.gitignore		.gitignore
README.md		README.md
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

HyDE: Precise Zero-Shot Dense Retrieval without Relevance Labels

Steps to run the code

Citation

The Internal State of an LLM Knows When It’s Lying

About prober

Files:

Result

About

Releases

Packages

Languages

Brian-Konr/probing-hyde

Folders and files

Latest commit

History

Repository files navigation

HyDE: Precise Zero-Shot Dense Retrieval without Relevance Labels

Steps to run the code

Citation

The Internal State of an LLM Knows When It’s Lying

About prober

Files:

Result

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages