Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
scripts		scripts
tests		tests
translation_models		translation_models
LICENSE		LICENSE
README.md		README.md
illustration.png		illustration.png
logo.png		logo.png
mt_task.py		mt_task.py
requirements.txt		requirements.txt

Repository files navigation

Contrastive Decoding

This repository implements source-contrastive and language-contrastive decoding, as described in Sennrich et al. (2023).

In source-contrastive decoding, we search for a translation that maximizes P(Y|X) - λ·P(Y|X'), where X' is a random source segment. This penalizes hallucinations.
In language-contrastive decoding, we search for a translation that maximizes P(Y|X,l_y) - λ·P(Y|X,l_y'), where l_y is the language indicator for the desired target language, l_y' the indicator for some undesired language (such as English or the source language). This penalizes off-target translations.

Installation

pip install -r requirements.txt

Usage

Example commands

Source-contrastive decoding with M2M-100 (418M) on Asturian–Croatian, with λ_src=0.7:

python -m scripts.run --model_path m2m100_418M --language_pairs ast-hr --source_contrastive --source_weight -0.7

Source-contrastive and language-contrastive decoding with SMaLL-100 on Pashto–Asturian, with 2 random source segments, λ_src=0.7, λ_lang=0.1, and English and Pashto as contrastive target languages:

python -m scripts.run --model_path small100 --language_pairs ps-ast --source_contrastive 2 --source_weight -0.7 --language_contrastive en ps --language_weight -0.1

Language-contrastive decoding with Llama 2 Chat (7B) on English–German, with λ_lang=0.5 and English as contrastive target language, using prompting with a one-shot example:

python -m scripts.run --model_path llama-2-7b-chat --language_pairs en-de --language_contrastive en --language_weight -0.5 --oneshot

Dataset and Models:

This repository automatically downloads and uses FLORES-101 for evaluation. devtest section is used for the evaluation.

Multiple models are implemented:

M2M-100 (418M). Use --model_path m2m100_418M
SMaLL-100. Use --model_path small100
Llama 2 7B Chat. Use --model_path llama-2-7b-chat or llama-2-13b-chat

Evaluation

ChrF2:

sacrebleu ref.txt < output.txt --metrics chrf

spBLEU:

sacrebleu ref.txt < output.txt --tokenize flores101

Reference

@article{sennrich-et-al-2023-mitigating,
      title={Mitigating Hallucinations and Off-target Machine Translation with Source-Contrastive and Language-Contrastive Decoding}, 
      author={Rico Sennrich and Jannis Vamvas and Alireza Mohammadshahi},
      journal={arXiv preprint arXiv:2309.07098},
      year={2023}
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Contrastive Decoding

Installation

Usage

Dataset and Models:

Evaluation

Reference

About

Releases

Packages

Contributors 3

Languages

License

ZurichNLP/ContraDecode

Folders and files

Latest commit

History

Repository files navigation

Contrastive Decoding

Installation

Usage

Dataset and Models:

Evaluation

Reference

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages