MTTOD

This is the code for the paper "Improving End-to-End Task-Oriented Dialogue System with A Simple Auxiliary Task".

Environment setting

Our Python version is 3.6.9.

The required packages can be installed by running the following commands.

pip install -r requirements.txt
python -m spacy download en_core_web_sm
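
For example, the dependencies can be installed inside an isolated virtual environment (the directory name venv below is only illustrative):

python3 -m venv venv
source venv/bin/activate
pip install -r requirements.txt
python -m spacy download en_core_web_sm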

Data Preprocessing

For the experiments, we use MultiWOZ2.0 and MultiWOZ2.1.

  • (MultiWOZ2.0) annotated_user_da_with_span_full.json: A fully annotated version of the original MultiWOZ2.0 data released by the developers of ConvLab, available here.
  • (MultiWOZ2.1) data.json: The original MultiWOZ 2.1 data released by researchers at the University of Cambridge, available here.

We use the preprocessing scripts implemented by Zhang et al., 2020. Please see here for details.

python preprocess.py -version $VERSION
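
For example, assuming the version flag accepts values such as 2.0 and 2.1, preprocessing MultiWOZ 2.0 would look like:

python preprocess.py -version 2.0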

Training

Our implementation supports a single GPU. Please use a smaller batch size if an out-of-memory error occurs.

  • MTTOD without the auxiliary task (for the ablation)
python main.py -version $VERSION -run_type train -model_dir $MODEL_DIR
  • MTTOD with the auxiliary task
python main.py -version $VERSION -run_type train -model_dir $MODEL_DIR -add_auxiliary_task

The checkpoints will be saved at the end of each epoch (the default number of training epochs is 10).
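
For example, a training run with the auxiliary task on MultiWOZ 2.0 might look like the following (the version value and model directory name are only illustrative):

python main.py -version 2.0 -run_type train -model_dir mttod_aux -add_auxiliary_task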

Inference

python main.py -run_type predict -ckpt $CHECKPOINT -output $MODEL_OUTPUT -batch_size $BATCH_SIZE

All checkpoints are saved in $MODEL_DIR with names such as 'ckpt-epoch10'.

The result file ($MODEL_OUTPUT) will be saved in the checkpoint directory.

To reduce inference time, it is recommended to set a large $BATCH_SIZE. In our experiments, it is set to 16 for inference.
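
For example, continuing the illustrative names from the training example above (checkpoint and output file names are assumptions, not fixed by the code):

python main.py -run_type predict -ckpt mttod_aux/ckpt-epoch10 -output predictions.json -batch_size 16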

Evaluation

We use the evaluation scripts implemented by Zhang et al., 2020.

python evaluator.py -data $CHECKPOINT/$MODEL_OUTPUT
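
For example, evaluating the output produced by the illustrative inference command above:

python evaluator.py -data mttod_aux/ckpt-epoch10/predictions.json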

Acknowledgement

This code is based on the released code (https://github.com/thu-spmi/damd-multiwoz/) for "Task-Oriented Dialog Systems that Consider Multiple Appropriate Responses under the Same Context".

For the pre-trained language model, we use Hugging Face's Transformers (https://huggingface.co/transformers/index.html#).

We are grateful for their excellent work.