Multilingual extension for multitask prompted training.
This repository uses the following external repositories that need to be setup:
The finetuning script is multilingual_t0/first_run.sh
. The script is as follows.
# Model dir to save logs, ckpts, etc. in "gs://model_dir" format.
MODEL_DIR="..."
# Data dir to save the processed dataset in "gs://data_dir" format.
TFDS_DATA_DIR="..."
T5X_DIR="..." # directory where the T5X repo is cloned.
python3 ${T5X_DIR}/t5x/train.py \
--tfds_data_dir=${TFDS_DATA_DIR} \
--gin_file="t5x/examples/t5/t5_1_1/xxl.gin" \
--gin_file="t5x/configs/runs/finetune.gin" \
--module_import="multilingual_t0.tasks" \
--gin.MIXTURE_OR_TASK_NAME="'mt5_d4_gpt_sglue_train'" \
--gin.MIXTURE_OR_TASK_MODULE="'t5.data.mixtures'" \
--gin.TASK_FEATURE_LENGTHS="{'inputs': 256, 'targets': 256}" \
--gin.TRAIN_STEPS=1_125_000 \
--gin.MODEL_DIR="'${MODEL_DIR}'" \
--gin.USE_CACHED_TASKS=False \
--gin.INITIAL_CHECKPOINT_PATH="'gs://t5-data/pretrained_models/mt5/xxl/model.ckpt-1000000'"