This repository contains the code for the ACL 2024 paper: Emergent Word Order Universals from Cognitively-Motivated Language Models (Kuribayashi et al., 2024).
@inproceedings{kuribayashi-etal-2024-emergent,
    title = "Emergent Word Order Universals from Cognitively-Motivated Language Models",
    author = "Kuribayashi, Tatsuki and
      Ueda, Ryo and
      Yoshida, Ryo and
      Oseki, Yohei and
      Briscoe, Ted and
      Baldwin, Timothy",
    booktitle = "Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)",
    month = aug,
    year = "2024",
    address = "Bangkok, Thailand",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2024.acl-long.781",
    doi = "10.18653/v1/2024.acl-long.781",
    pages = "14522--14543",
}
Environment: Python 3.9.18
Our experimental results (Steps 2 and 3) are stored in work/results/regression.
Starting directly from Step 4 with these data reproduces the results (figures and tables) reported in the paper.
pip install -r requirements.txt
git clone https://github.com/kpu/kenlm.git
cd kenlm
mkdir -p build
cd build
cmake ..
make -j 4
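Once built, the KenLM Python bindings (`pip install kenlm`) can be sanity-checked as below. This is only a minimal sketch, not part of the repository's pipeline; the model path is a hypothetical placeholder, and the actual n-gram experiments are driven by scripts/experiment_ngram.sh.

```python
# Minimal sanity check for the KenLM Python bindings.
# The ARPA model path is hypothetical; scripts/experiment_ngram.sh drives
# the actual n-gram experiments in this repository.
import kenlm

model = kenlm.Model("work/models/example.arpa")  # hypothetical path

# Total log10 probability of a sentence, including BOS/EOS markers.
print(model.score("the cat sat on the mat", bos=True, eos=True))

# Per-token (log10 prob, matched n-gram length, is-OOV) triples.
for prob, ngram_len, oov in model.full_scores("the cat sat on the mat"):
    print(prob, ngram_len, oov)
```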
# Step 1: Preprocessing
bash scripts/gen_data.sh
python src/load_tree_per_line.py
bash scripts/preprocess.sh
bash scripts/preprocess4fairseq.sh
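scripts/preprocess4fairseq.sh binarizes the generated corpora for fairseq. Conceptually it wraps a call along the lines of the sketch below; the paths here are hypothetical assumptions, and the script itself is authoritative.

```python
# Hypothetical sketch of fairseq binarization for language modeling; the
# actual invocation lives in scripts/preprocess4fairseq.sh and its paths
# will differ.
import subprocess

subprocess.run(
    [
        "fairseq-preprocess",
        "--only-source",                       # language modeling: no target side
        "--trainpref", "work/data/train.txt",  # hypothetical paths
        "--validpref", "work/data/valid.txt",
        "--testpref", "work/data/test.txt",
        "--destdir", "work/data-bin",
        "--workers", "4",
    ],
    check=True,
)
```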
# Step 2: Model training
bash scripts/experiment_ngram.sh
bash scripts/experiment_lms.sh
bash scripts/experiment_rnng.sh
python src/llama2.py -m meta-llama/Llama-2-7b-hf -b 4 -q 8bit  # set your Hugging Face access token in src/config.py
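For reference, loading Llama-2 in 8-bit and computing per-token surprisal with Hugging Face transformers typically looks like the sketch below. src/llama2.py is the authoritative script; the example sentence and the exact scoring details here are illustrative only.

```python
# Illustrative sketch of 8-bit Llama-2 surprisal scoring with transformers
# and bitsandbytes; src/llama2.py is the script actually used in this repo.
import math
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

name = "meta-llama/Llama-2-7b-hf"
tokenizer = AutoTokenizer.from_pretrained(name)
model = AutoModelForCausalLM.from_pretrained(
    name,
    quantization_config=BitsAndBytesConfig(load_in_8bit=True),
    device_map="auto",
)

ids = tokenizer("The cat sat on the mat.", return_tensors="pt").input_ids.to(model.device)
with torch.no_grad():
    logits = model(ids).logits

# Surprisal of each token given its preceding context, in bits.
log_probs = torch.log_softmax(logits[0, :-1], dim=-1)
surprisal = -log_probs.gather(1, ids[0, 1:, None]).squeeze(1) / math.log(2)
for tok, s in zip(tokenizer.convert_ids_to_tokens(ids[0, 1:]), surprisal.tolist()):
    print(f"{tok}\t{s:.2f}")
```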
# Step 3: Experiments
python src/export_language_stats.py
# Run the following notebooks (e.g., in Jupyter):
src/export_results.ipynb
src/export_stack_depth.ipynb
src/regression.ipynb
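As a rough illustration of the kind of analysis in src/regression.ipynb, one might regress how frequently a word-order configuration is attested across languages on the language models' processing-cost estimates. The file name and column names below are hypothetical assumptions, not the repository's actual schema.

```python
# Hypothetical sketch of the word-order/perplexity regression; the real
# analysis is in src/regression.ipynb, and the CSV path and column names
# here are illustrative assumptions only.
import pandas as pd
import statsmodels.formula.api as smf

df = pd.read_csv("work/results/regression/scores.csv")  # hypothetical file

# Does a lower modeling cost for a word-order configuration predict that
# the configuration is more frequently attested typologically?
model = smf.ols("attestation_frequency ~ perplexity + C(lm_type)", data=df).fit()
print(model.summary())
```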
# Step 4: Visualization
# Run the following notebooks (e.g., in Jupyter):
src/visualize_figures.ipynb
src/visualize_tables.ipynb
We used modified versions of the code originally released in the following repositories:
- Fully Tensorized Recurrent Neural Network Grammars (RNNGs) based on PyTorch
src/rnng-pytorch
(we added a simple RNN grammar)
- Fairseq
src/fairseq
(we added RNNs and particular LSTM/Transformer variants; work in progress)
- Transformer Grammars (this part includes work distributed under the Apache License 2.0)
src/utils
- Examining the Inductive Bias of Neural Language Models with Artificial Languages
src/artificial-langs
src/get_sentence_scores.py
work/grammar
- The World Atlas of Language Structures
work/language.csv