Code for our ACL 2021 paper: *ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer* ([arXiv:2105.11741](https://arxiv.org/abs/2105.11741)).

## Requirements
```
torch==1.6.0
cudatoolkit==10.0.103
cudnn==7.6.5
sentence-transformers==0.3.9
transformers==3.4.0
tensorboardX==2.1
pandas==1.1.5
sentencepiece==0.1.85
matplotlib==3.4.1
apex==0.1.0
```
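As a quick sanity check (not part of the original repo), the pinned versions and GPU availability can be verified in Python before training:

```python
# Hypothetical environment sanity check; expected versions follow the
# pinned requirements above.
import torch
import transformers
import sentence_transformers

print("torch:", torch.__version__)                                   # expect 1.6.0
print("transformers:", transformers.__version__)                     # expect 3.4.0
print("sentence-transformers:", sentence_transformers.__version__)   # expect 0.3.9
print("CUDA available:", torch.cuda.is_available())                  # training assumes a CUDA GPU
```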
To install apex, run:

```
git clone https://github.com/NVIDIA/apex
cd apex
pip install -v --disable-pip-version-check --no-cache-dir ./
```
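Where mixed-precision training is enabled, apex's `amp` API is typically used along these lines. This is a generic sketch with placeholder model and optimizer, not the repo's actual training loop:

```python
# Generic apex.amp usage sketch; model, optimizer, and loss below are
# placeholders, not the repo's training code.
import torch
from apex import amp

model = torch.nn.Linear(768, 768).cuda()           # placeholder model
optimizer = torch.optim.Adam(model.parameters())   # placeholder optimizer

# Wrap model and optimizer for mixed-precision (O1) training.
model, optimizer = amp.initialize(model, optimizer, opt_level="O1")

loss = model(torch.randn(8, 768).cuda()).mean()    # placeholder loss
with amp.scale_loss(loss, optimizer) as scaled_loss:
    scaled_loss.backward()                         # backward pass on the scaled loss
optimizer.step()
```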
## Get Started

- Download a pre-trained language model (e.g. `bert-base-uncased`) from HuggingFace's library to the `./bert-base-uncased` folder.
- Download the STS datasets to the `./data` folder by running `cd data && bash get_transfer_data.bash`. The script is modified from the SentEval toolkit.
- Run the scripts in the `./scripts` folder to reproduce our experiments. For example, train the unsupervised consert-base model with `bash scripts/unsup-consert-base.sh` (an inference sketch follows below).
## Pre-trained Models & Results

ID | Model | STS12 | STS13 | STS14 | STS15 | STS16 | STSb | SICK-R | Avg. |
---|---|---|---|---|---|---|---|---|---|
- | bert-base-uncased (baseline) | 35.20 | 59.53 | 49.37 | 63.39 | 62.73 | 48.18 | 58.60 | 53.86 |
- | bert-large-uncased (baseline) | 33.06 | 57.64 | 47.95 | 55.83 | 62.42 | 49.66 | 53.87 | 51.49 |
1 | unsup-consert-base [Google Drive] [Baidu Cloud, code: q571] | 64.64 | 78.49 | 69.07 | 79.72 | 75.95 | 73.97 | 67.31 | 72.74 |
2 | unsup-consert-large [Google Drive] [Baidu Cloud, code: 9fm1] | 70.28 | 83.23 | 73.80 | 82.73 | 77.14 | 77.74 | 70.19 | 76.45 |
3 | sup-sbert-base (re-impl.) [Google Drive] [Baidu Cloud, code: msqy] | 69.93 | 76.00 | 72.15 | 78.59 | 73.53 | 76.10 | 73.01 | 74.19 |
4 | sup-sbert-large (re-impl.) [Google Drive] [Baidu Cloud, code: 0oir] | 73.06 | 77.77 | 75.21 | 81.63 | 77.30 | 79.74 | 74.75 | 77.07 |
5 | sup-consert-joint-base [Google Drive] [Baidu Cloud, code: jks5] | 70.92 | 79.98 | 74.88 | 81.76 | 76.46 | 78.99 | 78.15 | 77.31 |
6 | sup-consert-joint-large [Google Drive] [Baidu Cloud, code: xua4] | 73.15 | 81.45 | 77.04 | 83.32 | 77.28 | 81.15 | 78.34 | 78.82 |
7 | sup-consert-sup-unsup-base [Google Drive] [Baidu Cloud, code: 5mc8] | 73.02 | 84.86 | 77.32 | 82.70 | 78.20 | 81.34 | 75.00 | 78.92 |
8 | sup-consert-sup-unsup-large [Google Drive] [Baidu Cloud, code: tta1] | 74.99 | 85.58 | 79.17 | 84.25 | 80.19 | 83.17 | 77.43 | 80.68 |
9 | sup-consert-joint-unsup-base [Google Drive] [Baidu Cloud, code: cf07] | 74.46 | 84.19 | 77.08 | 83.77 | 78.55 | 81.37 | 77.01 | 79.49 |
10 | sup-consert-joint-unsup-large [Google Drive] [Baidu Cloud, code: v5x5] | 76.93 | 85.20 | 78.69 | 85.44 | 79.34 | 82.93 | 76.71 | 80.75 |
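To sanity-check a downloaded checkpoint against a column of this table, one plausible recipe (not the repo's evaluation script) is to score sentence pairs by cosine similarity and compute the Spearman correlation with the gold labels. `pairs` below is a hypothetical stand-in for a full STS test split:

```python
# Rough STS evaluation sketch; `pairs` is a hypothetical placeholder and
# should be replaced with the full (sentence1, sentence2, gold_score)
# triples of an STS test split.
import numpy as np
from scipy.stats import spearmanr
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("path/to/saved/consert-checkpoint")  # placeholder path
pairs = [
    ("A man is playing a guitar.", "Someone is performing music.", 4.2),
    ("A dog runs in the park.", "The stock market fell today.", 0.1),
]

emb1 = model.encode([p[0] for p in pairs])
emb2 = model.encode([p[1] for p in pairs])
cos = np.sum(emb1 * emb2, axis=1) / (
    np.linalg.norm(emb1, axis=1) * np.linalg.norm(emb2, axis=1)
)
rho, _ = spearmanr(cos, [p[2] for p in pairs])
print("Spearman:", rho)  # table values above are Spearman x 100
```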
Note:
- All the base models are trained from `bert-base-uncased`, and all the large models are trained from `bert-large-uncased`.
- For the unsupervised transfer, we merge all unlabeled texts from the 7 STS datasets (STS12-16, STSbenchmark and SICK-Relatedness) as the training data (89192 sentences in total), and use the STSbenchmark dev split (1500 human-annotated sentence pairs) to select the best checkpoint.
- The sentence representations are obtained by averaging the token embeddings of the last two layers of BERT (see the pooling sketch below).
- For models 2 to 10, we re-trained them on a single GeForce RTX 3090 with PyTorch 1.8.1 and CUDA 11.1 (rather than the V100 with PyTorch 1.6.0 and CUDA 10.0 used in our initial experiments) and changed `max_seq_length` from 64 to 40 to reduce the required GPU memory (only for the large models). Consequently, the results shown here may differ slightly from those reported in our paper.
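The last-two-layers average pooling described in the notes can be sketched with a generic transformers recipe; this illustrates the pooling strategy only and is not the repo's exact code:

```python
# Sketch of "average token embeddings of the last two layers" pooling,
# using a generic transformers recipe (not the repo's implementation).
import torch
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModel.from_pretrained("bert-base-uncased", output_hidden_states=True)

inputs = tokenizer(["A man is playing a guitar."], return_tensors="pt", padding=True)
with torch.no_grad():
    outputs = model(**inputs, return_dict=True)

# hidden_states is a tuple: (embeddings, layer1, ..., layer12) for bert-base.
last_two = (outputs.hidden_states[-1] + outputs.hidden_states[-2]) / 2.0

# Mask out padding tokens before averaging over the sequence dimension.
mask = inputs["attention_mask"].unsqueeze(-1).float()
sentence_embedding = (last_two * mask).sum(dim=1) / mask.sum(dim=1)
print(sentence_embedding.shape)  # (1, 768)
```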
To be added.
## Citation

```bibtex
@article{yan2021consert,
  title={ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer},
  author={Yan, Yuanmeng and Li, Rumei and Wang, Sirui and Zhang, Fuzheng and Wu, Wei and Xu, Weiran},
  journal={arXiv preprint arXiv:2105.11741},
  year={2021}
}
```