This repository contains the code for generating patent claims from patent descriptions. It supports fine-tuning and multi-task fine-tuning of the Llama-3 model, and it also includes inference code for the fine-tuned Llama-3, models hosted on Hugging Face, and GPT-series models from OpenAI.
The original dataset can be downloaded here.
To prepare the dataset for fine-tuning:
cd src
python data_setup.py --directory [training data directory] --out_path [output directory]
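The exact preprocessing depends on the dataset layout, but the idea is to pair each patent description with its reference claims as a supervised training sample. The sketch below is a hypothetical illustration of that pairing; the field names description and claims and the output file name are assumptions, not the repository's actual schema used by data_setup.py.

```python
# Hypothetical sketch of description -> claims pairing; field names and file
# layout are assumed, not taken from the repository's actual data_setup.py.
import json
from pathlib import Path

def build_samples(directory: str, out_path: str) -> None:
    samples = []
    for file in Path(directory).glob("*.json"):
        patent = json.loads(file.read_text())
        samples.append({
            "instruction": "Generate patent claims for the following description.",
            "input": patent["description"],   # assumed field name
            "output": patent["claims"],       # assumed field name
        })
    Path(out_path).mkdir(parents=True, exist_ok=True)
    with open(Path(out_path) / "train.jsonl", "w") as f:
        for sample in samples:
            f.write(json.dumps(sample) + "\n")
```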
To set up the environment and launch fine-tuning:
sh init_env.sh
cd src
accelerate launch --config_file accelerate_ds_config.yaml pefts/mft_accelerate.py --train_config configs/lora_train_config.json --distributed_type "DeepSpeed"
Change configs/lora_train_config.json for specific settings.
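The training hyperparameters are controlled through that JSON file, which is consumed by the MFTCoder training script. As a rough indication of the kind of LoRA settings involved, the following peft snippet shows typical values for a Llama-style model; the actual keys and defaults in configs/lora_train_config.json may differ.

```python
# Illustrative LoRA hyperparameters for a Llama-style model; the values are
# examples, not the repository's actual configuration.
from peft import LoraConfig

lora_config = LoraConfig(
    r=16,                       # rank of the LoRA update matrices
    lora_alpha=32,              # scaling factor
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
```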
Inference of fine-tuned models with LoRA adapters:
python hf_inference.py --base_model [path to the base model] --peft_path [path to the trained adapter] --directory [test data directory]
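Under the hood, adapter inference follows the standard transformers + peft loading pattern. The sketch below is a minimal illustration of that pattern; the model name, adapter path, prompt, and generation settings are placeholders, not the exact contents of hf_inference.py.

```python
# Minimal sketch of LoRA-adapter inference with transformers + peft.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base_model = "meta-llama/Meta-Llama-3-8B-Instruct"   # example base model
peft_path = "outputs/lora_adapter"                    # example adapter path

tokenizer = AutoTokenizer.from_pretrained(base_model)
model = AutoModelForCausalLM.from_pretrained(
    base_model, torch_dtype=torch.bfloat16, device_map="auto"
)
model = PeftModel.from_pretrained(model, peft_path)   # attach the trained LoRA adapter

prompt = "Generate patent claims for the following description:\n..."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=512)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```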
Inference of models on Hugging Face:
python pipeline_inference.py --model [path to the model] --directory [test data directory]
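For models loaded directly from the Hugging Face Hub, a minimal text-generation pipeline looks roughly like this; the model name and prompt are placeholders rather than the exact contents of pipeline_inference.py.

```python
# Minimal sketch of Hub-model inference with the transformers pipeline API.
from transformers import pipeline

generator = pipeline(
    "text-generation",
    model="meta-llama/Meta-Llama-3-8B-Instruct",  # example model
    device_map="auto",
)
result = generator(
    "Generate patent claims for the following description:\n...",
    max_new_tokens=512,
)
print(result[0]["generated_text"])
```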
Inference of the GPT series from OpenAI:
python openai_inference.py --openai_key [your key] --model_name [model name] --data_dir [test data directory]
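The OpenAI script wraps the Chat Completions API. A minimal call looks like the following; the model name, system message, and prompt are placeholders, not the exact contents of openai_inference.py.

```python
# Minimal sketch of claim generation through the OpenAI Chat Completions API.
from openai import OpenAI

client = OpenAI(api_key="sk-...")  # or set the OPENAI_API_KEY environment variable
response = client.chat.completions.create(
    model="gpt-4o",  # example model name
    messages=[
        {"role": "system", "content": "You are a patent attorney drafting claims."},
        {"role": "user", "content": "Generate patent claims for the following description:\n..."},
    ],
)
print(response.choices[0].message.content)
```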
To evaluate the generated outputs:
python eval.py --model_name [model outputs for evaluation] --data_dir [test data directory]
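eval.py compares model outputs against the reference claims. As an illustration of this kind of reference-based scoring (the metrics actually reported by eval.py may differ), ROUGE scores can be computed with the rouge_score package:

```python
# Illustrative reference-based scoring with ROUGE; eval.py may use additional
# or different metrics.
from rouge_score import rouge_scorer

scorer = rouge_scorer.RougeScorer(["rouge1", "rouge2", "rougeL"], use_stemmer=True)
reference = "1. A device comprising ..."    # gold claims
prediction = "1. An apparatus comprising ..."  # model-generated claims
scores = scorer.score(reference, prediction)
print({name: round(score.fmeasure, 4) for name, score in scores.items()})
```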
Sample outputs are shown in the results folder.
The code is built on MFTCoder, a multi-task fine-tuning framework. Detailed fine-tuning instructions can be found there.
If you find our work useful for your research, please feel free to cite our paper:
@article{jiang2024can,
  title={Can Large Language Models Generate High-quality Patent Claims?},
  author={Lekang Jiang and Caiqi Zhang and Pascal A Scherz and Stephan Goetz},
  journal={arXiv preprint arXiv:2406.19465},
  year={2024}
}