ClinicalGPT-2

In this project, we finetune the popular GPT-2 model on MIMIC-III clinical notes to build a language generative model that can produce semi-plausible clinical notes. The objective is to be able to build an automated auto-complete for faster, more efficient, and more personalized real-time clinical visit documentation. Through this, we aim to help reduce physician burnout.

Our models and experiments make use of Hugging Face libraries/blogs/custom Dataset objects and a GPT-2 finetuning tutorial by Phil Schmid. For more information and full credits, please see our writeup. The finetuning data is sampled off of NOTEEVENTS, DIAGNOSES_ICD, ADMISSIONS, and D_ICD_DIAGNOSES data tables from MIMIC-III. We thank all our references for their contributed data, ideas, and guidance.

This project is a collaboration between Jiying Zou ([email protected]/[email protected]) and Diego Saldana ([email protected]).

Data citation: MIMIC-III, a freely accessible critical care database. Johnson AEW, Pollard TJ, Shen L, Lehman L, Feng M, Ghassemi M, Moody B, Szolovits P, Celi LA, and Mark RG. Scientific Data (2016). DOI: 10.1038/sdata.2016.35.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
Code		Code
Eval		Eval
Generated_Text		Generated_Text
ClinicalGPT2_Writeup.pdf		ClinicalGPT2_Writeup.pdf
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ClinicalGPT-2

About

Releases

Packages

Languages

License

jiyingz/clinicalGPT-2

Folders and files

Latest commit

History

Repository files navigation

ClinicalGPT-2

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages