files related to converting between different corpus formats
CORD-19 json to BioBERT input format
https://github.com/AnttonLA/BINP37/tree/master/dataset_generation
BioC xml to BioC json and vice versa
https://github.com/Aitslab/BioNLP/blob/master/formatconversion/BioCxml-BioCjson.ipynb
BioBERT output format to PubAnnotation format
https://github.com/AnttonLA/BINP37/tree/master/output_generation
https://github.com/Aitslab/BioNLP/tree/master/antton/formatting
http://sourceforge.net/projects/bioc/files/BioCconvert-0.1.tar.gz/download
Comment: so far we have not been able to get this to work
https://bitbucket.org/nicta_biomed/brat2bioc
converts many formats including CoNLL and Pubannotation
https://github.com/dkpro/dkpro-core
https://github.com/inception-project
converts brat standoff format to CoNLL format
https://github.com/spyysalo/standoff2conll
Comparison of Pubtator, BioC and PubAnnotation formats
https://www.ncbi.nlm.nih.gov/CBBresearch/Lu/Demo/tmTools/Format.html
Reference: https://academic.oup.com/database/article/doi/10.1093/database/bat064/341301
http://2013.bionlp-st.org/file-formats
http://2011.bionlp-st.org/home/file-formats
http://www.nactem.ac.uk/tsujii/GENIA/SharedTask/detail.shtml#format
http://brat.nlplab.org/standoff.html
https://github.com/nlplab/brat/wiki/Annotation-Data-Format
https://www.kaggle.com/allen-institute-for-ai/CORD-19-research-challenge#json_schema.txt
http://www.pubannotation.org/docs/annotation-format/
Reference: https://dl.acm.org/doi/10.5555/2391123.2391150
https://www.ncbi.nlm.nih.gov/CBBresearch/Lu/Demo/PubTator/tutorial/
Reference: https://academic.oup.com/nar/article/41/W1/W518/1105731 and https://academic.oup.com/database/article/doi/10.1093/database/bas041/438535
Example files: