Authors: Ning Ding, Guangwei Xu, Yulin Chen, Xiaobin Wang, Xu Han, Pengjun Xie, Hai-Tao Zheng, Zhiyuan Liu
Institutions: Tsinghua University
IEEE 2021: Compressor Fault Diagnosis Knowledge: A Benchmark Dataset for Knowledge Extraction From Maintenance Log Sheets Based on Sequence Labeling
Authors: Tao Chen, Jiang Zhu, Zhiqiang Zeng, Xudong Jia
Institutions: Wuyi University, California State University
AfricalNLP 2021: MasakhaNER: Named Entity Recognition for African Languages
Authors: David Ifeoluwa Adelani, etc.
Institutions: Saarland University, Carnegie Mellon University, etc.
Authors: Majid Asgari-Bidhendi, Behrooz Janfada, Omid Reza Roshani Talab, Behrouz Minaei-Bidgoli
Institutions: Iran University of Science and Technology
arXiv 2020: A Semantically Enriched Dataset based on Biomedical NER for the COVID19 Open Research Dataset Challenge
Authors: Hermann Kroll, Jan Pirklbauer, Johannes Ruthmann, Wolf-Tilo Balke
Institutions: TU Braunschweig
Authors: Liang Xu, Yu tong, Qianqian Dong, Yixuan Liao, Cong Yu, Yin Tian, Weitang Liu, Lu Li, Caiquan Liu, Xuanwei Zhang
Institutions: CLUE Organization
Authors: Alex Brandsen, Suzan Verberne, Milco Wansleeben, Karsten Lambers
Institutions: Leiden University
Authors: Elena Leitner, Georg Rehm, Julian Moreno-Schneider
Institutions: DFKI GmbH
Authors: Siti Oryza Khairunnisa, Aizhan Imankulova, Mamoru Komachi
Institutions: Tokyo Metropolitan University
IEEE 2020: Developing Name Entity Recognition for Structured and Unstructured Text Formatting Dataset
Authors: Nadhia Salsabila Azzahra, Muhammad Okky Ibrohim, Junaedi Fahmi, Bagus Fajar Apriyanto, Oskar Riandi
Institutions: Telkom University, Universitas Indonesia, PT. Bahasa Kinerja Utama
arXiv 2020: Development of a Dataset and a Deep Learning Baseline Named Entity Recognizer for Three Low Resource Languages: Bhojpuri, Maithili and Magahi
Authors: Rajesh Kumar Mundotiya, Shantanu Kumar, Ajeet kumar, Umesh Chandra Chaudhary, Supriya Chauhan, Swasti Mishra, Praveen Gatla, Anil Kumar Singh
Institutions: IIT (BHU), Banaras Hindu University, Cognizant
Authors: Wazir Ali, Junyu Lu, Zenglin Xu
Institutions: University of Electronic Science and Technology of China
EMNLP 2020: XGLUE: A New Benchmark Dataset for Cross-lingual Pre-training, Understanding and Generation
Authors: Yaobo Liang, etc.
Institutions: Microsoft
Authors: Xuan Wang, Xiangchen Song, Bangzheng Li, Yingjun Guan, Jiawei Han
Institutions: University of Illinois at Urbana-Champaign
Authors: Nasser Alshammari, Saad Alanazi
Institutions: Jouf University
COLING 2020: IndoLEM and IndoBERT: A Benchmark Dataset and Pre-trained Language Model for Indonesian NLP
Authors: Fajri Koto, Afshin Rahimi, Jey Han Lau, Timothy Baldwin
Institutions: The University of Melbourne, The University of Queensland
Authors: Nicky Ringland, Xiang Dai, Ben Hachey, Sarvnaz Karimi, Cecile Paris, James R. Curran
Institutions: University of Sydney, CSIRO Data61, Digital Health CRC
CoNLL 2019: BIOfid Dataset: Publishing a German Gold Standard for Named Entity Recognition in Historical Biodiversity Literature
Authors: Sajawel Ahmed, Manuel Stoeckel, Christine Driller, Adrian Pachzelt, Alexander Mehler
Institutions: Goethe University Frankfurt, Senckenberg Nature Research Society, Frankfurt University Library
Authors: Dilek Küçük, Fazli Can
Institutions: TUBITAK Energy Institute, Bilkent University
Authors: Hanieh Poostchi, Ehsan Zare Borzeshi, Massimo Piccardi
Institutions: University of Technology Sydney
Authors: Ika Alfina, Septiviana Savitri, Mohamad Ivan Fanany
Institutions: Faculty of Computer Science Universitas Indonesia
arXiv 2017: A Discourse-Level Named Entity Recognition and Relation Extraction Dataset for Chinese Literature Text
Authors: Jingjing Xu, Ji Wen, Xu Sun, Qi Su
Institutions: PeKing University