This is the entire dataset of our CIKM 2022 paper "SpCQL: A Semantic Parsing Dataset for Converting Natural Language into Cypher", including the train, validation, and test sets.
For example,
{
"query": "云艺文华的全称你知道是什么?",
"cypher": "match (:ENTITY{name:'云艺文华'})<-[:Relationship{name:'简称'}]-(h) return h.name",
"answer": [{"h.name": "云南艺术学院文华学院"}]
}
Since the size of the neo4j database exceeded the github upload limit requirements, we have to upload it to Baidu.com. The neo4j database used for the dataset can be downloaded via Baidu.com. Of course, the entire version of the data is also available for download on Baidu.com along with it. https://pan.baidu.com/s/1aqMZFMOOpiB1GWUo5-I7xQ?pwd=b6ix
Thank you for your support of our work. If you use this dataset, please cite it in the following format
@inproceedings{DBLP:conf/cikm/GuoLXT022,
author = {Aibo Guo and
Xinyi Li and
Guanchen Xiao and
Zhen Tan and
Xiang Zhao},
editor = {Mohammad Al Hasan and
Li Xiong},
title = {SpCQL: {A} Semantic Parsing Dataset for Converting Natural Language
into Cypher},
booktitle = {Proceedings of the 31st {ACM} International Conference on Information
{\&} Knowledge Management, Atlanta, GA, USA, October 17-21, 2022},
pages = {3973--3977},
publisher = {{ACM}},
year = {2022},
url = {https://doi.org/10.1145/3511808.3557703},
doi = {10.1145/3511808.3557703},
timestamp = {Tue, 21 Mar 2023 20:54:47 +0100},
biburl = {https://dblp.org/rec/conf/cikm/GuoLXT022.bib},
bibsource = {dblp computer science bibliography, https://dblp.org}
}