- (2022-09) News Summarization and Evaluation in the Era of GPT-3 paper
-
(2023-01) How Close is ChatGPT to Human Experts? Comparison Corpus, Evaluation, and Detection paper | project
-
(2023-01) Is ChatGPT A Good Translator? A Preliminary Study paper | code
❗ They only randomly select 50 sentences for evaluation, since there is no available API.
-
(2023-01) Benchmarking Large Language Models for News Summarization paper
-
(2023-02) Is ChatGPT a General-Purpose Natural Language Processing Task Solver? paper
❗ No large dataset evaluation, no few-shot in-context learning evaluation, due to lack of API.
-
(2023-02) ChatGPT: Jack of all trades, master of none paper
-
(2023-02) Can ChatGPT Understand Too? A Comparative Study on ChatGPT and Fine-tuned BERT paper
-
(2023-02) On the Robustness of ChatGPT: An Adversarial and Out-of-distribution Perspective paper
-
(2023-02) Exploring the Limits of ChatGPT for Query or Aspect-based Text Summarization paper
-
(2023-03) How Robust is GPT-3.5 to Predecessors? A Comprehensive Study on Language Understanding Tasks. paper
-
(2023-02) ChatGPT: potential, prospects, and limitations paper