Skip to content

Test Set for Phrasing Break Assessment for ESL learners with Pre-trained Language Models

Notifications You must be signed in to change notification settings

Chris0King/phrasing-break-assessment

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

23 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

A non-native English corpus for phrasing-break-assessment

Introduction

This corpus aims to provide a free public dataset for the phrasing break assessment task. we collected 800 audio samples from Chinese ESL learners. Then two linguists are invited to assess them with overall performance on phrasing break and each individual phrasing break ranging from 1 (Poor), 2 (Fair), and 3 (Great). If two experts' opinions are inconsistent, an extra linguist is asked to do the final scoring.

Annotator Recruitment Requirement

Speech Ocean has assisted us in the labeling work, which selected the teachers with rich teaching experience to annotate the data.

The scoring metric

The linguists score at two levels: overall-sentence-level and single-break-level. It is worth mentioning that the scoring ignores the words pronouncing accuracy.

overall-sentence-level

Assess the overall performance on phrase breaks in the sentence.

Score range: 1-3

  • 3: the utterance of the sentence is fluent and all phrase breaks are reasonable
  • 2: the utterance of the sentence is fluent and there is no pause in all phrase breaks
  • 1: some phrase breaks in the sentence are inappropriate which influence the understanding of the sentence

single-break-level

Assess each individual phrasing breaks in the sentence.

Score range: 1-3

  • 3: the phrase break is perfect
  • 2: the phrase break is fair and can to be improved
  • 1: the phrase break is wrong

Dataset Statistics

Score distribution

The statistics of collected corpus is listed in following table.

Dataset Poor Fair Great Total
Overall 21 136 643 800
Fine-grained 129 644 10797 11570

Sentence distribution

We collected 800 utterances of 19 different sentences of various lengths, and the detail distribution is shown in the following table.

No. Sentence Num
1 Their poor parents were just starving as usual. 31
2 One afternoon, he thought up a good plan to have fun. 40
3 Cinderella danced so joyfully that she forgot the time. 31
4 They made a choice and went out in search of food. 42
5 Plenty of complicated thoughts about the rainbow have been formed. 32
6 The following week, the same policeman sees the same man with the tiger again. 52
7 His mother noticed his left shoe was on his right foot. 28
8 One spring day some naughty boys were playing near a pond. 42
9 Our class is going to put on a show on may seventh. 35
10 Now, can you give me a few words to describe the body shape of this girl? 55
11 The man replied, "i did we had such a good time we are going to the beach this weekend!" 63
12 His wife says, "no problem but i have to find all the money you have hidden from me. 44
13 There was once a poor boy, and he watched his sheep in the fields next to a dark forest. 39
14 The frogs in the pond were very afraid of the children because the stones killed some of them. 57
15 One day a crow stood on a branch near his nest and felt very happy with the meat in his mouth. 56
16 But the mouse finally got out of her mouth and ran away since she could not bite it. 45
17 Then the old woman became very angry at the cat and began to hit her for she had not killed the mouse. 30
18 Until the fox thought highly of the crow's beautiful voice, the crow felt quite proud and finally opened his mouth to sing. 36
19 "Dad!" he puffed, "is it true that an apple a day keeps the doctor away?" 41
total 800

Data structure

The following tree shows the file structure of this corpus:

├───cross-validation-testset
│   ├───testset.csv
│   ├───coarse_grain_res.csv
│   ├───fine_grain_res.csv
│   ├───coarse_grain_set
│   │       coarse_grain_testset_0.csv
│   │       coarse_grain_testset_1.csv
│   │       coarse_grain_testset_2.csv
│   │       coarse_grain_testset_3.csv
│   │       coarse_grain_testset_4.csv
│   │       coarse_grain_devset_0.csv
│   │       coarse_grain_devset_1.csv
│   │       coarse_grain_devset_2.csv
│   │       coarse_grain_devset_3.csv
│   │       coarse_grain_devset_4.csv
│   │       coarse_grain_trainset_0.csv
│   │       coarse_grain_trainset_1.csv
│   │       coarse_grain_trainset_2.csv
│   │       coarse_grain_trainset_3.csv
│   │       coarse_grain_trainset_4.csv
│   │
│   ├───fine_grain_set
│   │       fine_grain_devset_0.csv
│   │       fine_grain_devset_1.csv
│   │       fine_grain_devset_2.csv
│   │       fine_grain_devset_3.csv
│   │       fine_grain_devset_4.csv
│   │       fine_grain_testset_0.csv
│   │       fine_grain_testset_1.csv
│   │       fine_grain_testset_2.csv
│   │       fine_grain_testset_3.csv
│   │       fine_grain_testset_4.csv
│   │       fine_grain_trainset_0.csv
│   │       fine_grain_trainset_1.csv
│   │       fine_grain_trainset_2.csv
│   │       fine_grain_trainset_3.csv
│   │       fine_grain_trainset_4.csv
│   │
│   └───gpt
│       │   testset_0.csv
│       │   testset_1.csv
│       │   testset_2.csv
│       │   testset_3.csv
│       │   testset_4.csv
│       │   trainset_0.csv
│       │   trainset_1.csv
│       │   trainset_2.csv
│       │   trainset_3.csv
│       │   trainset_4.csv
│       │   prompts.md
│       └───res
├───resource
│   ├───A-res_form
│   ├───B-res_form
│   └───C-res_form
├───wavs
└───metadata.csv

The wavs folder contains our all wavs resource and match the metadata.csv.

The resource folder contains the scores from three linguists label A, B and C according to three subfolders. A and B are invited to assess them. If two experts' opinions are inconsistent, an extra linguist C is asked to do the final scoring.

The cross-validation-testset folder contains trainset, devset and testset in fine-tune step and chatgpt scenario. In the gpt testset, columns gpt_score and unexcept_break represent the responses of chatgpt in zero-shot learning senario. Columns few-shot-score and few-shot-unexcept_break represent the responses of chatgpt in few-shot learning senario.

About

Test Set for Phrasing Break Assessment for ESL learners with Pre-trained Language Models

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published