This open-source project aims to provide a sample template for data labeling. Users could implement other features based on this project, e.g., uploading data, or modifying the UI.
You are welcome to bring up any encountered issues. Microsoft Azure Form Recognizer team would update the source code periodically.
Labeling UX is a user-friendly web application tool for you to label your document through creating label key and assigning the label value to each label key. There are several different types of label key, including "Field", "Selection mark", "Signature" and "Table". By using label key of type "Table", you can label a table with customized name of every column/row as well.
To get started, you would need to clone the repo and run the application on your local machine. Also, you would need to set up input data before you could start labeling.
Running Labeling UX application requires NodeJS and NPM.
Required version of NodeJS : ^12.22.0 || ^14.17.0 || ^16.10.0 || >=17.0.0
For development, it is suggested to use TypeScript 4.7.4.
git clone https://github.com/microsoft/Form-Recognizer-Toolkit.git
Run the below command under root folder to install node packages for Server
cd Form-Recognizer-Toolkit/SampleCode/LabelingUX
npm i
Run the command below to install node packages for Client
cd Client
npm i
Run the command below to switch back to root folder and start the application, which runs the server at port 4000 and the client application at port 3000.
cd ..
npm run dev
Now, you should see "Compiled successfully!" in your terminal, and the application should automatically open in your default browser with URL : http://localhost:3000/label
Now, since we have not set up input data, the Labeling UX application should look like this:
All the labeling data will be read from and write into Server/data
.
In order to experience Labeling UX project, you will need to add the files mentioned below in Server/data
folder:
- Documents you would like to label. (Supported types of document for labeling: PDF, JPG, JPEG, PNG, TIFF, TIF)
- Corresponding
.ocr.json
files for these documents. - (Optional) If you already have the
.labels.json
files for corresponding documents orfields.json
file for your labeled documents. You could add these files intodata
folder as well.
Note:
- You will need to provide
.ocr.json
files for corresponding documents to start labeling. If you don't have.ocr.json
files for the documents, check out the instruction at the end of this README -- "How to create.ocr.json
file for your documents?"..labels.json
files andfields.json
file will be automatically generated when you start labeling your documents.
The input data should be similar as below:
SampleLabelingUX
│ README.md
│
└───Server
│ │
│ └───data
│ │ example1.pdf
│ │ example1.pdf.ocr.json
│ │ example1.pdf.labels.json (optional)
│ │ example2.pdf
│ │ example2.pdf.ocr.json
│ │ example2.pdf.labels.json (optional)
│ │ fields.json (optional)
│ │ ...
│
└───Client
After setting up input data, refresh the application webpage and you should see Labeling UX application similar to the screenshot below, with thumbnails of your documents in the left pane and the first document in the canvas of middle pane.
Note:
If you provide the correct corresponding
.ocr.json
files for your documents, your document rendered in the middle canvas pane should include:
- Light-yellow background for all the text in your documents
- Table icon on the top-left corner for each table in your documents
- Pink border for all the selection marks in your documents
You should be able to toggle the layer filter to hide/show different layers as shown below.
Reminder:
If you have added
ocr.json
file for your document, yet you are not able to see the light-yellow background for all the text nor tables icon on each table of your documents. Please check the naming for yourocr.json
file. It should be<document_name>.<document_extension>.ocr.json
. For example: The name ofocr.json
file for document namedinvoiceA.pdf
should beinvoiceA.pdf.ocr.json
If everything works as described above, you are ready to start labeling.
Before starting the detailed tutorial regarding how to label your document. Let's have a quick tour of the Labeling UX application. Labeling UX application basically consist of three pane:
- Label Canvas (Middle part of application)
- This is where you perform specifying label value and inspect label value of your document.
- On the top-left corner of label canvas, there is a "Region" button, which allow you to draw region for specifying label value of an area in the document.
- On the top right corner, there is a layer filter icon, which allows you to toggle "Text", "Table" and "Selection mark" layer.
- On the bottom-center, there is a page control tool, which allow you to switch to a different page of document.
- On the bottom-right corner, there is a set of tool for you to zoom in/zoom-out/zoom to fit and rotate the document.
- Document gallery (left pane)
- This is where all the thumbnails of your documents would be displayed.
- By clicking on thumbnail of document, you can switch the document rendered in the label canvas.
- While hovering on the thumbnail of your document, a "Delete" icon would be displayed for you to delete a document. Notice that the deletion will not only delete the document file but also delete its corresponding
ocr.json
andlabels.json from
Server/data` folder.
- Label Pane (right pane)
- This is where all label keys and label values are displayed.
- You could create label key and assign label value to a label key in this pane.
To label a document is to assign key-value pairs for a document. For example, if there is text Due date: 12/15/2021
in your document, you might want to create a label key Due date
and assign label value 12/15/2021
to it. With that said, to label a document consists of three main actions:
- Creating a label key
- Specifying label value
- Assigning the specified label value to a label key
In this section, how each action could be performed using Labeling UX application will be elaborated.
Notice that table labeling is more complex and will be explained in a separate topic at the end of this section.
With ocr.json
provided for the document you would like to label, Labeling UX application is able to get the bounding boxes and the content value for all the text, selection marks and tables. There are several approaches to specify label value in your document.
You could click on the bounding box of the text (which is of light-yellow background) or selection mark (which is of pink border). The text being clicked should change into green background while the selection mark being clicked should turn with pink background, which means it is selected.
You could continue clicking on another bounding box to specify multiple words/selection marks in your selection. Notice that for label key of selection mark type, only single selection mark to be assigned to it.
To unselect, simply click on a selected bounding box again.
Besides clicking on all bounding boxes one by one, we can also perform multiple selection by continuously mouse down and swipe through all the text you would like to select.
To perform a group selection, press "Shift" key and drag the "+" cursor to specify the area in which all text you would like to selected.
You can also press "Shift" key and drag the cursor to the area of selected bounding box to perform unselecting.
Click on the "Region" button on the top-left corner of middle canvas pane to enter region-drawing mode. Drag the "+" cursor on the document to draw the region you would like to label.
To modify the area of a drawn region, hover on the corner of drawn region and drag the vertex.
To delete the region you drew, hover on the drawn region. There should be a "x" icon on the top-right corner of the region. Click on the "x" for region deletion.
To leave region-drawing mode, click on the "Region" button again.
Notice that only one region can be assigned to a label key or table cell.
In order to label a document, you need to create label keys and assign the label value to a label key. In Labeling UX application, there are four types of label key:
- Field: for labeling string, number, date, time and integer
- Selection Mark: for label value with a single selection mark (bounding box with pink border) or a "Region"
- Signature: for label value specified with single "Region"
- Table: perform table labeling
There are two approaches to create a label key, as described below.
The below action applies for label key of type "Field", "Selection mark" and "Signature". Creation of label key of type "Table" will be explained in a separate sub-section later.
At the top label pane, there is a "+" button. Click on the button and choose which type of label key you would like to create. Then, type the name of the label key and hit "Enter". The label key should now be created and displayed on the label pane.
Note: This approach is not applicable for table labeling.
After selecting bounding boxes or finishing drawing a region, a pop-up will be displayed next to the bounding boxes/region. Enter the name of the label key in the text field of the pop-up and click on the type of label key you would like to create.
This will not only create a new label key but also assign the label value you just selected to this newly created label key, as shown in the GIF below.
After you have the label key created and label value specified, you can now assign the label value to the corresponding label key.
The below is the applicable label value that can be assigned to each type of label key.
- Field:
- Bounding boxes of text
- Bounding boxes of selection mark
- Single draw region
- Selection Mark:
- Single bounding box of selection mark
- Single draw region
- Signature:
- Single draw region
After specifying label value through bounding boxes or region, a pop-up will be displayed next to the bounding boxes/region, which include a list of applicable label keys you have created. Click on the label key you would like to assign the label value to.
Notice that the list of label key in the pop-up filter out all the inapplicable label keys. For example, the label key with type "table", "selection mark" and "signature" will not be displayed when you specify label value with text bounding boxes, since text is not allowed to assign to these three types of label key.
After specifying label value, you can click on the label key you would like to assign the label value to in the label pane.
Notice that if the label key you click on is not applicable to the label value you specified. There will be a "Label assignment warning" message modal which provide simple instruction on what label value is applicable for this label key.
After assigning label, you might want to inspect and possibly modify the labels you have assigned. How these could be performed is described as below.
To inspect a label of type "field", "signature", and "selection mark", simply hover on the label key in label pane. The corresponding label value will be highlighted with thicker border. You could also distinguish the label value of different label key by colors.
For "field" type label key, of which label value with bounding boxes is assigned, you are able to add more label value to a label key simply by selecting additional text with bounding boxes and assign to the label key through clicking the label key in pop-up or label pane. This action will not overwrite the label value you previously assigned but add the newly-selected value and merge with the original one.
If a label key is assigned with value specified by bounding boxes and you are now specifying a value with region drawn, the original value from bounding boxes will be overwritten.
If a label key is assigned with value specified by region and you are now specifying a value with either another drawn region or bounding boxes, the original drawn region value will be overwritten with the new value of the new value from bounding box.
If you would like re-select/re-draw a label value, you can delete the label value you originally assigned to a label key simply by clicking on the "x" icon on the right of label value.
This action applies for all types of label key, which are "Field", "Selection mark", "Signature" and "Table".
To rename a label key. Click on the three-dot icon on the right of the label key. Click on the "Rename" option and entering new name for label key. (Hit "Esc" if you don't want to rename the label key) After entering the new name, hit "Enter", then a confirmation modal will be displayed.
This action applies for all types of label key, which are "Field", "Selection mark", "Signature" and "Table".
To delete a label key. Click on the three-dot icon on the right of the label key. Click on "Delete" and a confirmation modal will be displayed.
Notice that deleting a label key will also delete all the label value assigned to this label key in all your documents.
This action applies only to label key of type "Field". For label key with type "Selection mark", "Signature" and "Table", "Sub type" option will be disabled.
To assign sub type for a label key. Click on the three-dot icon on the right of label key. Hover on "Sub type" and a menu of all sub type options will be displayed. Click on the sub type that suit your label value.
We have discussed how label of type "Field", "Selection mark" and "Signature" can be created and assigned. In this section, we will discuss a separate topic, which is table labeling.
You could create a table label key with customized rows and columns that suit the content of your document and assign label values to each table cell accordingly.
There are a few terms you need to know before start creating a label key of type "Table".
If the table you would like to label is with fixed numbers of row and column, it is suggested to use fixed table.
Otherwise, if the table you would like to label is with different row count from on document to another, for example, a list of purchase items, it is suggested to use dynamic table.
As discussed above, you could specify the "Sub type" to a label key of type "Field". With the similar concept, you cound specify "Sub type" to a row/column of a table.
If you choose header type as column, you could specify sub type for every column of the table you created, but not allowed to specify sub type for any row, and vice versa.
Click on the "+" button at the top of label pane and select table and select "Table". A "Create table field" modal will be displayed. Enter the name (label key) of this table, and select the table type, header type (if it is a fixed table). Then click "Create" button on the bottom-right corner of the modal.
To open the table label pane for a created label key with type of "Table", click on the table label key in label pane.
To close the table label pane, click on the "x" icon on the top-right corner of table label pane.
After you created a label key of type "Table", open the table label pane, the default content of the table label key should be displayed. Next we will introduce how you could modify the table into the one that suit your usage.
Click on the name of column or row and and select "Rename" in the option menu. Type in the new name then hit "Enter". (Hit "Esc" if you do not want to rename the column/row) A confirmation modal should be displayed. After clicking on "Yes", the column/row should be renamed.
For a fixed table, you can insert a column/row, whereas for a dynamic table, you can insert a column and add a row at the bottom of the table.
To insert a row/column, click on the name of column/row or and select "Insert" in menu options. Type in the name for the inserting column/row and hit "Enter". (Hit "Esc" if you do not want to insert a new column/row.)
To add a new row in dynamic table, simply click on the "+" icon at the bottom-left corner of the table.
For a fixed table, you can delete a column/row, whereas for a dynamic table, you can delete a column.
To delete a row/column, click on the name of column/row or and select "Delete" in menu options. A confirmation modal should be displayed. After clicking on "Yes", the column/row should be deleted.
For a fixed table with header type of column, you can assign sub type for each column, whereas for a fixed table with header type of row, you can assign sub type for each row. For a dynamic table, you can assign sub type for each column.
To assign sub type, click on the name of column/row and hover on "Sub type" in menu options. The sub type options should be displayed and you could select the sub type you would like to assign for this column/row.
For any table cell, you could assign the label value of with bounding boxes or a single draw region.
With the table label pane opened, specify the label value as described earlier with bounding boxes or drawn region, then click on the table cell you would like to assign the label value to.
To inspect a label value for a table cell, hover on the table cell you would like to inspect in table label pane. The corresponding label value will be highlighted with thicker border on your document.
After assigning label value to a table cell, similar to modifying label value of label key of type "Field", "Selection mark" and "Signature", you can modify the label value by either updating or deleting the label value.
If you have bounding boxes of value assigned to a table cell, you could select more bounding boxes and their value to this table cell.
If it is a table cell with value specified by bounding boxes and you are now specifying a value with region drawn, the original value from bounding boxes will be overwritten with the new drawing region.
If it is a table cell with value specified by region and you are now specifying a value with either bounding boxes or newly drawn region, the original drawn region will be overwritten with the new value of the new value from bounding box.
If you would like to delete the label value for a table cell, hover on the table cell and there should be a "x" icon displayed on the top-right corner of the table cell. Click on the icon to delete the label value for this table cell.
After finishing labeling for one document, you could now switch to another document and label the new document assigning label value to all the label keys you have created when labeling first document.
After finishing labeling all your documents. There should be labels.json
for each document you have labeled. labels.json
stores the coordinates of and the text value of each label as well as the corresponding label key for each label value.
Also, there should be a fields.json
file. Which store the definition of all the label keys you created for labeling.
SampleLabelingUX
│ README.md
│
└───Server
│ │
│ └───data
│ │ example1.pdf
│ │ example1.pdf.ocr.json
│ │ example1.pdf.labels.json
│ │ example2.pdf
│ │ example2.pdf.ocr.json
│ │ example2.pdf.labels.json
│ │ ...
│ │ fields.json
│
└───Client
With all labels.json
files and a fields.json
file, your label result will be stored completely. You can use these files to further train a model or start labeling with a different set of documents and visit this set of label result later again.
You can get the ocr.json
file for your document with two approaches:
-
Upload document to Form Recognizer Studio Layout and analyze the document, download the JSON file by clicking on the download button in result tab of analyze result pane (the pane on the right).
-
Use Form Recognizer SDKs to analyze document with Layout API.
After getting ocr.json
file for your document, it is important to rename the file with the naming convention in order for Labeling UX to correctly connect your document with its corresponding ocr.json
file.
The naming convention for ocr.json
file is <document name>.<document extension>.ocr.json
. For example, the name of ocr.json
file for document named invoiceA.pdf
should be invoiceA.pdf.ocr.json