Merge pull request PaddlePaddle#8066 from WenmuZhou/doc2

update PP-Structurev to PP-StructureV
honglili-ya · Oct 25, 2022 · 59b3ead · 59b3ead
2 parents 210135c + 6120094
commit 59b3ead
Show file tree

Hide file tree

Showing 40 changed files with 228 additions and 228 deletions.
diff --git a/StyleText/README.md b/StyleText/README.md
@@ -120,7 +120,7 @@ In actual application scenarios, it is often necessary to synthesize pictures in
      * `with_label`：Whether the `label_file` is label file list.
    * `CorpusGenerator`：
      * `method`：Method of CorpusGenerator，supports `FileCorpus` and `EnNumCorpus`. If `EnNumCorpus` is used，No other configuration is needed，otherwise you need to set `corpus_file` and `language`.
-     * `language`：Language of the corpus. Currently, the tool only supports English(en), Simplified Chinese(ch) and Korean(ko). 
+     * `language`：Language of the corpus. Currently, the tool only supports English(en), Simplified Chinese(ch) and Korean(ko).
      * `corpus_file`: Filepath of the corpus. Corpus file should be a text file which will be split by line-endings（'\n'）. Corpus generator samples one line each time.
 
 
@@ -171,9 +171,8 @@ After adding the above synthetic data for training, the accuracy of the recognit
 
 | Scenario | Characters | Raw Data | Test Data | Only Use Raw Data</br>Recognition Accuracy | New Synthetic Data | Simultaneous Use of Synthetic Data</br>Recognition Accuracy | Index Improvement |
 | -------- | ---------- | -------- | -------- | -------------------------- | ------------ | ---------------------- | -------- |
-| Metal surface | English and numbers | 2203     | 650      | 0.5938                     | 20000        | 0.7546                 | 16%      |
-| Random background | Korean       | 5631     | 1230     | 0.3012                     | 100000       | 0.5057                 | 20%      |
-
+| Metal surface | English and numbers | 2203     | 650      | 59.38%                     | 20000        | 75.46%                 | 16.08%      |
+| Random background | Korean       | 5631     | 1230     | 30.12%                     | 100000       | 50.57%                 | 20.45%      |
 
 <a name="Code_structure"></a>
 ### Code Structure

diff --git a/StyleText/README_ch.md b/StyleText/README_ch.md
@@ -156,8 +156,8 @@ python3 tools/synth_image.py -c configs/config.yml --style_image examples/style_
 
 | 场景     | 字符       | 原始数据 | 测试数据 | 只使用原始数据</br>识别准确率 | 新增合成数据 | 同时使用合成数据</br>识别准确率 | 指标提升 |
 | -------- | ---------- | -------- | -------- | -------------------------- | ------------ | ---------------------- | -------- |
-| 金属表面 | 英文和数字 | 2203     | 650      | 0.5938                     | 20000        | 0.7546                 | 16%      |
-| 随机背景 | 韩语       | 5631     | 1230     | 0.3012                     | 100000       | 0.5057                 | 20%      |
+| 金属表面 | 英文和数字 | 2203     | 650      | 59.38%                     | 20000        | 75.46%                 | 16.08%      |
+| 随机背景 | 韩语       | 5631     | 1230     | 30.12%                     | 100000       | 50.57%                 | 20.45%      |
 
 
 <a name="代码结构"></a>

diff --git a/applications/PCB字符识别/PCB字符识别.md b/applications/PCB字符识别/PCB字符识别.md
@@ -266,8 +266,8 @@ python3 tools/eval.py \
 | 序号 | 方案 | hmean  |  效果提升  |   实验分析  |
 | -------- | -------- | -------- | -------- | -------- |
 |   1 |  PP-OCRv3英文超轻量检测预训练模型   | 64.64%     |     -     |    提供的预训练模型具有泛化能力       |
-|   2 | PP-OCRv3英文超轻量检测预训练模型 + 验证集padding    |  72.13%  |+7.5% | padding可以提升尺寸较小图片的检测效果|
-|   3 | PP-OCRv3英文超轻量检测预训练模型  + fine-tune   | 100% |  +27.9%     | fine-tune会提升垂类场景效果 |
+|   2 | PP-OCRv3英文超轻量检测预训练模型 + 验证集padding    |  72.13%  |+7.49% | padding可以提升尺寸较小图片的检测效果|
+|   3 | PP-OCRv3英文超轻量检测预训练模型  + fine-tune   | 100.00% |  +27.87%     | fine-tune会提升垂类场景效果 |
 
 
 ```
@@ -420,9 +420,9 @@ python3 tools/eval.py \
 | 序号 | 方案 | acc    |  效果提升  |   实验分析  |
 | -------- | -------- | -------- | -------- | -------- |
 |   1 | PP-OCRv3中英文超轻量识别预训练模型直接评估 | 46.67%     |     -     |    提供的预训练模型具有泛化能力       |
-|   2 | PP-OCRv3中英文超轻量识别预训练模型 + fine-tune   |  42.02%  |-4.6% | 在数据量不足的情况，反而比预训练模型效果低(也可以通过调整超参数再试试)|
-|   3 | PP-OCRv3中英文超轻量识别预训练模型 + fine-tune + 公开通用识别数据集   | 77% |  +30%     | 在数据量不足的情况下，可以考虑补充公开数据训练 |
-|   4 | PP-OCRv3中英文超轻量识别预训练模型 + fine-tune + 增加PCB图像数量   | 99.99% |  +23%     | 如果能获取更多数据量的情况，可以通过增加数据量提升效果 |
+|   2 | PP-OCRv3中英文超轻量识别预训练模型 + fine-tune   |  42.02%  |-4.65% | 在数据量不足的情况，反而比预训练模型效果低(也可以通过调整超参数再试试)|
+|   3 | PP-OCRv3中英文超轻量识别预训练模型 + fine-tune + 公开通用识别数据集   | 77.00% |  +30.33%     | 在数据量不足的情况下，可以考虑补充公开数据训练 |
+|   4 | PP-OCRv3中英文超轻量识别预训练模型 + fine-tune + 增加PCB图像数量   | 99.99% |  +22.99%     | 如果能获取更多数据量的情况，可以通过增加数据量提升效果 |
 
 ```
 注：上述实验结果均是在1500张图片（1200张训练集，300张测试集）、2W张图片、添加公开通用识别数据集上训练、评估的得到，AIstudio只提供了100张数据，所以指标有所差异属于正常，只要策略有效、规律相同即可。
@@ -614,23 +614,23 @@ python3 tools/end2end/eval_end2end.py ./save_gt_label/ ./save_PPOCRV2_infer/
 | 序号 | 方案                                                     | hmean  | 效果提升 | 实验分析                              |
 | ---- | -------------------------------------------------------- | ------ | -------- | ------------------------------------- |
 | 1    | PP-OCRv3英文超轻量检测预训练模型直接评估                 | 64.64% | -        | 提供的预训练模型具有泛化能力          |
-| 2    | PP-OCRv3英文超轻量检测预训练模型 + 验证集padding直接评估 | 72.13% | +7.5%    | padding可以提升尺寸较小图片的检测效果 |
-| 3    | PP-OCRv3英文超轻量检测预训练模型  + fine-tune            | 100%   | +27.9%   | fine-tune会提升垂类场景效果           |
+| 2    | PP-OCRv3英文超轻量检测预训练模型 + 验证集padding直接评估 | 72.13% | +7.49%    | padding可以提升尺寸较小图片的检测效果 |
+| 3    | PP-OCRv3英文超轻量检测预训练模型  + fine-tune            | 100.00%   | +27.87%   | fine-tune会提升垂类场景效果           |
 
 * 识别
 
 | 序号 | 方案                                                         | acc    | 效果提升 | 实验分析                                                     |
 | ---- | ------------------------------------------------------------ | ------ | -------- | ------------------------------------------------------------ |
 | 1    | PP-OCRv3中英文超轻量识别预训练模型直接评估                   | 46.67% | -        | 提供的预训练模型具有泛化能力                                 |
-| 2    | PP-OCRv3中英文超轻量识别预训练模型 + fine-tune               | 42.02% | -4.6%    | 在数据量不足的情况，反而比预训练模型效果低(也可以通过调整超参数再试试) |
-| 3    | PP-OCRv3中英文超轻量识别预训练模型 + fine-tune + 公开通用识别数据集 | 77%    | +30%     | 在数据量不足的情况下，可以考虑补充公开数据训练               |
-| 4    | PP-OCRv3中英文超轻量识别预训练模型 + fine-tune + 增加PCB图像数量 | 99.99% | +23%     | 如果能获取更多数据量的情况，可以通过增加数据量提升效果       |
+| 2    | PP-OCRv3中英文超轻量识别预训练模型 + fine-tune               | 42.02% | -4.65%    | 在数据量不足的情况，反而比预训练模型效果低(也可以通过调整超参数再试试) |
+| 3    | PP-OCRv3中英文超轻量识别预训练模型 + fine-tune + 公开通用识别数据集 | 77.00%    | +30.33%     | 在数据量不足的情况下，可以考虑补充公开数据训练               |
+| 4    | PP-OCRv3中英文超轻量识别预训练模型 + fine-tune + 增加PCB图像数量 | 99.99% | +22.99%     | 如果能获取更多数据量的情况，可以通过增加数据量提升效果       |
 
 * 端到端
 
 | det                                           | rec                                                          | fmeasure |
 | --------------------------------------------- | ------------------------------------------------------------ | -------- |
-| PP-OCRv3英文超轻量检测预训练模型  + fine-tune | PP-OCRv3中英文超轻量识别预训练模型 + fine-tune + 增加PCB图像数量 | 93.3%    |
+| PP-OCRv3英文超轻量检测预训练模型  + fine-tune | PP-OCRv3中英文超轻量识别预训练模型 + fine-tune + 增加PCB图像数量 | 93.30%    |
 
 *结论*
 

diff --git a/applications/中文表格识别.md b/applications/中文表格识别.md
@@ -34,7 +34,7 @@
 ![](https://ai-studio-static-online.cdn.bcebos.com/5ffff2093a144a6993a75eef71634a52276015ee43a04566b9c89d353198c746)
 
 
-当前的表格识别算法不能很好的处理这些场景下的表格图像。在本例中，我们使用PP-Structurev2最新发布的表格识别模型SLANet来演示如何进行中文表格是识别。同时，为了方便作业流程，我们使用表格属性识别模型对表格图像的属性进行识别，对表格的难易程度进行判断，加快人工进行校对速度。
+当前的表格识别算法不能很好的处理这些场景下的表格图像。在本例中，我们使用PP-StructureV2最新发布的表格识别模型SLANet来演示如何进行中文表格是识别。同时，为了方便作业流程，我们使用表格属性识别模型对表格图像的属性进行识别，对表格的难易程度进行判断，加快人工进行校对速度。
 
 本项目AI Studio链接：https://aistudio.baidu.com/aistudio/projectdetail/4588067
 
@@ -192,14 +192,14 @@ plt.show()
 
 ### 2.3 训练
 
-这里选用PP-Structurev2中的表格识别模型[SLANet](https://github.com/PaddlePaddle/PaddleOCR/blob/dygraph/configs/table/SLANet.yml)
+这里选用PP-StructureV2中的表格识别模型[SLANet](https://github.com/PaddlePaddle/PaddleOCR/blob/dygraph/configs/table/SLANet.yml)
 
-SLANet是PP-Structurev2全新推出的表格识别模型，相比PP-Structurev1中TableRec-RARE，在速度不变的情况下精度提升4.7%。TEDS提升2%
+SLANet是PP-StructureV2全新推出的表格识别模型，相比PP-StructureV1中TableRec-RARE，在速度不变的情况下精度提升4.7%。TEDS提升2%
 
 
 |算法|Acc|[TEDS(Tree-Edit-Distance-based Similarity)](https://github.com/ibm-aur-nlp/PubTabNet/tree/master/src)|Speed|
 | --- | --- | --- | ---|
-| EDD<sup>[2]</sup> |x| 88.3% |x|
+| EDD<sup>[2]</sup> |x| 88.30% |x|
 | TableRec-RARE(ours) | 71.73%| 93.88% |779ms|
 | SLANet(ours) | 76.31%|    95.89%|766ms|
 

diff --git a/applications/光功率计数码管字符识别/光功率计数码管字符识别.md b/applications/光功率计数码管字符识别/光功率计数码管字符识别.md
@@ -182,15 +182,15 @@ PPOCRLabel是一款适用于OCR领域的半自动化图形标注工具，内置P
 
 | ID | 策略 |  模型大小 | 精度 | 预测耗时（CPU + MKLDNN)|
 |-----|-----|--------|----| --- |
-| 01 | PP-OCRv2 | 8M | 74.8% | 8.54ms |
-| 02 | SVTR_Tiny | 21M | 80.1% | 97ms |
-| 03 | SVTR_LCNet(h32) | 12M | 71.9% | 6.6ms |
-| 04 | SVTR_LCNet(h48) | 12M | 73.98% | 7.6ms |
-| 05 | + GTC | 12M | 75.8% | 7.6ms |
-| 06 | + TextConAug | 12M | 76.3% | 7.6ms |
-| 07 | + TextRotNet | 12M | 76.9% | 7.6ms |
-| 08 | + UDML | 12M | 78.4% | 7.6ms |
-| 09 | + UIM | 12M | 79.4% | 7.6ms |
+| 01 | PP-OCRv2 | 8M | 74.80% | 8.54ms |
+| 02 | SVTR_Tiny | 21M | 80.10% | 97.00ms |
+| 03 | SVTR_LCNet(h32) | 12M | 71.90% | 6.60ms |
+| 04 | SVTR_LCNet(h48) | 12M | 73.98% | 7.60ms |
+| 05 | + GTC | 12M | 75.80% | 7.60ms |
+| 06 | + TextConAug | 12M | 76.30% | 7.60ms |
+| 07 | + TextRotNet | 12M | 76.90% | 7.60ms |
+| 08 | + UDML | 12M | 78.40% | 7.60ms |
+| 09 | + UIM | 12M | 79.40% | 7.60ms |
 
 
 ### 3.3 开始训练

diff --git a/applications/印章弯曲文字识别.md b/applications/印章弯曲文字识别.md
@@ -30,9 +30,9 @@
 
 | 任务 | 训练数据数量 | 精度 |
 | -------- | - | -------- |
-| 印章检测 | 1000    | 95%  |
-| 印章文字识别-端对端OCR方法 | 700    | 47%  |
-| 印章文字识别-两阶段OCR方法 |  700   | 55%  |
+| 印章检测 | 1000    | 95.00%  |
+| 印章文字识别-端对端OCR方法 | 700    | 47.00%  |
+| 印章文字识别-两阶段OCR方法 |  700   | 55.00%  |
 
 点击进入 [AI Studio 项目](https://aistudio.baidu.com/aistudio/projectdetail/4586113)
 

diff --git a/applications/发票关键信息抽取.md b/applications/发票关键信息抽取.md
@@ -145,8 +145,8 @@ LayoutXLM与VI-LayoutXLM针对该场景的训练结果如下所示。
 
 | 模型 | 迭代轮数 | Hmean |
 | :---: | :---: | :---: |
-| LayoutXLM | 50 | 100% |
-| VI-LayoutXLM | 50 | 100% |
+| LayoutXLM | 50 | 100.00% |
+| VI-LayoutXLM | 50 | 100.00% |
 
 可以看出，由于当前数据量较少，场景比较简单，因此2个模型的Hmean均达到了100%。
 
@@ -274,8 +274,8 @@ LayoutXLM与VI-LayoutXLM针对该场景的训练结果如下所示。
 
 | 模型 | 迭代轮数 | Hmean |
 | :---: | :---: | :---: |
-| LayoutXLM | 50 | 98.0% |
-| VI-LayoutXLM | 50 | 99.3% |
+| LayoutXLM | 50 | 98.00% |
+| VI-LayoutXLM | 50 | 99.30% |
 
 可以看出，对于VI-LayoutXLM相比LayoutXLM的Hmean高了1.3%。
 

diff --git a/applications/液晶屏读数识别.md b/applications/液晶屏读数识别.md
@@ -110,7 +110,7 @@ python tools/eval.py -c configs/det/ch_PP-OCRv3/ch_PP-OCRv3_det_cml.yml -o Globa
 
 |   | 方案                        |hmeans|
 |---|---------------------------|---|
-| 0 | PP-OCRv3中英文超轻量检测预训练模型直接预测 |47.5%|
+| 0 | PP-OCRv3中英文超轻量检测预训练模型直接预测 |47.50%|
 
 #### 4.3.2 预训练模型直接finetune
 ##### 修改配置文件
@@ -143,8 +143,8 @@ python tools/eval.py -c configs/det/ch_PP-OCRv3/ch_PP-OCRv3_det_cml.yml -o Globa
 结果如下：
 |   | 方案                        |hmeans|
 |---|---------------------------|---|
-| 0 | PP-OCRv3中英文超轻量检测预训练模型直接预测 |47.5%|
-| 1 | PP-OCRv3中英文超轻量检测预训练模型fintune |65.2%|
+| 0 | PP-OCRv3中英文超轻量检测预训练模型直接预测 |47.50%|
+| 1 | PP-OCRv3中英文超轻量检测预训练模型fintune |65.20%|
 
 #### 4.3.3 基于预训练模型Finetune_student模型
 
@@ -175,9 +175,9 @@ python tools/eval.py -c configs/det/ch_PP-OCRv3/ch_PP-OCRv3_det_student.yml -o G
 结果如下：
 |   | 方案                        |hmeans|
 |---|---------------------------|---|
-| 0 | PP-OCRv3中英文超轻量检测预训练模型直接预测 |47.5%|
-| 1 | PP-OCRv3中英文超轻量检测预训练模型fintune |65.2%|
-| 2 | PP-OCRv3中英文超轻量检测预训练模型fintune学生模型 |80.0%|
+| 0 | PP-OCRv3中英文超轻量检测预训练模型直接预测 |47.50%|
+| 1 | PP-OCRv3中英文超轻量检测预训练模型fintune |65.20%|
+| 2 | PP-OCRv3中英文超轻量检测预训练模型fintune学生模型 |80.00%|
 
 #### 4.3.4 基于预训练模型Finetune_teacher模型
 
@@ -233,10 +233,10 @@ python tools/eval.py -c configs/det/ch_PP-OCRv3/ch_PP-OCRv3_det_dml.yml -o Globa
 结果如下：
 |   | 方案                        |hmeans|
 |---|---------------------------|---|
-| 0 | PP-OCRv3中英文超轻量检测预训练模型直接预测 |47.5%|
-| 1 | PP-OCRv3中英文超轻量检测预训练模型fintune |65.2%|
-| 2 | PP-OCRv3中英文超轻量检测预训练模型fintune学生模型 |80.0%|
-| 3 | PP-OCRv3中英文超轻量检测预训练模型fintune教师模型 |84.8%|
+| 0 | PP-OCRv3中英文超轻量检测预训练模型直接预测 |47.50%|
+| 1 | PP-OCRv3中英文超轻量检测预训练模型fintune |65.20%|
+| 2 | PP-OCRv3中英文超轻量检测预训练模型fintune学生模型 |80.00%|
+| 3 | PP-OCRv3中英文超轻量检测预训练模型fintune教师模型 |84.80%|
 
 #### 4.3.5 采用CML蒸馏进一步提升student模型精度
 
@@ -294,11 +294,11 @@ python tools/eval.py -c configs/det/ch_PP-OCRv3/ch_PP-OCRv3_det_cml.yml -o Globa
 结果如下：
 |   | 方案                        |hmeans|
 |---|---------------------------|---|
-| 0 | PP-OCRv3中英文超轻量检测预训练模型直接预测 |47.5%|
-| 1 | PP-OCRv3中英文超轻量检测预训练模型fintune |65.2%|
-| 2 | PP-OCRv3中英文超轻量检测预训练模型fintune学生模型 |80.0%|
-| 3 | PP-OCRv3中英文超轻量检测预训练模型fintune教师模型 |84.8%|
-| 4 | 基于2和3训练好的模型fintune |82.7%|
+| 0 | PP-OCRv3中英文超轻量检测预训练模型直接预测 |47.50%|
+| 1 | PP-OCRv3中英文超轻量检测预训练模型fintune |65.20%|
+| 2 | PP-OCRv3中英文超轻量检测预训练模型fintune学生模型 |80.00%|
+| 3 | PP-OCRv3中英文超轻量检测预训练模型fintune教师模型 |84.80%|
+| 4 | 基于2和3训练好的模型fintune |82.70%|
 
 如需获取已训练模型，请扫码填写问卷，加入PaddleOCR官方交流群获取全部OCR垂类模型下载链接、《动手学OCR》电子书等全套OCR学习资料🎁
 <div align="left">
@@ -445,7 +445,7 @@ python tools/eval.py -c configs/rec/PP-OCRv3/ch_PP-OCRv3_rec_distillation.yml -o
 结果如下：
 |   | 方案                        |accuracy|
 |---|---------------------------|---|
-| 0 | PP-OCRv3中英文超轻量识别预训练模型直接预测 |70.4%|
+| 0 | PP-OCRv3中英文超轻量识别预训练模型直接预测 |70.40%|
 
 ####  开始训练
 我们使用上面修改好的配置文件configs/rec/PP-OCRv3/ch_PP-OCRv3_rec_distillation.yml，预训练模型，数据集路径，学习率，训练轮数等都已经设置完毕后，可以使用下面命令开始训练。
@@ -465,8 +465,8 @@ python tools/eval.py -c configs/rec/PP-OCRv3/ch_PP-OCRv3_rec_distillation.yml -o
 结果如下：
 |   | 方案                        |accuracy|
 |---|---------------------------|---|
-| 0 | PP-OCRv3中英文超轻量识别预训练模型直接预测 |70.4%|
-| 1 | PP-OCRv3中英文超轻量识别预训练模型finetune |82.2%|
+| 0 | PP-OCRv3中英文超轻量识别预训练模型直接预测 |70.40%|
+| 1 | PP-OCRv3中英文超轻量识别预训练模型finetune |82.20%|
 
 如需获取已训练模型，请扫码填写问卷，加入PaddleOCR官方交流群获取全部OCR垂类模型下载链接、《动手学OCR》电子书等全套OCR学习资料🎁
 <div align="left">