GitHub - ethanlighter/crnn_ctc

Branches Tags

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
__pycache__		__pycache__
dataset		dataset
dicts		dicts
log		log
model		model
tools		tools
LICENSE		LICENSE
config.py		config.py
main.py		main.py
readme.txt		readme.txt

Repository files navigation

本项目基于tensorflow2.3.1构建了crnn+ctc文字识别模型
crnn模型采用了resnet18+lstm，backbone可选(resnet18或vggnet)
实现功能：
1 不定长文字训练和识别
2 tensorflow内嵌tensorrt加速
3 bacth数据文字识别(待更新)
-----------------------------------------------------------------
参数配置：
config.py
--base_dir 设置为项目所在路径
--train_anno 训练文件路径
--test_anno 测试文件路径
--img_root 图片文件路径
--save_model_path 训练模型保存路径
--pre_weight 预训练文件路径
--dict_file_path 字典文件路径
其它参数可保持不变
------------------------------------------------------------------
数据格式:
  图片相对于img_root的路径\t标签内容
exmaple：
  /line1/img_203.jpg\t天气真好
------------------------------------------------------------------
训练:
python main.py --type train
测试:
python main.py --type test --model_file {模型参数文件路径}
demo:
python main.py --model_file {模型文件路径} --img_path {图片路径}
--------------------------------------------------------------------
您也可以直接下载已经训练好的中文通用文字识别模型，下载地址 ： 链接: https://pan.baidu.com/s/1mcxZmvzABE-L-4CRRUc85g  密码: v1cl
下载好之后修改demo.py中的模型路径，即可使用该模型，该模型使用的字典为dict文件夹下的ppocr_keys_v1，该字典来源于百度paddleocr中文字典