Conformer/mmdetection at main · wozaimoyu/Conformer

Notice

This code is forked from the official mmdetection project, so basic installation and usage instructions can be found in get_started.md. The only addition is Conformer as a backbone, implemented in mmdet/models/backbones/Conformer.py.

At present, we feed the feature maps from the different stages of the CNN branch into the FPN, so Conformer can be plugged directly into any detection algorithm built on a feature pyramid. How to use the features of the Transformer branch for detection is also an interesting open problem.
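As a rough illustration of the wiring described above, an mmdetection-style config pairs the Conformer backbone with an FPN neck. This is a minimal sketch rather than the shipped config: the registry name `Conformer` and the `in_channels` values are assumptions made for illustration, so check the files in configs/ for the real settings. Only `pretrained` and `patch_size` are taken from the training command in this README.

```python
# Hypothetical config sketch: values marked "assumed" are illustrative only.
model = dict(
    # Pretrained classification weights (path as used in the training command).
    pretrained='./pretrain_models/Conformer_small_patch32.pth',
    backbone=dict(
        type='Conformer',  # assumed registry name of the added backbone
        patch_size=32,     # matches model.backbone.patch_size=32
    ),
    neck=dict(
        type='FPN',
        # Assumed channel widths of the CNN-branch stages fed into the FPN,
        # one entry per stage; the real values depend on the Conformer variant.
        in_channels=[256, 512, 1024, 1024],
        out_channels=256,
        num_outs=5,
    ),
)
```

The key constraint is that `in_channels` lists one entry per CNN-branch feature map handed to the FPN, in the order the backbone returns them.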

Training and inference under different detection algorithms

We provide some config files in configs/. Conformer can replace the backbone of any existing detection algorithm. We take Faster R-CNN as an example to illustrate how to perform training and inference:

export CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7
export OMP_NUM_THREADS=1
GPU_NUM=8

CONFIG="./configs/faster_rcnn/faster_rcnn_conformer_small_patch32_fpn_1x_coco.py"
WORK_DIR='./work_dir/faster_rcnn_conformer_small_patch32_lr_1e_4_fpn_1x_coco_1344_800'

# Train
python -m torch.distributed.launch --nproc_per_node=${GPU_NUM} --master_port=50040 --use_env ./tools/train.py ${CONFIG} --work-dir ${WORK_DIR} --gpus ${GPU_NUM}  --launcher pytorch --cfg-options model.pretrained='./pretrain_models/Conformer_small_patch32.pth' model.backbone.patch_size=32

# Test on multiple cards
python -m torch.distributed.launch --nproc_per_node=${GPU_NUM} --master_port=50040 --use_env ./tools/test.py ${CONFIG} ${WORK_DIR}/latest.pth --launcher pytorch  --eval bbox

# Test on single card
# python ./tools/test.py ${CONFIG} ${WORK_DIR}/latest.pth --eval bbox
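The `--cfg-options` flag in the training command overrides nested config entries using dotted keys such as `model.backbone.patch_size=32`. The sketch below shows the idea in plain Python. It is not mmcv's implementation: the helper `apply_cfg_options` and the starting values in `base` are hypothetical and exist only for illustration.

```python
import copy


def apply_cfg_options(cfg, options):
    """Return a copy of cfg with dotted-key overrides applied (illustrative helper)."""
    merged = copy.deepcopy(cfg)
    for dotted_key, value in options.items():
        node = merged
        *parents, leaf = dotted_key.split('.')
        # Walk down to the parent dict, creating intermediate levels if missing.
        for name in parents:
            node = node.setdefault(name, {})
        node[leaf] = value
    return merged


# Hypothetical base config; the real one lives in configs/faster_rcnn/.
base = {
    'model': {
        'pretrained': None,
        'backbone': {'type': 'Conformer', 'patch_size': 16},
    }
}

# The same overrides that the training command passes via --cfg-options.
overrides = {
    'model.pretrained': './pretrain_models/Conformer_small_patch32.pth',
    'model.backbone.patch_size': 32,
}

cfg = apply_cfg_options(base, overrides)
print(cfg['model']['backbone']['patch_size'])  # 32
```

Overriding on the command line leaves the config file itself unchanged, which makes it easy to try a different pretrained checkpoint or patch size without editing configs/.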

The commands above use Conformer_small_patch32 as the backbone network; its pretrained weights can be downloaded from baidu (k7q5) or google drive. The results are as follows:

Method	Parameters	MACs	FPS	Bbox mAP	Model link	Log link
Faster R-CNN	55.4 M	288.4 G	13.5	43.1	baidu(7ax9) google	baidu(ymv4)
Mask R-CNN	58.1 M	341.4 G	10.9	43.6	baidu(qkwq) google	baidu(gh2v)
PAA (1x single scale)	-	-	-	46.5	(coming soon)	-
Cascade Mask RCNN (1x single scale)	-	-	-	47.3	(coming soon)	-

Updated Detection Performance

Method	Schedule	Parameters	MACs	FPS	Bbox mAP	Segm mAP
Faster R-CNN	1x	55.4 M	288.4 G	13.5	43.7	-
Faster R-CNN	3x	55.4 M	288.4 G	13.5	46.1	-
