-
Big table
GPU: 3080 x2
train set: train + val; eval set: test
cut size: (640, 640)
| detector | batch | lr | script | ||
|---|---|---|---|---|---|
| Faster RCNN | 4x2 | 0.04 |
73.44 |
60.78 |
exp/sh/Baseline_TinyPersonV2.sh:1.3 |
| RetinaNet | 4x2 | 0.04 |
72.75 |
59.45 |
exp/sh/Baseline_TinyPersonV2.sh:2.1 |
| Adap RetinaNet | 3x2 | 0.04 | 74.19 | 63.78 | exp/sh/Baseline_TinyPersonV2.sh:2.2 |
| FCOS | 4x2 | 0.04 |
65.08 |
54,37 |
exp/sh/Baseline_TinyPersonV2.sh:3.1 |
| Adap FCOS | 4x2 | 0.04 |
71.59 |
60.98 |
exp/sh/Baseline_TinyPersonV2.sh:3.2 |
| RepPoint | 4x2 | 0.04 |
73.47 |
60.79 |
exp/sh/Baseline_TinyPersonV2.sh:4.1 |
| Adap RepPoint | 4x2 | 0.04 |
75.27 |
64.56 |
exp/sh/Baseline_TinyPersonV2.sh:4.2 |
-
目錄結(jié)構(gòu)
數(shù)據(jù)集位置:~/dataset/tiny_set_v2
數(shù)據(jù)集來源:17suo(基礎(chǔ)) + tiny_set_v1
規(guī)則:模態(tài)/視頻號/圖片
劃分:10: 1 : 10(視頻號級別)
-
TinyPerson_v2(RGB)
Tab. 1. 目標(biāo)size ratio統(tǒng)計.
| dataset_v3.0.1 | absolute size | relative size | aspect ratio |
|---|---|---|---|
| TinyPerson_v2 (rgb) | 23.197±9.815 | 0.016±0.007 | 0.731±0.422 |
| TinyPerson_v2 (infra) | 12.614±4.817 | 0.017±0.006 | 0.827±0.422 |
| dataset_v2.0.1 | absolute size | relative size | aspect ratio |
|---|---|---|---|
| TinyPerson_v2 (rgb) | 22.692±12.117 | 0.016±0.008 | 0.746±0.509 |
| TinyPerson_v2 (rgb without ignore) | 22.619±10.849 | 0.016±0.007 | 0.723±0.424 |
| TinyPerson_v2 (xray) | 13.551±5.078 | 0.018±0.007 | 0.874±0.495 |
| TinyPerson_v2 (xray without ignore) | 13.615±5.099 | 0.018±0.007 | 0.862±0.481 |

TinyPerson
Tab. 2. 數(shù)據(jù)集圖片與標(biāo)注數(shù)量.
| TinyPerson_v3.0.1 | Train set | Valid set | Test set | sum |
|---|---|---|---|---|
| #image (rgb) | 4614 | 375 | 3559 | 8548 |
| #image (infra) | 4614 | 375 | 3559 | 8548 |
| #annotation (rgb) | 170383 | 16408 | 275486 | 462277 |
| #annotation (infra) | 172190 | 16447 | 228775 | 417412 |
| TinyPerson_v2.0.1 | Train set | Valid set | Test set | sum |
|---|---|---|---|---|
| #image (rgb) | 5711 | 568 | 5753 (5754?) | 12032 |
| #image (xray) | 8938 | 906 | 8950 | 18794 |
| #annotation (rgb) | 262063 | 42399 | 315165 | 619627 |
| #annotation (xray) | 390356 | 72525 | 291968 | 754849 |

TinyPerson
Fig. 分布.

TinyPerson



[Issue]
- issue: 無法算均值方差,存在annitation的bbox為nan
- issue: 復(fù)雜bug(7.9)>> [繼續(xù)train]
- issue: >> [繼續(xù)train]
--resume-from+{dir/}+epoch_9.pth
--resume-from ../TOV_mmdetection_cache/work_dir/${CONFIG}/trainval${WH[0]}x${WH[1]}_lr${LR}_clipg10_1x_b${B}${GPU}g/epoch_9.pth
- issue: >> [re-test]
evaluation.do_final_eval=True \
--resume-from+{dir/}+epoch_9.pth
evaluation.do_final_eval=True \
--resume-from ../TOV_mmdetection_cache/work_dir/${CONFIG}/trainval${WH[0]}x${WH[1]}_lr${LR}_1x_b${B}${GPU}g/epoch_12.pth
issue: >> [刪除無用Epoch]
$python exp/tools/clear_tmp_pth.py ../TOV_mmdetection_cache/work_dir/-
issue: 發(fā)散 >> [clip_g]
梯度過大,loss發(fā)散,但仍需訓(xùn)練。可改進(jìn)網(wǎng)路,增加梯度裁剪。
issue 1: loss不收斂

issue 2: loss NaN,程序退出
FCOS
"""
"configs2/TinyPersonV2/base/fcos_r50_caffe_fpn_gn-head_1x_TinyPersonV2_640.py"
"""
optimizer_config = dict( # wd: 改小grad_clip['max_norm']
_delete_=True, grad_clip=dict(max_norm=15, norm_type=2))
# optimizer_config = dict(
# _delete_=True, grad_clip=dict(max_norm=35, norm_type=2))
# learning policy
lr_config = dict(
warmup_iters=500,) # wd: 觀察500 iters
RetinaNet
"""
"configs2/TinyPersonV2/base/retinanet_r50_fpn_1x_TinyPersonV2_640.py"
"""
optimizer_config = dict( # wd: 改小grad_clip['max_norm']
_delete_=True, grad_clip=dict(max_norm=15, norm_type=2))
# optimizer_config = dict(
# _delete_=True, grad_clip=dict(max_norm=35, norm_type=2))
# learning policy
lr_config = dict(
warmup_iters=500,) # wd: 觀察500 iters
# xb exp
python exp/tools/semi_coarse_point_manager.py
python exp/tools/coarse_point_manager.py
