Pedestrian Detection--What Can Help Pedestrian Detection?

What Can Help Pedestrian Detection?
CVPR 2017

This paper analyzes how extra features help CNN-based pedestrian detection, and designs a pedestrian detection network, HyperLearner, that makes effective use of these extra features.

Here, extra features mainly refers to various channel features.

What problems exist in pedestrian detection?
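First, at low resolution, background and small pedestrians are hard to tell apart. Second, when several people stand close together, CNN-based detection localizes each individual pedestrian with relatively large error.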

Here the channel features are integrated into Faster R-CNN.

3.2. Introduction to channel features

Channel features fall into three groups: apparent-to-semantic channels, temporal channels, and depth channels.
1) Apparent-to-semantic channels mainly include the ICF channel [10] (colors and gradients), the edge channel, the segmentation channel (FCN), and the heatmap channel (obtained by blurring the segmentation channel). These features range from low-level apparent to high-level semantic.
2) Temporal channels include optical flow [1] and motion [29].
3) Depth channels: the disparity channel.
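As a rough illustration of the heatmap channel, which the list above describes as a blurred segmentation channel, here is a minimal pure-Python sketch. It makes several assumptions: the blur is a simple box (mean) filter, and the function name `box_blur`, the `radius` parameter, and the toy mask are all hypothetical. The notes do not say which blur kernel the paper uses.

```python
def box_blur(mask, radius=1):
    """Mean-filter a 2D list of numbers; windows are clipped at the border.

    Assumption: a box filter stands in for whatever blur the paper applies.
    """
    h, w = len(mask), len(mask[0])
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            ys = range(max(0, y - radius), min(h, y + radius + 1))
            xs = range(max(0, x - radius), min(w, x + radius + 1))
            total = sum(mask[j][i] for j in ys for i in xs)
            out[y][x] = total / (len(ys) * len(xs))
    return out


# Toy 5x5 binary segmentation mask: 1 marks "pedestrian" pixels (made-up data).
seg = [
    [0, 0, 0, 0, 0],
    [0, 1, 1, 0, 0],
    [0, 1, 1, 0, 0],
    [0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0],
]

# The blurred mask is a soft heatmap: high inside the region, fading outside.
heatmap = box_blur(seg, radius=1)
```

The result keeps the coarse location of the segmented region but discards its exact boundary, which is the sense in which the heatmap channel is a softer version of the segmentation channel.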

Among the channel features compared, segmentation information helps pedestrian detection the most.

The HyperLearner network
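How is the aggregated activation map obtained? Feature maps of different sizes from the body network are normalized to the same size and the same number of channels, then combined into the aggregated activation map. This map is the input to the channel feature network (CFN), which produces the channel feature. The final output of the body network, together with the aggregated activation map, serves as the input to the region proposal network (RPN) and Fast R-CNN (FRCNN).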
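The aggregation step can be sketched in pure Python as follows. This is only an illustration under stated assumptions, not the paper's implementation: nearest-neighbour resizing stands in for whatever up/down-sampling the network uses, a fixed 1x1 projection stands in for a learned layer that equalizes channel counts, "combined" is read as channel-wise concatenation, and all names (`resize_nearest`, `project_channels`, `aggregate`) and toy values are hypothetical.

```python
def resize_nearest(fmap, out_h, out_w):
    """Nearest-neighbour resize of a feature map stored as [C][H][W] lists."""
    in_h, in_w = len(fmap[0]), len(fmap[0][0])
    return [
        [[ch[y * in_h // out_h][x * in_w // out_w] for x in range(out_w)]
         for y in range(out_h)]
        for ch in fmap
    ]


def project_channels(fmap, weights):
    """1x1 projection; weights is [C_out][C_in] (assumed stand-in for a learned layer)."""
    h, w = len(fmap[0]), len(fmap[0][0])
    return [
        [[sum(wk[c] * fmap[c][y][x] for c in range(len(fmap))) for x in range(w)]
         for y in range(h)]
        for wk in weights
    ]


def aggregate(fmaps, weights_per_map, out_h, out_w):
    """Bring every map to the same size and channel count, then concatenate."""
    aggregated = []
    for fmap, weights in zip(fmaps, weights_per_map):
        same_size = resize_nearest(fmap, out_h, out_w)
        aggregated.extend(project_channels(same_size, weights))  # channel concat
    return aggregated


# Toy inputs: a fine 2-channel 4x4 map and a coarse 3-channel 2x2 map.
fine = [[[1.0] * 4 for _ in range(4)], [[2.0] * 4 for _ in range(4)]]
coarse = [[[c * 10.0 + 0, c * 10.0 + 1], [c * 10.0 + 2, c * 10.0 + 3]]
          for c in range(3)]

# Each map is projected to 2 channels, so the result has 2 + 2 = 4 channels.
w_fine = [[1.0, 0.0], [0.0, 1.0]]
w_coarse = [[1.0, 0.0, 0.0], [0.0, 1.0, 1.0]]
agg = aggregate([fine, coarse], [w_fine, w_coarse], out_h=4, out_w=4)
```

In this toy setup `agg` has shape [4][4][4]: two channels from the fine map and two from the upsampled coarse map, all on a common 4x4 grid.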


Caltech test set