Spatial pyramid pooling in deep convolutional networks for visual recognition

说明:compute the feature maps from the entire image only once, and then pool features in arbitrary regions (sub-images) to generate fixed-length representations for training the detectors.
Object Detection《SSP-Net》笔记(2)
Object Detection《SSP-Net》笔记(2)

Introduction

直接输入整图,所有区域共享卷积计算;在Conv5层输出上提取所有区域的特征
spatial pyramid pooling (SPP)【空间金字塔池化】
1,The SPP layer pools the features and generates fixed length outputs, which are then fed into the fully connected layers (or other classifiers)。[替换Conv5的Pooling层]
Object Detection《SSP-Net》笔记(2)

Training the Network

1,Single-size training:
a fixed-size input (224×224) cropped。
Object Detection《SSP-Net》笔记(2)
2,Multi-size training
We consider two sizes: 180×180 in addition to 224×224. Rather than crop a smaller 180×180
region, we resize the aforementioned 224×224 region to 180×180.
Object Detection《SSP-Net》笔记(2)
Object Detection《SSP-Net》笔记(2)

问题

继承了R-CNN的剩余问题:需要存储大量特征+复杂的多阶段训练+训练时间仍然长
新问题:SPP层之前的所有卷积层参数不能finetune

SPP-Net概览

Object Detection《SSP-Net》笔记(2)

相关文章:

  • 2021-08-27
  • 2021-10-31
  • 2022-01-06
  • 2021-08-20
  • 2021-07-17
  • 2021-06-23
  • 2021-05-25
  • 2021-12-15
猜你喜欢
  • 2021-08-19
  • 2021-04-16
  • 2021-07-30
  • 2022-01-02
  • 2021-04-09
  • 2021-11-27
  • 2021-11-07
相关资源
相似解决方案