Notes: 谷歌deeplabv3+的代码现在已经开源,详见deeplab(Github),还有一个使用的demo样例。

0.

spatial pyramid pooling
  • probing the incoming features with filters or pooling operations at multiple rates and multiple effective fields-of-view
  • encode multi-scale contextual information
encode-decoder
  • gradually recovering the spatial information
  • capture sharper object boundaries
convolution
  • depthwise convolution: a spatial convolution performed independently over each channel of an input
  • pointwise convolution: a 1x1 convolution, projecting the channels output by the depthwise convolution onto a new channel space
  • differences:
    • Inception: 1x1 conv first
    • depthvise separable convolution: channel-wise first

details can be found here.

contributes
  1. decoder module: refine the segmentation results especially along object boundaries
  2. depth-wise separable convolution
    • Atrous Spatial Pyramid Pooling
    • decoder

3. Methods

  • capture multi-scale context
    Deeplabv3+ 阅读笔记

Deeplabv3+ 阅读笔记

Deeplabv3+ 阅读笔记

Deeplabv3+ 阅读笔记

相关文章:

  • 2021-12-06
  • 2021-09-02
  • 2021-12-02
  • 2021-05-15
  • 2021-12-27
  • 2021-05-29
猜你喜欢
  • 2021-09-26
  • 2021-06-12
  • 2021-11-04
  • 2022-12-23
  • 2021-12-22
相关资源
相似解决方案