[论文阅读笔记] U-Net

2018-8-7, jingwenlai

# 何为UNet?

[论文阅读笔记] U-Net

基于FPN，修改并扩展，加入了同级连接形成UNet，含有更高层语义信息，可有更大的感受野及更好的精度。

此外，通过overlap-tile strategy使得可以无缝切割任意大的图像。

[论文阅读笔记] U-Net

针对细胞粘连的挑战，引入weighted loss,对边界赋予更大的权重。

[论文阅读笔记] U-Net

## 网络结构

如上图1所示，输入为偶数，需要留意的是the concatenation with the correspondingly cropped feature map from teh contracting path. The cropping is necessary due to the loss of border pixels in every convolution.

## 训练

1. favor large input tiles over a large batch size , batch_size=1 (to minimize the overhead and make maximum use of the GPUmemory)

2. high momentum(0.99) [such that a large number of the previously seen training samples determine the update in the curent optimization step]

3. Energy function is computed by a pixel-wise soft-max over the final feature map combined with the cross entropy loss function. 这一段还需要好好再读下。(权重的设计也是本篇出彩的地方，可以结合代码再细读下)

### Data Augmentation

医学数据样本均较少，除传统的shift/rotation/gray value variations之外，还引入一种deformation augmentation: We generate smooth deformations using random displacement vectors on a coarse 3 by 3 grid. The displacements are sampled from a Gaussian distribution with 10 pixels standard deviation. Per-pixel displacements are then computed using bicubic interpolation. [较独特的data augmentation方法]

## 结果

针对医学图像，取得了很好的结果

[论文阅读笔记] U-Net

原始的caffe实现代码可在https://lmb.informatik.uni-freiburg.de/people/ronneber/u-net/下载，

仅需要很少的标注数据及很少的训练时间，即达到了前述效果。

作者也相信在其它任务中，U-Net也会有很好的表现。