Cross-Modal Alignment

Cross-Modal Alignment 论文阅读

原文：Zhu Y , Xu Y , Ni B , et al. Enhancing pulmonary nodule detection via cross-modal alignment[C]// Visual Communications & Image Processing. IEEE, 2018.

简介

问题

在肺结节检测中，医学影像标注数据稀少。
由于成像设备间的差异造成数据间存在差异性。

创新点

本文提出一种跨模态方案，通过模态对齐实现通道聚合检测。

Cycle-GAN：用于迁移结节形态学特征，即合成结节图像；
目标函数：用于评估Cycle-GAN和结节检测的性能；
融合方法：用于结节分类。

主要内容

通道聚合检测器
Cycle-GAN
结节评估

Cross-Modal Alignment

结节检测

将范围为-1400_{200的HU值图像转换为0}255的灰度图；
采用HOG在六个方向上提取图像特征；
聚合所有特征通道构建结节检测器；
根据置信度评估候选结节。

假阳性率偏高。作者使用了两部分数据集，LUNA16数据集和ChestHosp（上海市胸科医院）数据集。两者由于成像设备原因存在数据差异性，严重影响了结节检测的精度。

跨模态对齐

在Cycle-GAN模型中只用于合成结节图像，因此作者认为数据差异性造成的影响可忽略。

source model A
target model B
negative model C

目标函数

映射关系：
- $G: A \to B$
- $G_i: B \to A$
辨别器： $D_A, D_B, D$

对抗损失（Adversarial loss）

$\mathcal{L}_{GAN}(G_i,D_B, A, B)$
$\mathcal{L}_{GAN}(G_i,D_A, B, A)$

$D_A$ ： $D_A$ 输出数据来自A而不是B的概率。
$D_B$ ： $D_B$ 输出数据来自B而不是A的概率。

循环一致性损失（Cycle consistency loss）

Because the mapping is highly under-constrained, a network can map the same set of source images to any permutation of images in the target modal.

为了减少映射空间，引入循环一致性（Cycle-Consistency）。

$\mathcal{L}_{cyc}(G, G_i) =\mathbb{E}_{a \sim A}\left[\|G_i(G(a))-a\|_{1}\right] +\mathbb{E}_{b \sim B}\left[\|G(G_i(b))-b\|_{1}\right]$

其中， $\|\cdot\|_1$ 表示 $L1$ 正则。

辅助损失（Auxiliary loss）

G的输出结果可能偏离理论数据的分布；
G的输出结果中的非结节数据难以与结节数据区分。

以上两点易将结节图像迁移至数据集C，因此引入一个辅助辨别器D，用于区分数据属于B还是C。

$\begin{aligned} \mathcal{L}_{\text {aux}}(G, D) =\mathbb{E}_{b \sim B}[\log (1-D(b))]+\mathbb{E}_{c \sim C}[\log D(c)] +\lambda_{1} \mathbb{E}_{a \sim A}[\log (1-D(G(a)))] \end{aligned}$

总损失

$\begin{aligned} G^{*}, D^{*}&=\arg \min _{G, G_i, D} \max _{D_{A}, D_{B}} \mathcal{L}\left(G, G_i, D, D_{A}, D_{B}\right) \\ &=\arg \min _{G, G_i, D} \max _{D_{A}, D_{B}} [\mathcal{L}_{GAN}\left(G, D_{B}, A, B\right) +\mathcal{L}_{GAN}\left(G_i, D_{A}, B, A\right) +\mathcal{L}_{aux}(G, D)+\lambda_{2} \mathcal{L}_{cyc}(G, G_i)] \end{aligned}$

融合阶段

降低假阳性率，重用辅助辨别器D用于评估候选结节。

single vote: D的输出概率
weight vote： $s=s_2 \cdot exp(s_1)$ ，其中 $s_2$ 为原始得分
bias vote： $s=s_2 + \lambda \cdot exp(s_1)$