【李宏毅2020 ML/DL】补充：Structured Learning: Introduction & Structured Linear Model

我已经有两年 ML 经历，这系列课主要用来查缺补漏，会记录一些细节的、自己不知道的东西。

本次笔记补充视频 BV1JE411g7XF 的缺失部分。在另一个UP主上传的2017课程BV13x411v7US中可找到。本节内容 65 分钟左右。

本节内容综述

介绍了 Structured Learning 及其框架。
要解这个框架，需要解决三个问题。
接着，进入 Structured Linear Model 部分；我们先从这三个问题入手，如何解这三个问题？哪一个最难？

Structured Learning

Example Application
Unified Framework

Object Detection
Statistics

3 Problems

Problem 1: Evaluation
Problem 2: Inference
Problem 3: Training
Link to DNN?

Structured Linear Model

Problem 1
Problem 2&3
Algorithm

小细节

Structured Learning

【李宏毅2020 ML/DL】补充：Structured Learning: Introduction & Structured Linear Model
如上，对于 Structured Learning 来讲，我们输入输出不仅仅理解为一个“向量”，可以理解为一个 object。

Example Application

【李宏毅2020 ML/DL】补充：Structured Learning: Introduction & Structured Linear Model
如上，Structured Learning 其实比较有用武之地。

Unified Framework

【李宏毅2020 ML/DL】补充：Structured Learning: Introduction & Structured Linear Model
如上，在Structured Learning的训练中，我们训练的是 $F$ ，用于衡量 $X, Y$ 的匹配程度。

Object Detection

【李宏毅2020 ML/DL】补充：Structured Learning: Introduction & Structured Linear Model
如上，目标检测中可能使用了Structured Learning的思想。

具体怎么做呢？
【李宏毅2020 ML/DL】补充：Structured Learning: Introduction & Structured Linear Model
如上，很直观了。

在测试时，穷举可能的红框，输出分数最高的。

Statistics

【李宏毅2020 ML/DL】补充：Structured Learning: Introduction & Structured Linear Model
如上，其在统计/几率意义上的框架如上。 $F$ 与 $P$ 其实是一样的事情。

使用几率其实是有坏处的：

几率不能解释一切；
0-1的限制并不必要。

但是也有好处，比如可理解。

此外，有人提出 Energy-based Model 。

3 Problems

Problem 1: Evaluation

【李宏毅2020 ML/DL】补充：Structured Learning: Introduction & Structured Linear Model
如上，我们的 F 应该长什么样呢？

Problem 2: Inference

【李宏毅2020 ML/DL】补充：Structured Learning: Introduction & Structured Linear Model
如上，如何解 argmax 的问题？穷举合理吗？

Problem 3: Training

【李宏毅2020 ML/DL】补充：Structured Learning: Introduction & Structured Linear Model
第三个问题就是，如何有效训练？

Link to DNN?

【李宏毅2020 ML/DL】补充：Structured Learning: Introduction & Structured Linear Model
如上，似乎可以直观地用 DNN 做。

Structured Linear Model

【李宏毅2020 ML/DL】补充：Structured Learning: Introduction & Structured Linear Model
如上，我们认为第一个最难。因为如果第一个解决了，似乎第三个也迎刃而解了。

Problem 1

【李宏毅2020 ML/DL】补充：Structured Learning: Introduction & Structured Linear Model
如上，我们的框架，用于解决 Problem 1 。

如上，在目标检测中，我们定义了用于提取的特征的 $\phi$ 。很多时候，我们用神经网络来自动提取特征，尤其对于图片。

Problem 2&3

【李宏毅2020 ML/DL】补充：Structured Learning: Introduction & Structured Linear Model
如上，我们先假设我们解决了穷举的 $y$ 的问题。

对于训练，我们让正确的 $(x,y)$ 的函数值大过所有其他错误的组合。

用目标检测举例如上。希望红色圈圈的值，大过所有蓝色圈圈的值；红色星星的值，打过所有蓝色星星的值。

Algorithm

【李宏毅2020 ML/DL】补充：Structured Learning: Introduction & Structured Linear Model
其算法如上。其中，我们还是先假设问题2已经解决。算法与机器学习很像。当 w 不再更新时，停止迭代。