李宏毅2020人类语言处理—P1

人类语言处理（注重speech任务）== 自然语言处理（偏重Text任务）
处理的对象：Text和Speech（语音）

Speech processing is not only speech recognition。

audio：
1 second has 16k sample points, and each point has 256 possible values.
所以没有人可以说同一段话两次

本课程聚焦近3年的发展，探讨在“硬train一发”（把数据集丢进深度学习网络训练就能解决问题）之后的进展。

nlp task

6 kinds
李宏毅2020人类语言处理—P1

unsupervised voice conversion，and only one utterance from each speaker（one-shot learning）

1.speaker recognition，听声音辨别说话者 2.Keyword spotting，检测关键句（唤醒词：Hey Siri）
Text generation，used RNN，bert… its task include:Translation，Summarization，Chat-bot，Question Answer(this class focus)…