《A Through Examination of the CNN_Daily Mail Reading Comprehension Task》——Stanford Attentive Reader

序

论文其他细节不再注意，只关注它的网络结构。
可能是年代比较久远，github上只有一个这篇论文的代码…还是python2.7的

模型结构

《A Through Examination of the CNN_Daily Mail Reading Comprehension Task》——Stanford Attentive Reader

模型分三部分：
第一部分，编码：问题的词编码一样，先通过一个embedding表，把词编程embedding，然后过双向GRU，前向和后向连在一起表示这个token出的表示，同样对问题也编码，只说了问题编码后的维度：h,估计和其他论文一样，都是前向后向的最后一个concat到一起。

《A Through Examination of the CNN_Daily Mail Reading Comprehension Task》——Stanford Attentive Reader

第二部分：attention部分，跟其他论文一样，只是attention的计算方式变了：bilinear term，公式见下：
大概率感觉这个Ws矩阵应该是个变量，需要学习出来。
第三部分： predict部分，细节在下面的对比里面说

《A Through Examination of the CNN_Daily Mail Reading Comprehension Task》——Stanford Attentive Reader

和 attentive reader对比

第一

attention匹配函数不一样，而且这个变化对于结果好贡献很大。

第二

和attentive reader对比，这里直接用o去预测了，没有像attentive reader一样再加上question 的embedding q，并且表现也不差。

第三

这个模型最后预测时不用整个词库，只用了entity的词库。
最搞笑的是：加粗那一句，他们说只有第一个是最重要的，其他都是为了简化模型，所以模型核心就是换了一个attention 匹配函数，和张俊林大佬说的一样。
The original model considers all the words from the vocabulary V in making predictions. We think this is unnecessary, and only predict among entities which appear in the passage. Of these changes, only the first seems important; the other two just aim at keeping the model simple.

END

本篇完

相关文章：

2021-05-14
2022-12-23
2021-05-30
2021-08-15
2021-05-02
2022-03-07
2021-06-15
2021-12-28

猜你喜欢

2021-11-16
2021-04-11
2021-11-20
2021-05-05
2022-01-17
2022-12-23
2021-12-18

相关资源

下载 2021-06-05
下载 2022-12-22
下载 2021-11-02
下载 2021-11-02

相似解决方案

热门标签

Java Python linux javascript Mysql C# Docker 算法前端 SpringBoot Redis Vue spring 设计模式 .net core .net kubernetes c++ 数据库数据结构大数据 js 机器学习微服务 Android Go 程序员面试 JVM ASP.net core 云原生人工智能后端 PHP git CSS golang k8s Nginx Django mybatis 深度学习多线程 React 架构 devops 爬虫云计算 Spring Boot LeetCode