A Survey of Explainable Recommender Systems

Behnoush Abdollahi and Olfa Nasraoui. Using explainability for constrained
matrix factorization. In Proceedings of the Eleventh ACM Conference on
Recommender Systems, pages 79–83. ACM, 2017.

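Abdollahi and Nasraoui [2017] propose MEP (mean explainability precision) and MER (mean explainability recall). EP (explainability precision) is the proportion of explainable items among the items in a Top-N recommendation list. ER (explainability recall) is the proportion of explainable items in the Top-N list relative to all explainable items. MEP and MER are the averages of EP and ER, respectively, over all users.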
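
The definitions above can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation: the function names are hypothetical, and the set of explainable items for each user is assumed to be given as input (how an item is judged explainable is outside this sketch).

```python
def explainability_precision(recommended, explainable):
    """EP: share of the Top-N recommended items that are explainable."""
    if not recommended:
        return 0.0
    hits = sum(1 for item in recommended if item in explainable)
    return hits / len(recommended)


def explainability_recall(recommended, explainable):
    """ER: share of all explainable items that appear in the Top-N list."""
    if not explainable:
        return 0.0
    hits = sum(1 for item in recommended if item in explainable)
    return hits / len(explainable)


def mean_explainability(recs_by_user, explainable_by_user):
    """Return (MEP, MER): EP and ER averaged over all users."""
    users = list(recs_by_user)
    if not users:
        return 0.0, 0.0
    eps, ers = [], []
    for user in users:
        # Assumption: a user with no entry has no explainable items.
        explainable = explainable_by_user.get(user, set())
        eps.append(explainability_precision(recs_by_user[user], explainable))
        ers.append(explainability_recall(recs_by_user[user], explainable))
    return sum(eps) / len(users), sum(ers) / len(users)


# Toy data, invented for illustration only.
recs = {"u1": ["a", "b", "c", "d"], "u2": ["a", "e"]}
explainable = {"u1": {"a", "b", "x"}, "u2": {"e"}}
mep, mer = mean_explainability(recs, explainable)
```

In this toy example both users have EP = 0.5, while ER is 2/3 for `u1` and 1 for `u2`, so MEP is 0.5 and MER is 5/6.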

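For the second approach, the evaluation procedure depends on the specific type of explanation. A common type is a textual sentence, which can be evaluated offline with text-based metrics. For example, on an e-commerce platform, the reviews that users write can be treated as ground-truth explanations of why they purchased an item. If the generated explanation is text, standard text-generation metrics apply, such as BLEU (bilingual evaluation understudy)¹ and ROUGE (recall-oriented understudy for gisting evaluation)². Common readability measures can also be used, such as the Gunning Fog Index³, Flesch Reading Ease⁴, Flesch-Kincaid Grade Level⁵, Automated Readability Index⁶, and SMOG Index⁷.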
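
To make the idea of text-based offline evaluation concrete, here is a simplified Python sketch. It does not implement full BLEU or ROUGE: it computes only clipped unigram precision (the building block of BLEU-1, without the brevity penalty) and unigram recall (the idea behind ROUGE-1), plus the Automated Readability Index using its standard formula. The tokenizer, the function names, and the example sentences are assumptions made for illustration.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase and split into simple word tokens (a crude assumption)."""
    return re.findall(r"[a-z0-9']+", text.lower())


def unigram_overlap(candidate, reference):
    """Return (precision, recall) of clipped unigram overlap."""
    cand = Counter(tokenize(candidate))
    ref = Counter(tokenize(reference))
    # Counter intersection keeps the minimum count per token (clipping).
    overlap = sum((cand & ref).values())
    precision = overlap / max(sum(cand.values()), 1)
    recall = overlap / max(sum(ref.values()), 1)
    return precision, recall


def automated_readability_index(text):
    """ARI = 4.71 * chars/words + 0.5 * words/sentences - 21.43."""
    words = re.findall(r"[A-Za-z0-9]+", text)
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    if not words or not sentences:
        return 0.0
    chars = sum(len(w) for w in words)
    return 4.71 * chars / len(words) + 0.5 * len(words) / len(sentences) - 21.43


# Generated explanation vs. a user review treated as ground truth.
generated = "the battery life is great"
review = "great battery life and a great screen"
precision, recall = unigram_overlap(generated, review)
ari = automated_readability_index("The cat sat. It ran.")
```

In practice an established implementation of BLEU and ROUGE is preferable to a hand-rolled one; the sketch only shows what such metrics compare.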

Online Evaluation

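First, explanation quality can also be evaluated online from users' actual behavior, using conversion rate (CR) and click-through rate (CTR). Beyond these, other dimensions include persuasiveness, effectiveness, efficiency, and satisfaction. Persuasiveness is the most straightforward to measure: it asks whether the explanations help users accept the recommendations.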
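
As a rough illustration of behavior-based evaluation, the Python sketch below compares CTR and CR between a group shown explanations and a control group. The counts are invented, the function names are hypothetical, and CR is assumed here to mean conversions per click; other definitions (for example, conversions per impression) are also in use.

```python
def rate(numerator, denominator):
    """Safe ratio that returns 0.0 when the denominator is zero."""
    return numerator / denominator if denominator else 0.0


def online_metrics(impressions, clicks, conversions):
    """CTR = clicks / impressions; CR = conversions / clicks (assumed)."""
    return {"ctr": rate(clicks, impressions), "cr": rate(conversions, clicks)}


def relative_lift(treatment, control):
    """Relative change of the treatment metric over the control metric."""
    if control == 0:
        return None  # lift is undefined without a control baseline
    return (treatment - control) / control


# Invented counts: recommendations with vs. without explanations.
with_expl = online_metrics(impressions=1000, clicks=80, conversions=12)
without_expl = online_metrics(impressions=1000, clicks=50, conversions=5)
ctr_lift = relative_lift(with_expl["ctr"], without_expl["ctr"])
```
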
Vig et al. [2009] studied four explanation interfaces on the MovieLens website: RelSort, PrefSort, RelOnly, and PrefOnly.

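Participants completed an online survey in which they rated how well each interface helped them (1) understand why an item was recommended (justification), (2) decide whether they would like the recommended item (effectiveness), and (3) determine whether the recommended item matched their mood (mood compatibility). Based on the survey results, the authors drew conclusions about the roles that tag preference and tag relevance play in promoting justification, effectiveness, and mood compatibility.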

Applications of Explainable Recommendation in Different Domains

Future Research Directions

To be continued.

¹ Kishore Papineni, Salim Roukos, Todd Ward, and Wei-Jing Zhu. BLEU: a method for automatic evaluation of machine translation. In Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, pages 311–318. Association for Computational Linguistics, 2002.
² Chin-Yew Lin. ROUGE: A package for automatic evaluation of summaries. Text Summarization Branches Out, 2004.
³ Robert Gunning. The technique of clear writing. McGraw-Hill, New York, 1952.
⁴ Rudolph Flesch. A new readability yardstick. Journal of Applied Psychology, 32(3):221, 1948.
⁵ J. Peter Kincaid, Robert P. Fishburne Jr., Richard L. Rogers, and Brad S. Chissom. Derivation of new readability formulas (Automated Readability Index, Fog Count and Flesch Reading Ease formula) for Navy enlisted personnel. Technical report, Naval Technical Training Command Millington TN Research Branch, 1975.
⁶ R. J. Senter and Edgar A. Smith. Automated readability index. Technical report, Cincinnati Univ OH, 1967.
⁷ G. Harry McLaughlin. SMOG grading: a new readability formula. Journal of Reading, 12(8):639–646, 1969.