SDM，长短期兴趣融合召回

SDM简述

CIKM ’19，阿里巴巴的一项推荐工作。

贡献

在已有的 sequence-based 工作基础上，解决两个问题：

session 中存在 multiple interest tendencies
long-term behaviors are various and complex. 为此设计 long-short term gated 作长短期兴趣融合

网络结构

SDM，长短期兴趣融合召回

user profile preference

使用 $e_u=concat(\{e_u^p|p\in P\})$ 表达用户向量。where $P=\{age,gender,life\_stage\}$ .

short-term preference

使用 $e_i=concat(\{e_i^f|f\in F\})$ 表达商品向量。where $F=\{id,cate\_first\_level,cate\_leaf\_level,shop,brand\}$ . due to the sparsity caused by the large-scale items, encoding items only by item_id is far from satisfaction.
送往LSTM，得到 $[h^u_1,... ,h^u_t]$
送往self_att，得到 $[\hat h^u_1,... ,\hat h^u_t]$
user_attention，权重由 $softmax(<e_u,h^u_i>)$ 得到，不再像self_att那样先线性投影 Q,K 空间再点积相乘。该步得到短期偏好 $s^u_t$

long-term preference

$L^u=\{L_f^u|f\in F\},l^u_k\in L^u_f， l^u_k\in R^d$ , F 为field的集合，同上。 $L^u_f$ 为某个field的偏好list，同一field共享embedding 矩阵。
$z^u_f =user\_attention(e_u,L^u_f)\in R^d$ , 起到 pooling 作用。
$z^u=concat(\{z^u_f|f\in F\})$ ，得到长期偏好 $p^u=tanh(Wz^u+b)$ .

long-short term fusion gate

“we elaborately design a gated neural network”, $G^u=\sigma(W_1e_u+W_2s^u_t+W_3p^u+b)，G^u\in R^d$ ,该gate用来控制短期兴趣的占比。
$\odot$ 为element-wise multiplication,进一步得到 $o^u_t=G^u\odot s^u_t+(1-G^u)\odot p^u$ 用于召回。

candidate matching

$score(item_i)=<o^u_t,v_i>,score(item_i)\in R,v_i\in V$ , $V$ 是另一个item emb矩阵。

paper对比实验

我的讨论

参考

paper，SDM

目录