【发布时间】:2020-05-18 14:54:21
【问题描述】:
我有这样的数据框。
UserID Review MovieID
0 10112 Good MOV001
1 10112 Excellent MOV002
2 10112 Average MOV003
3 10113 Good MOV001
4 10113 Bad MOV002
5 10113 Good MOV003
6 10113 Excellent MOV004
7 10114 Good MOV001
8 10114 Bad MOV002
9 10114 Good MOV003
10 10114 Excellent MOV004
我已将评论更改为 int 值。
movies.Review[movies.Status == 'Average'] = 2
movies.Review[movies.Status == 'Good'] = 3
movies.Review[movies.Status == 'Excellent'] = 5
movies.Review[movies.Status == 'Very Good'] = 4
movies.Review[movies.Status == 'Okay'] = 1
movies.Review[movies.Status == 'Bad'] = 0
movies
现在我的数据框将如下所示,
UserID Review MovieID
0 10112 3 MOV001
1 10112 5 MOV002
2 10112 2 MOV003
3 10113 3 MOV001
4 10113 0 MOV002
5 10113 3 MOV003
6 10113 5 MOV004
7 10114 3 MOV001
8 10114 0 MOV002
9 10114 3 MOV003
10 10114 5 MOV004
现在如何根据评论找到每部电影的平均分数?谁能帮帮我?
【问题讨论】:
-
groupby.mean?? -
你能帮我写代码吗?
-
根据您设置分数的逻辑,查看
np.select -
是的,如前所述,您需要
groupby.mean -
类似
movies.groupby('MovieID').mean()
标签: python python-3.x pandas dataframe recommendation-engine