【发布时间】:2020-08-26 08:42:46
【问题描述】:
我想从两列中统计每天出现的唯一名称:
df = pd.DataFrame({
'ColA':['john wick','bloody mary','peter pan','jeff bridges','billy boy'],
'ColB':['bloody mary','jeff bridges','billy boy','billy boy','john wick'],
'date':['2000-01-01', '2000-01-01', '2000-01-03', '2000-01-03', '2000-01-03'],})
datetime_series = pd.to_datetime(df['date'])
datetime_index = pd.DatetimeIndex(datetime_series.values)
df2 = df.set_index(datetime_index)
df2.drop('date',axis=1,inplace=True)
df2
Out[746]:
ColA ColB
2000-01-01 john wick bloody mary
2000-01-01 bloody mary jeff bridges
2000-01-03 peter pan billy boy
2000-01-03 jeff bridges billy boy
2000-01-03 billy boy john wick
以便我获得一个系列或类似以下内容:
unique occurrences of names
2000-01-01 3
2000-01-03 4
【问题讨论】:
标签: python pandas count unique