您可以制作包含日期时间段的时间数据帧,然后您可以进行 OUTER JOIN(使用pd.merge()),如下所示:
import pandas as pd
from datetime import timedelta
df = pd.DataFrame({
'init': ['2022-02-02 10:34:00', '2022-02-02 11:34:00', '2022-02-02 13:34:00', '2022-02-02 14:34:00', '2022-02-02 17:34:00'],
'end': ['2022-02-02 11:34:00', '2022-02-02 12:34:00', '2022-02-02 14:34:00', '2022-02-02 15:34:00', '2022-02-02 18:34:00'],
'temp': [34, 12, 23, 22, 18],
})
# to convert str to datetime type for init and end columns
df['init'] = pd.to_datetime(df['init'])
df['end'] = pd.to_datetime(df['end'])
# to create temporal dataframe for additional rows
tmp_df = pd.DataFrame()
tmp_df['init'] = pd.date_range(start=df.iloc[0]['init'] - timedelta(hours=1), end=df.iloc[-1]['end'], freq="H")
# to create final result
result = pd.merge(df, tmp_df, on='init', how='outer')
result = result.sort_values(by=['init']).reset_index(drop=True)
#result['end'] = result['init'] + timedelta(hours=1) # use this if you make end value as init + 1 hour
result['end'] = result['end'].apply(lambda x: datetime(2020, 2, 2, 11, 34, 0) if x is pd.NaT else x)
result['temp'] = result['temp'].fillna(0) # convert NaN to 0
print(result)
这将打印您所期望的:
>>> result
init end temp
0 2022-02-02 09:34:00 2020-02-02 11:34:00 0.0
1 2022-02-02 10:34:00 2022-02-02 11:34:00 34.0
2 2022-02-02 11:34:00 2022-02-02 12:34:00 12.0
3 2022-02-02 12:34:00 2020-02-02 11:34:00 0.0
4 2022-02-02 13:34:00 2022-02-02 14:34:00 23.0
5 2022-02-02 14:34:00 2022-02-02 15:34:00 22.0
6 2022-02-02 15:34:00 2020-02-02 11:34:00 0.0
7 2022-02-02 16:34:00 2020-02-02 11:34:00 0.0
8 2022-02-02 17:34:00 2022-02-02 18:34:00 18.0
9 2022-02-02 18:34:00 2020-02-02 11:34:00 0.0
如果要将“结束”列设为“init + 1 小时”,请使用此代码(已在代码中注释)#result['end'] = result['init'] + timedelta(hours=1),而不是 result['end'] = result['end'].apply(lambda x: datetime(2020, 2, 2, 11, 34, 0) if x is pd.NaT else x)。
这将打印以下内容:
init end temp
0 2022-02-02 09:34:00 2022-02-02 10:34:00 0.0
1 2022-02-02 10:34:00 2022-02-02 11:34:00 34.0
2 2022-02-02 11:34:00 2022-02-02 12:34:00 12.0
3 2022-02-02 12:34:00 2022-02-02 13:34:00 0.0
4 2022-02-02 13:34:00 2022-02-02 14:34:00 23.0
5 2022-02-02 14:34:00 2022-02-02 15:34:00 22.0
6 2022-02-02 15:34:00 2022-02-02 16:34:00 0.0
7 2022-02-02 16:34:00 2022-02-02 17:34:00 0.0
8 2022-02-02 17:34:00 2022-02-02 18:34:00 18.0
9 2022-02-02 18:34:00 2022-02-02 19:34:00 0.0