【发布时间】:2022-08-06 12:29:51
【问题描述】:
考虑以下代码
import dask
import dask.dataframe as dd
import pandas as pd
data_dict = {\'data1\':[1,2,3,4,5,6,7,8,9,10]}
df_pd = pd.DataFrame(data_dict)
df_dask = dd.from_pandas(df_pd,npartitions=2)
df_dask[\'data1x2\'] = df_dask[\'data1\'].apply(lambda x:2*x,meta=(\'data1x2\',\'int64\')).compute()
print(\'-\'*80)
print(df_dask[\'data1x2\'])
print(\'-\'*80)
print(df_dask[\'data1x2\'].compute())
print(\'-\'*80)
我想不通的是:为什么第一次和第二次打印的输出有区别?毕竟,我在应用函数时调用了计算并将结果存储在 df_dask[\'data1x2\'] 中。
标签: python pandas dataframe dask dask-dataframe