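[Posted]: 2019-12-13 23:47:36
[Question]:
I'm testing Dask and can't understand why it is so much slower than plain Python. I wrote two examples in Jupyter and timed each one. I suspect I'm doing something wrong.
First with Dask: 28.5 seconds. Then with plain Python: 140 milliseconds.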
import dask
import dask.array as da
%%time
def inc(x):
    return x + 1

def double(x):
    return x + 2

def add(x, y):
    return x + y

N = 100000
data = [0 for x in range(N)]
# Note: this Dask array is never used; the loop below rebinds x to each list element.
x = da.from_array(data, chunks=(1000))

output = []
for x in data:
    # Three delayed tasks are created per element.
    a = dask.delayed(inc)(x)
    b = dask.delayed(double)(x)
    c = dask.delayed(add)(a, b)
    output.append(c)

total = dask.delayed(sum)(output)
total.compute()
**28.8 seconds**
Now with plain Python:
%%time
def inc(x):
    return x + 1

def double(x):
    return x + 2

def add(x, y):
    return x + y

N = 100000
data = [0 for x in range(N)]

output = []
for x in data:
    a = inc(x)
    b = double(x)
    c = add(a, b)
    output.append(c)

total = sum(output)
**140 milliseconds**
[Comments]:
Tags: python-3.x dask dask-delayed
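One possible explanation, offered as a sketch rather than a definitive diagnosis: the Dask version above builds roughly 300,000 tiny delayed tasks (three per element plus the final sum), and each task carries scheduling overhead that far exceeds the cost of a single integer addition. A common way to reduce that overhead is to do many elements of work inside each task. The example below assumes this is the bottleneck. `process_batch` and `batch_size` are hypothetical names introduced for illustration and are not part of the original code.

```python
import dask


def inc(x):
    return x + 1


def double(x):
    return x + 2


def add(x, y):
    return x + y


def process_batch(batch):
    # Hypothetical helper: runs the whole inc/double/add pipeline
    # in plain Python for every element of one batch.
    return sum(add(inc(x), double(x)) for x in batch)


N = 100000
data = [0 for _ in range(N)]

# Assumed batch size; tune it so each task does a meaningful amount of work.
batch_size = 10000
batches = [data[i:i + batch_size] for i in range(0, N, batch_size)]

# One delayed task per batch instead of three per element.
partials = [dask.delayed(process_batch)(batch) for batch in batches]
total = dask.delayed(sum)(partials)

result = total.compute()
print(result)
```

With these settings the graph holds 11 tasks instead of about 300,000. For work this cheap, plain Python may still be faster. Dask tends to pay off when each task is expensive enough to outweigh the scheduling cost.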