无法替换Dask系列分区

时间:2018-07-03 06:21:29

标签: dask dask-delayed

我正在尝试用自己的分区替换Series dask分区。 我在@MRocklin帖子中使用了this给出的代码段。

list_of_delayed = dask_df.to_delayed()
new_partition = dask.delayed(pd.read_csv)(filename)
list_of_delayed[i] = new_partition
new_dask_df = dd.from_delayed(list_of_delayed, meta=dask_df._meta)

除了dask_df是我的系列文章之外,我做的完全相同。我收到以下错误:

Traceback (most recent call last):
File "sdfr_dhruvkmr.py", line 465, in <module>
    pts = task[(task.task_date <= dtm.Time.iloc[i]) & (task.T_Date == dtm.Date.iloc[i])]
  File "/usr/lib/python2.7/site-packages/edask/dataframe.py", line 130, in __getitem__
    new_dask_df = dd.from_delayed(list_of_delayed)
  File "/usr/lib/python2.7/site-packages/edask/edask/dask/dataframe/io/io.py", line 493, in from_delayed
    type(df).__name__)
TypeError: Expected Delayed object, got Delayed

0 个答案:

没有答案