Skip to content
This repository has been archived by the owner on Jul 16, 2021. It is now read-only.

Facing issue AttributeError: 'DataFrame' object has no attribute 'to_delayed' for dask-xgboost #82

Open
bhavana3 opened this issue Apr 8, 2021 · 0 comments

Comments

@bhavana3
Copy link

bhavana3 commented Apr 8, 2021

What happened:
Facing an issue like AttributeError: 'DataFrame' object has no attribute 'to_delayed' from core.py (line 175, version latest - 0.1.11) data_parts = data.to_delayed()
What you expected to happen:
train model by calling
bst = dxgb.train(client, hyperparameters, df, labels)

Minimal Complete Verifiable Example:

# 
import dask.dataframe as dd

df = dd.read_csv(data_files_location + files_pattern, compression='gzip', delimiter=CONTROL_A_DELIM, quoting=csv.QUOTE_NONE, header=0,
                         na_values=NA_DELIM, keep_default_na=False, dtype=str, blocksize=None)
labels = df[label]
df.drop(label, axis=1, inplace=True)
bst = dxgb.train(client, hyperparameters, df, labels)

Environment:

  • Dask version: >=1.15.2
  • Python version:3.7
  • Operating System: N/A (using jupyter notebooks on cloud)
  • Install method (conda, pip, source): conda
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant