Speed up initial set up #1442
Unanswered
TalhaAsmal
asked this question in
Ideas
Replies: 1 comment
-
It is not in our plan yet. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
While running through a structured data test on an average sized dataset (560000 x 310) I noticed the initial set up (before training begins) was taking quite long. I traced it down to line 259 in auto_model.py, self._analyze_data(dataset).
I noticed basically each data point is checked for consistency and whether it's a categorical or numeric feature, and I couldn't help but feel there must be a better way than to sequentially iterate through each value in each row. Are there any plans to investigate speeding it up, perhaps making use of parallel processing or batch analysis?
Beta Was this translation helpful? Give feedback.
All reactions