
[Question / Not sure if it's an issue] Suggested choice of hyperparameters feat_dim (N_a) == output_dim (N_d) leads to ValueError #14

Open
vrtr2020 opened this issue Oct 27, 2020 · 6 comments

Comments

@vrtr2020

Both the docstring of the TabNet class and the original article suggest N_a == N_d for most datasets
(the dimensionalities of the hidden representations and of the outputs of each decision step).
But in the code (tabnet.py:129) a ValueError is raised if N_a <= N_d.
I'm not sure whether this is an issue or whether my understanding of the code is incorrect.
Could you please clarify this point?

P.S.
I'd like to thank you for your implementation of a very interesting paper.
I'm trying to use the tabnet module for a small POC with an imbalanced dataset of ~20k samples, mostly categorical data.

@titu1994
Owner

There is a CUDA bug that occurs if N_d and N_a are the same. The code internally takes the remaining feat_dim - output_dim dimensions of information for the self-attention, so setting them equal produces a tensor with a zero-sized dimension, which fails on GPU.

To get an effective N_d == N_a, set N_a = 2 * N_d and that works.
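A minimal sketch of that failure mode, assuming the split mirrors the features_for_coef = transform_f4[:, self.output_dim:] line quoted later in this thread; the tensor names here are illustrative, not the repo's actual code:

```python
# Sketch (assumed names, not the repo's code): when feature_dim == output_dim,
# the slice that feeds the attentive transformer is empty.
import tensorflow as tf

batch, feature_dim, output_dim = 4, 8, 8            # feature_dim == output_dim
transform_f4 = tf.random.normal((batch, feature_dim))

features_for_coef = transform_f4[:, output_dim:]    # nothing left for attention
print(features_for_coef.shape)                      # (4, 0)

# Workaround from the comment above: feature_dim = 2 * output_dim leaves
# output_dim columns for the attention branch.
transform_f4 = tf.random.normal((batch, 2 * output_dim))
print(transform_f4[:, output_dim:].shape)           # (4, 8)
```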

@titu1994
Owner

And yes, I should update that docstring.

@MarcusTortorelli

You should update your examples as well.

@csetraynor

Interestingly, for the train_embedding example I get much better performance when setting N_a = N_d + 1 than with N_a = 2 * N_d. For example: feature_dim=5, output_dim=4 gives ~90% accuracy, while feature_dim=8, output_dim=4 gives ~60% accuracy.

@rforgione

Is anyone currently working on making this clearer? I found it very confusing that the default values do not run. The actual meaning should probably be noted in the comments/docstrings.

Also, I wonder if it makes sense to accept N_a and N_d as they are defined in the paper and the current docstrings, and then handle the actual dimensionality under the hood (it seems that, under the hood, feature_dim needs to be set to N_a + N_d per the original meanings).
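For illustration, a hypothetical wrapper along the lines of that suggestion; TabNetClassifier's exact constructor signature may differ between versions of this repo, so the arguments shown are assumptions:

```python
# Hypothetical helper (not part of this repo): accept the paper's N_a / N_d
# and translate them to the implementation's feature_dim / output_dim.
from tabnet import TabNetClassifier  # assumed import path

def build_tabnet_classifier(feature_columns, num_classes, n_a, n_d, **kwargs):
    """Map paper-style N_a (attention width) and N_d (decision width) onto
    feature_dim = N_a + N_d and output_dim = N_d."""
    return TabNetClassifier(
        feature_columns,
        num_classes=num_classes,
        feature_dim=n_a + n_d,   # total width of the feature transformer output
        output_dim=n_d,          # slice kept for the decision output
        **kwargs,
    )
```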

@Kipkull

Kipkull commented Apr 25, 2023

> Is anyone currently working on making this clearer? I found it very confusing that the default values do not run. The actual meaning should probably be noted in the comments/docstrings.
>
> Also, I wonder if it makes sense to accept N_a and N_d as they are defined in the paper and the current docstrings, and then handle the actual dimensionality under the hood (it seems that, under the hood, feature_dim needs to be set to N_a + N_d per the original meanings).

Moving line 270 in tabnet.py, features_for_coef = transform_f4[:, self.output_dim:], inside the "if" at line 272 would solve the problem.
