crash when data size exceeds int32 #5705

Closed
SiNZeRo opened this issue Feb 9, 2023 · 5 comments
@SiNZeRo
Contributor

SiNZeRo commented Feb 9, 2023

Maybe related to this code, where the index arithmetic could overflow:

const size_t data_offset = offset + data_index * num_columns_in_cur_partition;

@jameslamb
Collaborator

jameslamb commented Feb 9, 2023

Thanks for using LightGBM.

I've updated the link in your question to one that's anchored to a specific commit, so even if that file you've linked to is altered, anyone reading this in the future will know what line you meant. If you don't know how to do that, see https://docs.github.com/en/repositories/working-with-files/using-files/getting-permanent-links-to-files#press-y-to-permalink-to-a-file-in-a-specific-commit.

Typically a report like "this crashed" without any other details is very difficult for us to investigate. Since you didn't provide any details like a minimal, reproducible example, I'm going to interpret this as a question: "does training with device_type=cuda support using training data with more than max(int32) (2,147,483,647) rows?"

I SUSPECT that the answer is "no", given that even CPU-based LightGBM does not support more than int32 rows in the input data: #5454 .

@guolinke or @shiyu1994 can you please comment?

@SiNZeRo
Contributor Author

SiNZeRo commented Feb 9, 2023


Thanks for the updates.

Sorry for the confusion: by "data size" I actually meant num_rows * num_feats, not the row count alone.

Somewhat related to this PR: #5167

@jameslamb
Collaborator

Closed via #5706.

@jameslamb
Collaborator

Thanks for the help @SiNZeRo!

@github-actions

This issue has been automatically locked since there has not been any recent activity since it was closed. To start a new related discussion, open a new issue at https://github.com/microsoft/LightGBM/issues including a reference to this.

@github-actions github-actions bot locked as resolved and limited conversation to collaborators Aug 19, 2023