[bug] Fix race condition #460

akhoroshev · 2023-09-23T12:32:13Z

I encountered a segfault when I started generating a lot of requests with tensor parallelism > 1.

It happens because threads exit from the method LlamaV2::forward and release the input/output tensors while, at the same time, threads running LlamaV2::internalThreadEntry with rank > 0 can still access these tensors.
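To illustrate the lifetime problem described above, here is a minimal, self-contained sketch of one way to close this kind of race: the caller threads wait on a barrier until every worker has finished reading before they release their buffers. It is not necessarily the change made in this PR. The names Barrier, run_request, and world_size are hypothetical, and a plain std::vector<int> stands in for the real input/output tensors.

```cpp
#include <cassert>
#include <condition_variable>
#include <cstddef>
#include <future>
#include <mutex>
#include <numeric>
#include <thread>
#include <vector>

// Hypothetical reusable barrier (not the project's own class).
class Barrier {
public:
    explicit Barrier(std::size_t count) : threshold_(count) {}

    void wait() {
        std::unique_lock<std::mutex> lock(mutex_);
        const std::size_t gen = generation_;
        if (++waiting_ == threshold_) {
            // Last thread to arrive releases everyone.
            waiting_ = 0;
            ++generation_;
            cv_.notify_all();
        } else {
            cv_.wait(lock, [&] { return gen != generation_; });
        }
    }

private:
    std::mutex mutex_;
    std::condition_variable cv_;
    const std::size_t threshold_;
    std::size_t waiting_ = 0;
    std::size_t generation_ = 0;
};

// Simulates one request with `world_size` ranks. Each caller thread owns a
// buffer and hands a pointer to a long-lived worker of the same rank.
std::vector<long> run_request(std::size_t world_size) {
    assert(world_size > 0);
    using TensorPtr = const std::vector<int>*;

    Barrier all_done(2 * world_size);  // every caller + every worker
    std::vector<std::promise<TensorPtr>> handoff(world_size);
    std::vector<std::future<TensorPtr>> incoming;
    for (auto& p : handoff) incoming.push_back(p.get_future());

    std::vector<long> sums(world_size, 0L);
    std::vector<std::thread> workers, callers;

    for (std::size_t rank = 0; rank < world_size; ++rank) {
        workers.emplace_back([&, rank] {
            TensorPtr tensor = incoming[rank].get();
            sums[rank] = std::accumulate(tensor->begin(), tensor->end(), 0L);
            all_done.wait();  // this rank no longer touches the buffer
        });
    }
    for (std::size_t rank = 0; rank < world_size; ++rank) {
        callers.emplace_back([&, rank] {
            std::vector<int> tensor(1024, static_cast<int>(rank + 1));
            handoff[rank].set_value(&tensor);
            // Without this wait, `tensor` could be destroyed while a
            // worker is still reading it (the race described above).
            all_done.wait();
        });
    }
    for (auto& t : callers) t.join();
    for (auto& t : workers) t.join();
    return sums;
}
```

Calling run_request(4) returns one sum per rank. The point of the design is that no caller leaves its scope, and therefore no buffer is freed, until all workers have passed the barrier.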

lzhangzz

LGTM

fix race condition

314ba9b

akhoroshev changed the title ~~fix race condition~~ [bug] Fix race condition Sep 23, 2023

lvhan028 requested review from lzhangzz and lvhan028 September 25, 2023 02:55

lzhangzz approved these changes Sep 25, 2023


hatrexltd mentioned this pull request Sep 25, 2023

[Bug] topk is larger #441

Closed

2 tasks

lvhan028 mentioned this pull request Sep 26, 2023

[Bug] CUDA runtime error #475

Closed

2 tasks

lvhan028 approved these changes Sep 26, 2023


lvhan028 merged commit a54e3e0 into InternLM:main Sep 26, 2023
3 checks passed

lvhan028 added the Bug:P1 label Sep 26, 2023
