
Why not support the Tesla P100 instead of limiting compute capability to greater than 7.0? #1284

Closed
jianhuaz opened this issue Oct 7, 2023 · 8 comments

Comments


jianhuaz commented Oct 7, 2023

Bfloat16 is only supported on GPUs with compute capability of at least 8.0. Your Tesla P100-PCIE-16GB GPU has compute capability 6.0.
RuntimeError: GPUs with compute capability below 7.0 are not supported.


wasertech commented Oct 7, 2023

As the runtime error states, the P100 only has a compute capability of 6.0, which is below the minimum of 8.0 required for bfloat16 support. This means the Tesla P100 cannot use bfloat16 efficiently or correctly. By requiring a compute capability of 8.0 or higher for bfloat16, the software can ensure the GPU has the right hardware to use it without problems.
Note that fp16 only requires a compute capability of 7.0, but I'm afraid your GPU is simply not built to do such computation efficiently.
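For reference, a quick way to check what a GPU reports, using PyTorch; a minimal sketch, where the 7.0/8.0 thresholds simply mirror the error message above:

```python
import torch

# Query the CUDA compute capability of the default GPU; a Tesla P100 reports (6, 0).
major, minor = torch.cuda.get_device_capability()
print(f"compute capability: {major}.{minor}")
print("bf16 supported:", major >= 8)   # Ampere (8.0) and newer
print("fp16 supported:", major >= 7)   # Volta (7.0) and newer, per the error above
```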


jianhuaz commented Oct 8, 2023

But the P100 is widely used in school teaching, where its lower performance is not a problem. If vLLM is set aside during the teaching phase, and future company employees never learn vLLM well while at school, wouldn't that be a loss for vLLM?


jianhuaz commented Oct 8, 2023

Software is not only used in production environments. Before reaching production, it is used for teaching, for validating technical approaches, and so on.


jianhuaz commented Oct 8, 2023

Software like this targets the whole world, and many developing and underdeveloped countries also need good technology to develop their own countries.

@Harrison-cc

Add this parameter on the command line:
--dtype half
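For vLLM's offline Python API, the equivalent knob is the dtype argument; a minimal sketch, where the model name is just a small placeholder:

```python
from vllm import LLM, SamplingParams

# dtype="half" loads weights and runs compute in fp16 instead of bf16.
llm = LLM(model="facebook/opt-125m", dtype="half")  # placeholder model
outputs = llm.generate(["Hello, my name is"],
                       SamplingParams(temperature=0.8, max_tokens=32))
print(outputs[0].outputs[0].text)
```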


jianhuaz commented Oct 8, 2023

@Harrison-cc Y

jianhuaz closed this as completed Oct 8, 2023
@Harrison-cc

This means loading the model in fp16 (which the V100 supports), but I'm not sure if it performs the same as loading in bf16. fp16 has a narrower dynamic range than bf16 (fp16 has 5 exponent bits, bf16 has 8), though fp16 has more mantissa bits and so is finer-grained within that range.
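The trade-off is easy to see from the numeric limits PyTorch exposes; a small sketch:

```python
import torch

# fp16 spends its bits on the mantissa (precision), bf16 on the exponent (range).
for dtype in (torch.float16, torch.bfloat16):
    info = torch.finfo(dtype)
    print(f"{str(dtype):16s} max={info.max:.2e}  eps={info.eps:.2e}")
# torch.float16    max≈6.55e+04  eps≈9.77e-04  -> finer steps, small range
# torch.bfloat16   max≈3.39e+38  eps≈7.81e-03  -> coarser steps, fp32-like range
```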

@jasonacox (Contributor)

You can use --dtype float. I managed to get a Docker container of vLLM running Mistral on a system with four P100s. Details: #963 (comment)
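For completeness, a minimal sketch of what that setup could look like with the offline API; the model name and tensor_parallel_size=4 are assumptions matching the four-P100 description:

```python
from vllm import LLM

llm = LLM(
    model="mistralai/Mistral-7B-v0.1",  # placeholder Mistral checkpoint
    dtype="float",                      # full fp32, no half-precision kernels needed
    tensor_parallel_size=4,             # shard the model across the four P100s
)
print(llm.generate(["The capital of France is"])[0].outputs[0].text)
```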
