[js/node] allow arenaExtendStrategy and gpuMemLimit option for CUDA EP #23176
base: main
Conversation
@microsoft-github-policy-service agree
@fs-eire Would you mind reviewing this?
/azp run Windows ARM64 QNN CI Pipeline,Windows x64 QNN CI Pipeline,Windows CPU CI Pipeline,Windows GPU CUDA CI Pipeline,Windows GPU DML CI Pipeline,Windows GPU Doc Gen CI Pipeline,Windows GPU TensorRT CI Pipeline,ONNX Runtime Web CI Pipeline,Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline
/azp run Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline,Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,onnxruntime-binary-size-checks-ci-pipeline,Big Models,Linux Android Emulator QNN CI Pipeline
/azp run Android CI Pipeline,iOS CI Pipeline,ONNX Runtime React Native CI Pipeline,CoreML CI Pipeline,Linux DNNL CI Pipeline,Linux MIGraphX CI Pipeline,Linux ROCm CI Pipeline
Azure Pipelines successfully started running 7 pipeline(s).
Azure Pipelines successfully started running 8 pipeline(s).
Azure Pipelines successfully started running 10 pipeline(s).
Description
Allow the `arenaExtendStrategy` and `gpuMemLimit` session options for the CUDA execution provider in the Node.js binding.
Motivation and Context
"arenaExtendStrategy" is required when pushing the model or batch size to the limit of GPU memory.
"gpuMemLimit" provides more control over the GPU memory when multiple Inference Sessions are loaded at the same time.