[ONNX] Refactor delegate memory management #32661
base: master
Conversation
c994e2b to f01eae9
Signed-off-by: Maxim Vafin <[email protected]>
Pull Request Overview
Refactors memory management in the ONNX frontend by consolidating tensor data handling from multiple pointer-based members to a single AlignedBuffer approach.
- Replaces multiple data-related fields in TensorMetaInfo with a single m_buffer field (see the sketch below)
- Unifies tensor data access through get_buffer() instead of separate data pointers and external locations
- Consolidates external data loading into reusable utility functions
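For illustration, a minimal sketch of the consolidated layout this describes; TensorMetaInfo, m_buffer and ov::AlignedBuffer are taken from the PR, while the helper functions, the omitted fields and the exact header path are assumptions:

#include <cstddef>
#include <memory>

#include "openvino/runtime/aligned_buffer.hpp"  // assumed header for ov::AlignedBuffer

// Sketch only: the real struct also carries element type, shape, name, etc.
struct TensorMetaInfo {
    // Single owning buffer replacing the old m_tensor_data / m_tensor_data_any /
    // m_tensor_data_size trio.
    std::shared_ptr<ov::AlignedBuffer> m_buffer;
};

// Hypothetical helpers showing how callers read the data and its size through
// the unified buffer instead of separate pointers.
inline const void* tensor_data(const TensorMetaInfo& info) {
    return info.m_buffer ? info.m_buffer->get_ptr() : nullptr;
}

inline std::size_t tensor_data_size(const TensorMetaInfo& info) {
    return info.m_buffer ? info.m_buffer->size() : 0;
}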
Reviewed Changes
Copilot reviewed 9 out of 9 changed files in this pull request and generated no comments.
Summary per file:
| File | Description |
|---|---|
| src/frontends/onnx/tests/CMakeLists.txt | Adds tensor external data utility file to test sources and includes |
| src/frontends/onnx/frontend/src/translate_session.cpp | Updates tensor data check to use unified buffer approach |
| src/frontends/onnx/frontend/src/input_model.cpp | Updates tensor data check and constructor to use buffer |
| src/frontends/onnx/frontend/src/core/tensor.hpp | Removes old data fields, adds buffer-based constructor and accessor |
| src/frontends/onnx/frontend/src/core/tensor.cpp | Implements unified buffer-based data extraction and constant creation |
| src/frontends/onnx/frontend/src/core/node.cpp | Updates attribute tensor creation to use buffer |
| src/frontends/onnx/frontend/src/core/graph_iterator_proto.cpp | Major refactor of tensor data extraction to use buffer utilities |
| src/frontends/onnx/frontend/src/core/decoder_proto.hpp | Removes old tensor data field initialization |
| src/frontends/onnx/frontend/include/openvino/frontend/onnx/decoder.hpp | Updates TensorMetaInfo structure to use buffer field |
Review not completed yet. Publishing the remarks found so far and switching to a more urgent review.
-    const uint8_t* m_tensor_data;
-    ov::Any m_tensor_data_any;
-    size_t m_tensor_data_size;
+    std::shared_ptr<ov::AlignedBuffer> m_buffer;
Missing include:
#include <memory> // For shared_ptr
Actually your AlignedBuffer already is a resource manager. Maybe you could keep it by value in TensorMetaInfo (instead of shared_ptr) in order to avoid managing the memory twice (and paying with memory fragmentation / non-locality)?
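To make the trade-off concrete, a rough sketch of the two ownership layouts being discussed; only m_buffer and ov::AlignedBuffer come from the PR, the struct names and header path are illustrative:

#include <memory>

#include "openvino/runtime/aligned_buffer.hpp"  // assumed header for ov::AlignedBuffer

// As proposed in the PR: the metadata holds a shared_ptr, so the buffer object
// itself lives in a separate heap allocation with its own control block.
struct TensorMetaInfoShared {
    std::shared_ptr<ov::AlignedBuffer> m_buffer;
};

// As suggested here: AlignedBuffer already owns and frees its allocation, so it
// could be held by value, removing one level of indirection and one allocation.
struct TensorMetaInfoByValue {
    ov::AlignedBuffer m_buffer;
};

The by-value form avoids the double bookkeeping and the locality cost mentioned above, at the price of making it harder for several tensors to alias the same underlying data, which shared_ptr gives for free.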
-target_include_directories(ov_onnx_frontend_tests PRIVATE "${CMAKE_CURRENT_SOURCE_DIR}")
+target_include_directories(ov_onnx_frontend_tests PRIVATE
I have a feeling that the objects in frontend/src belong in a separate target of their own; the need to use relative paths that traverse the file tree backwards indicates that.
In kosher CMake these libraries would export their interface directories, so that linking against them would "infect" the current target with the necessary paths. See: https://cmake.org/cmake/help/latest/prop_tgt/INTERFACE_INCLUDE_DIRECTORIES.html
Yeah, those are files from the frontend library. We include them here to avoid duplicating the code (or creating a separate target for the decoder).
-    } else {
-        return m_tensor_place->get_data_size();
-    }
+    FRONT_END_NOT_IMPLEMENTED(get_data_size);
Question: is it planned to be implemented?
Maybe yes, or maybe we will delete it later.
-        return true;
-    } else {
-        throw std::runtime_error("Unsupported memory management mode");
+template <typename T, typename Container>
I have a feeling that this belongs in AlignedBuffer (in a constructor or a factory member function).
Since we're using C++17, you'd need to detect things yourself:
#include <algorithm>    // for std::copy
#include <iterator>     // for std::begin, std::end
#include <memory>       // for std::shared_ptr, std::make_shared
#include <type_traits>
#include <utility>      // for std::declval

// C++17 has no concepts, so detect iterability ourselves.
template <typename T, typename = void>
struct is_iterable : std::false_type {};

template <typename T>
struct is_iterable<T, std::void_t<decltype(std::begin(std::declval<T>())),
                                  decltype(std::end(std::declval<T>()))>> : std::true_type {};

// These two factories return a shared_ptr, but as members of AlignedBuffer
// they could/should act in-place (i.e. fill the current AlignedBuffer).

// Selected when the container's value_type differs from the requested element
// type T: copy element by element with a cast.
template <typename T, typename Container,
          std::enable_if_t<!std::is_same<T, typename Container::value_type>::value, int> = 0>
std::shared_ptr<ov::AlignedBuffer> make_buffer_from_container(const Container& container) {
    static_assert(is_iterable<Container>::value, "Container must be iterable");
    auto buffer = std::make_shared<ov::AlignedBuffer>(container.size() * sizeof(T));
    T* ptr = buffer->template get_ptr<T>();
    for (const auto& elem : container) {
        *ptr++ = static_cast<T>(elem);
    }
    return buffer;
}

// Selected when the container already holds T: copy the range directly.
template <typename T, typename Container,
          std::enable_if_t<std::is_same<T, typename Container::value_type>::value, int> = 0>
std::shared_ptr<ov::AlignedBuffer> make_buffer_from_container(const Container& container) {
    static_assert(is_iterable<Container>::value, "Container must be iterable");
    auto buffer = std::make_shared<ov::AlignedBuffer>(container.size() * sizeof(T));
    std::copy(container.begin(), container.end(), buffer->template get_ptr<T>());
    return buffer;
}
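A hypothetical call site for the factories sketched above (container types and values are purely illustrative):

#include <cstdint>
#include <vector>

void example_usage() {
    // value_type (int64_t) differs from the requested T (int32_t):
    // the element-wise static_cast overload is selected.
    std::vector<int64_t> int64_values = {1, 2, 3};
    auto int32_buffer = make_buffer_from_container<int32_t>(int64_values);

    // value_type already matches T: the std::copy overload is selected.
    std::vector<float> float_values = {1.0f, 2.0f, 3.0f};
    auto float_buffer = make_buffer_from_container<float>(float_values);
}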
AlignedBuffer is just an interface for transferring memory; it doesn't need to have such an interface, we can do this in the frontend.
    T* ptr = buffer->template get_ptr<T>();
    size_t idx = 0;
    for (const auto& elem : container) {
        ptr[idx++] = static_cast<T>(elem);
You can use ptr directly:
*ptr++ = static_cast<T>(elem);
Hello @mvafin, I'm working on implementing the delegate interface for ORT. At first glance, the changes here seem to pose a problem for that implementation. I'd like to ask that this PR be put on hold until we can discuss the changes.
Details:
The changes allow memory management to be moved to the delegate side.
Tickets: