
Dynamic onnx #242

Open
stqwzr wants to merge 2 commits into main

Conversation

stqwzr commented Aug 8, 2024

Small changes for creating a dynamic ONNX model and then converting it to TensorRT. With these changes, the input and output shapes will look like the attached image (visualized with Netron after ONNX export).

[image: Netron view of the dynamic input and output shapes]

Tested with multiple batch sizes to ensure the model performs efficiently and correctly with dynamic input shapes. Also attached a screenshot of the successful conversion to TensorRT using trtexec only.

[image: trtexec log showing successful conversion to TensorRT]
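
For reference, a trtexec invocation along these lines can build an engine with a dynamic batch dimension (a sketch only: the file names, the input tensor name images, and the 640x640 shapes are assumptions to adapt to your own model):

trtexec --onnx=yolov8s_dynamic.onnx \
        --saveEngine=yolov8s_dynamic.engine \
        --minShapes=images:1x3x640x640 \
        --optShapes=images:4x3x640x640 \
        --maxShapes=images:8x3x640x640 \
        --fp16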

Please review the changes and provide any feedback if needed. Thank you!

Update readme for using dynamic onnx and then converting to TensorRT
added flag dynamic, for using dynamic onnx
stqwzr (Author) commented Aug 8, 2024

Also, for converting I used this Docker image: nvcr.io/nvidia/tensorrt:23.06-py3
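
For example, a command like the following should drop you into that container with trtexec on the PATH (the mounted model directory is just an illustration):

docker run --gpus all -it -v /path/to/models:/workspace nvcr.io/nvidia/tensorrt:23.06-py3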

triple-Mu (Owner) commented

Great job!
I will test your PR this weekend.
Thanks so much!

Egorundel commented

@stqwzr Hello!
Did you rewrite the code to make it work with batch inference?

stqwzr (Author) commented Aug 28, 2024

Hi @Egorundel
I only added the converting part, which supports batch inference; I didn't change the inference code.

Egorundel commented Aug 28, 2024

@stqwzr It's a pity, because I tried to change the inference code so that everything works with an incoming batch of images, std::vector<cv::Mat> batchImages. So far it has been unsuccessful, as I am confused about how to correctly allocate the CUDA-related memory.

In theory, everything is ready for it; I just need to run each image through the preprocessing and postprocessing functions, form an image vector that way, and then feed it to the inference.

Do you have any tips on what can be done?

stqwzr (Author) commented Aug 28, 2024

@Egorundel You can try creating a flattened_batch_data buffer (float*) whose shape is (batch x channels x height x width).

#include <opencv2/opencv.hpp>
#include <vector>
#include <cstring>
#include <cuda_runtime.h>

// Flattens a batch of equally sized images into a single NCHW float buffer.
// flattened_data must already point to batch * channels * height * width floats.
void flattenBatch(const std::vector<cv::Mat>& input_batch, float* flattened_data) {
    int channels = input_batch[0].channels();
    int height = input_batch[0].rows;
    int width = input_batch[0].cols;
    size_t plane = static_cast<size_t>(height) * width; // elements per channel plane

    for (size_t i = 0; i < input_batch.size(); ++i) {
        cv::Mat img = input_batch[i];
        img.convertTo(img, CV_32F); // Convert image to float if not already

        std::vector<cv::Mat> channels_vec;
        cv::split(img, channels_vec); // Split into per-channel planes

        // Copy each channel plane into its slot in the NCHW output buffer.
        for (int c = 0; c < channels; ++c) {
            std::memcpy(flattened_data + (i * channels + c) * plane,
                        channels_vec[c].data,
                        plane * sizeof(float));
        }
    }
}

Kind of like this. By the way, it was generated by GPT, so there may be some issues, but the logic should be the same.
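
For completeness, here is a minimal usage sketch showing how the flattened host buffer could then be copied to GPU memory with the CUDA runtime API. The batch size of 3, the 640x640 image size, and the buffer names are placeholders, and error checking is omitted:

#include <cuda_runtime.h>
#include <opencv2/opencv.hpp>
#include <vector>

// Forward declaration of the helper defined in the snippet above.
void flattenBatch(const std::vector<cv::Mat>& input_batch, float* flattened_data);

int main() {
    // Illustrative batch of three 640x640 BGR images (all zeros).
    std::vector<cv::Mat> batchImages(3, cv::Mat::zeros(640, 640, CV_8UC3));

    const size_t batch = batchImages.size();
    const size_t channels = batchImages[0].channels();
    const size_t height = batchImages[0].rows;
    const size_t width = batchImages[0].cols;
    const size_t numel = batch * channels * height * width;

    // Flatten the whole batch on the host into NCHW order.
    std::vector<float> host_input(numel);
    flattenBatch(batchImages, host_input.data());

    // Allocate a matching device buffer and copy the batch over in one transfer.
    float* d_input = nullptr;
    cudaMalloc(reinterpret_cast<void**>(&d_input), numel * sizeof(float));
    cudaMemcpy(d_input, host_input.data(), numel * sizeof(float), cudaMemcpyHostToDevice);

    // d_input can now be bound as the engine's input for this batch size
    // (after the execution context's input shape is set to 3x3x640x640).

    cudaFree(d_input);
    return 0;
}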
