StreamPETR is an efficient camera-only 3D detection algorithm. This detection framework has an object-centric design and utilizes temporal information to enhance the performance with minimum computational overhead. Its long-sequence modeling ability and sparse query design results in significant performance improvement.
This repository demostrates how to deploy StreamPETR on NVIDIA Drive Orin Platform with TensorRT, in the following two steps:
The inference application is structured as depicted in the following diagram. The entire model is divided into an image encoder and a head. This division serves two primary purposes: firstly, it facilitates separate control over precision, and secondly, it allows for the separation of the backbone and head, enabling a cleaner handling of the recursive memory tensor update. During each frame, the application loads the image, ego motion, and timestamp externally. It then establishes the input 'memory' based on the output 'memory' from the previous frame.