Tensorflow/keras implementation of End-to-End Object Detection with Transformers(DETR) research paper
Paper link : https://arxiv.org/abs/2005.12872
This is an unofficial Tensorflow implementation of DETR,End-to-end Object Detection with Transformers (Carion et al.).This Repo uses tensorflow's official MultiHeadAttention layer for attention. For Backbone, a similiar implementation of Pytorch's Resnet has been used rather than Tensorflow's pretrained resnet architecture due to discrepancies in Model architecture like use of dilation in Pytorch convolution layers, presence bias weights in Tensorflow etc. PositionEmbeddingSine layer has been adopted from official Pytorch's DETR library. Weights have been manually converted to fit the architecture. You can download the converted Pytorch weights in Tensorflow from here.
python3 demo_coco.py --image_path path_to_image
- Model Architectures
- Loss
- Training
- Inference
- Pretrained Model
- DataGenerator
- Weight conversion scripts
- Pillow == 8.3.1
- Tensorflow == 2.6.2
- opencv-python == 4.4.0
- keras == 2.6.0