Skip to content

Commit

Permalink
Update configs/rtdetr/README.md
Browse files Browse the repository at this point in the history
Co-authored-by: Range King <RangeKingHZ@gmail.com>
  • Loading branch information
lyuwenyu and RangeKing authored Apr 18, 2023
1 parent 315017a commit 6fe61d6
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion configs/rtdetr/README.md
Original file line number Diff line number Diff line change
@@ -1,7 +1,7 @@
# DETRs Beat YOLOs on Real-time Object Detection

## Introduction
We propose a Real-Time DEtection TRansformer (RT-DETR), the first real-time end- to-end object detector to our best knowledge. Specifically, we design an efficient hybrid encoder to efficiently process multi-scale features by decoupling the intra-scale interac- tion and cross-scale fusion, and propose IoU-aware query selection to improve the initialization of object queries. In addition, our proposed detector supports flexibly adjust- ment of the inference speed by using different decoder lay- ers without the need for retraining, which facilitates the practical application of real-time object detectors. Our RT- DETR-L achieves 53.0% AP on COCO val2017 and 114 FPS on T4 GPU, while RT-DETR-X achieves 54.8% AP and 74 FPS, outperforming all YOLO detectors of the same scale in both speed and accuracy. Furthermore, our RT- DETR-R50 achieves 53.1% AP and 108 FPS, outperform- ing DINO-Deformable-DETR-R50 by 2.2% AP in accuracy and by about 21 times in FPS.
We propose a **R**eal-**T**ime **DE**tection **TR**ansformer (RT-DETR), the first real-time end-to-end object detector to our best knowledge. Specifically, we design an efficient hybrid encoder to efficiently process multi-scale features by decoupling the intra-scale interaction and cross-scale fusion, and propose IoU-aware query selection to improve the initialization of object queries. In addition, our proposed detector supports flexibly adjustment of the inference speed by using different decoder layers without the need for retraining, which facilitates the practical application of real-time object detectors. Our RT-DETR-L achieves 53.0% AP on COCO val2017 and 114 FPS on T4 GPU, while RT-DETR-X achieves 54.8% AP and 74 FPS, outperforming all YOLO detectors of the same scale in both speed and accuracy. Furthermore, our RT-DETR-R50 achieves 53.1% AP and 108 FPS, outperforming DINO-Deformable-DETR-R50 by 2.2% AP in accuracy and by about 21 times in FPS.

<div align="center">
<img src="https://user-images.githubusercontent.com/17582080/232390925-54e58fe6-1c17-4610-90b9-7e5525577d80.png" width=500 />
Expand Down

0 comments on commit 6fe61d6

Please sign in to comment.