KBT-YOLO: A Lightweight and Efficient Small Object Detector for UAV Aerial Imagery

Main Article Content

Fanghao Shi
Manli Wang

Keywords

small-object detection, dynamic convolution, bidirectional feature pyramid network, task align dynamic detect head

Abstract

Object detection in Unmanned Aerial Vehicle (UAV) aerial imagery frequently suffers from missed detections and false alarms due to the extremely small scale of objects, dense distributions, and highly complex backgrounds. To mitigate these challenges, this paper proposes a novel lightweight small object detection method tailored for UAV aerial images. First, a dynamic convolution module is introduced to replace the standard convolutions within the C3k2 module, which simultaneously enhances feature extraction capabilities and effectively reduces computational redundancy. Second, a Bidirectional Feature Pyramid Network (BiFPN) architecture is incorporated to dynamically balance feature interactions via learnable weights, thereby improving local feature capturing and the effectiveness of multi-scale fusion. Finally, a task-aligned dynamic detection head based on a dual-branch structure is designed. By integrating task decomposition and alignment mechanisms with dynamic convolutions, this detection head significantly boosts both detection accuracy and stability. Extensive experiments on the DIOR UAV aerial dataset demonstrate that the proposed method achieves leading detection accuracy while substantially minimizing the parameter count to merely 1.85M. This research offers a promising new technical paradigm for lightweight object detection in UAV applications.

Abstract 7 | PDF Downloads 3

References

  • [1] Liang X, Zhang J, Zhuo L, et al. Small object detection in unmanned aerial vehicle images using feature fusion and scaling-based single shot detector with spatial context analysis[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2019, 30(6): 1758-1770.
  • [2] Girshick R. Fast r-cnn[C]//Proceedings of the IEEE international conference on computer vision. 2015: 1440-1448.
  • [3] Ren S, He K, Girshick R, et al. Faster R-CNN: Towards real-time object detection with region proposal networks[J]. IEEE transactions on pattern analysis and machine intelligence, 2016, 39(6): 1137-1149.
  • [4] He K, Gkioxari G, Dollár P, et al. Mask r-cnn[C]//Proceedings of the IEEE International Conference on Computer Vision. 2017: 2961-2969.
  • [5] Wang C Y, Bochkovskiy A, Liao H Y M. YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors[C]//Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2023: 7464-7475.
  • [6] Liu W, Anguelov D, Erhan D, et al. Ssd: Single shot multibox detector[C]//European conference on computer vision. Cham: Springer International Publishing, 2016: 21-37.
  • [7] Lin T Y, Goyal P, Girshick R, et al. Focal loss for dense object detection[C]//Proceedings of the IEEE international conference on computer vision. 2017: 2980-2988.
  • [8] Li K, Cheng G, Bu S, et al. Rotation-insensitive and context-augmented object detection in remote sensing images[J]. IEEE Transactions on Geoscience and Remote Sensing, 2017, 56(4): 2337-2348.
  • [9] Zhang G, Lu S, Zhang W. CAD-Net: A context-aware detection network for objects in remote sensing imagery[J]. IEEE Transactions on Geoscience and Remote Sensing, 2019, 57(12): 10015-10024.
  • [10] Gong Y, Xiao Z, Tan X, et al. Context-aware convolutional neural network for object detection in VHR remote sensing imagery[J]. IEEE Transactions on Geoscience and Remote Sensing, 2019, 58(1): 34-44.
  • [11] Li Y, Huang Q, Pei X, et al. Cross-layer attention network for small object detection in remote sensing imagery[J]. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2020, 14: 2148-2161.
  • [12] Han W, Li J, Wang S, et al. A context-scale-aware detector and a new benchmark for remote sensing small weak object detection in unmanned aerial vehicle images[J]. International Journal of Applied Earth Observation and Geoinformation, 2022, 112: 102966.
  • [13] Jing R, Zhang W, Li Y, et al. Feature aggregation network for small object detection[J]. Expert Systems with Applications, 2024, 255: 124686.
  • [14] Li K, Wan G, Cheng G, et al. Object detection in optical remote sensing images: A survey and a new benchmark[J]. ISPRS journal of photogrammetry and remote sensing, 2020, 159: 296-307.
  • [15] Luo J, Liu Z, Wang Y, et al. Efficient Small Object Detection You Only Look Once: A Small Object Detection Algorithm for Aerial Images[J]. Sensors, 2024, 24(21): 7067.
  • [16] Y. Zhang, M. Ye, G. Zhu, Y. Liu, P. Guo and J. Yan, "FFCA-YOLO for Small Object Detection in Remote Sensing Images," in IEEE Transactions on Geoscience and Remote Sensing, vol. 62, pp. 1-15, 2024, Art no. 5611215, doi: 10.1109/TGRS.2024.3363057.