面向遥感图像的改进RT-DETR目标检测算法
DOI:
CSTR:
作者:
作者单位:

1.南京信息工程大学电子与信息工程学院 南京 210044;2.无锡学院电子信息工程学院 无锡 214105

作者简介:

通讯作者:

中图分类号:

TP391.4;TN914

基金项目:

国家自然科学基金(62204172)资助


Improved RT-DETR object detection algorithm for remote sensing images
Author:
Affiliation:

1.School of Electronic and Information Engineering, Nanjing University of Information Science and Technology, Nanjing 210044, China;2.School of Electronic Information Engineering, Wuxi University,Wuxi 214105, China

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    针对遥感图像目标检测中目标分布密集、背景复杂和小目标众多等问题导致检测效果不佳。本文提出一种基于RT-DETR的RSD-DETR遥感图像目标检测算法。首先,设计了轻量级多尺度特征提取Faster-CGLU模块,将门控机制和部分卷积融合,优化局部和全局特征信息的聚合,同时减少计算冗余。其次,结合级联分组注意力(CGA)构建CGA-AIFI模块,以在抑制非相关背景信息的同时关注关键特征区域,增强模型与目标特征的交互能力。最后,设计跨尺度动态特征融合(CS-DFFM)结构,通过动态尺度序列特征融合(DySSFF)模块和三重特征编码器(TFE)模块,对多尺度特征图进行尺寸对齐和动态融合,防止上下采样导致的小目标特征信息丢失,增强了网络多尺度特征融合能力。实验结果表明,在SIMD和DOTA-v1.0数据集上,所提算法在参数量较基线模型降低22.11%的情况下,平均精度均值(mAP0.5)分别达到了79.9%和86.8%,较基线模型分别提高了2.5%和1.7%,模型实时性也得到了提高。检测效果优于其他经典模型,具有卓越的性能。

    Abstract:

    Dense target distribution, complex backgrounds, and a large number of small objects often lead to suboptimal detection performance in remote sensing image object detection. To address these challenges, this paper proposes RSD-DETR, a remote sensing object detection algorithm based on RT-DETR. First, a lightweight multi-scale feature extraction module, Faster-CGLU, is designed by integrating a gating mechanism with partial convolution, which optimizes the aggregation of local and global feature information while reducing computational redundancy. Second, a CGA-AIFI module is constructed using cascaded group attention (CGA), which focuses on critical feature regions while suppressing irrelevant background information, thereby enhancing the interaction between the model and object features. Finally, a cross-scale dynamic feature fusion module (CS-DFFM) is designed, which performs spatial alignment and dynamic fusion of multi-scale feature maps through the dynamic scale-sequence feature fusion (DySSFF) module and the triple feature encoder (TFE) module. This effectively mitigates the loss of small object features caused by upsampling and downsampling, and enhances the network′s multi-scale feature fusion capability. Experimental results show that on the SIMD and DOTA-v1.0 datasets, the proposed algorithm reduces the number of parameters by 22.11% compared with the baseline model, and the mean average precision (mAP0.5) reaches 79.9% and 86.8% respectively, which are 2.5% and 1.7% higher than those of the baseline model. The real-time performance of the model is also improved. The detection effect is better than other classic models, and it has excellent performance.

    参考文献
    相似文献
    引证文献
引用本文

陈辉,王新蕾.面向遥感图像的改进RT-DETR目标检测算法[J].电子测量技术,2026,49(4):204-216

复制
分享
相关视频

文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2026-04-16
  • 出版日期:
文章二维码

重要通知公告

①《电子测量技术》期刊收款账户变更公告