基于时空特征融合的交通信号控制与仿真分析
DOI:
CSTR:
作者:
作者单位:

新疆大学电气工程学院 乌鲁木齐 830047

作者简介:

通讯作者:

中图分类号:

TN911.4;TP183

基金项目:

新疆维吾尔自治区自然科学基金(2022D01C430)、 国家自然科学基金(51468062) 项目资助


Traffic signal control and simulation analysis based on spatio-temporal feature fusion
Author:
Affiliation:

School of Electrical Engineering, Xinjiang University,Urumqi 830047, China

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    针对现有方法因忽略历史交通信息导致时空特征感知不足的问题,提出一种融合深度强化学习与时空特征建模的交叉口信号控制方法。该方法使用D3QN-LSTM混合网络架构,通过离散交通状态编码将多时段交通信息表征为高维矩阵,采用卷积神经网络提取空间特征,结合长短时记忆网络捕捉时序依赖关系,并设计基于奖励反馈的动态探索机制优化策略训练过程。基于SUMO仿真平台进行实验,结果表明:相较于固定时长控制及传统强化学习方法,所提方法在早高峰流量情况下平均排队长度指标上分别降低49.95%、35.04%和16.72%,累积等待时间减少63.03%、35.55%和20.15%,有效验证了时空特征建模与动态探索策略的优越性。为评估算法鲁棒性,进一步开展平峰期交通流实验,结果表明:所提算法在平均排队长度与累积等待时间指标上依然保持显著优势,证明该方法对不同交通场景具有强适应性和良好的泛化能力。

    Abstract:

    To address the insufficient spatio-temporal feature perception caused by neglecting historical traffic information in existing methods,this study proposes an intersection signal control method integrating deep reinforcement learning with spatio-temporal feature modeling. The approach employs a hybrid D3QN-LSTM network architecture,which encodes multi-period traffic information into high-dimensional matrices through discrete traffic state representation. A convolutional neural network extracts spatial features,while a long short-term memory network captures temporal dependencies. A reward-feedback-driven dynamic exploration mechanism is further designed to optimize policy training. Experiments conducted on the SUMO simulation platform demonstrate that during morning peak traffic,the proposed method reduces average queue length by 49.95%,35.04% and 16.72%,and decreases cumulative waiting time by 63.03%,35.55% and 20.15% compared to fixed-timing control,conventional reinforcement learning methods and D3QN,respectively,validating the superiority of spatio-temporal feature modeling and dynamic exploration strategies. To assess algorithmic robustness,off-peak traffic flow experiments further confirm that the proposed method maintains significant advantages in both average queue length and cumulative waiting time metrics,demonstrating strong adaptability and generalizability across varying traffic load conditions.

    参考文献
    相似文献
    引证文献
引用本文

刘振航,黄德启,黄德意,黄海峰.基于时空特征融合的交通信号控制与仿真分析[J].电子测量技术,2026,49(4):1-10

复制
分享
相关视频

文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2026-04-16
  • 出版日期:
文章二维码

重要通知公告

①《电子测量技术》期刊收款账户变更公告