Research on sound source localization based on the second-order cross-correlation SRP-PHAT algorithm

Affiliation:

1.Institute of Intelligent Control and Robotics, Hangzhou Dianzi University, Hangzhou 310018, China; 2.Xianheng International Technology Co., Ltd., Hangzhou 310009, China

CLC number:

TH89; TB52

    Abstract:

    The steered response power with phase transform (SRP-PHAT) algorithm is widely used for sound source localization because of its robustness in reverberant environments. However, with large multi-microphone arrays the conventional SRP-PHAT algorithm offers insufficient localization accuracy at a high computational cost, so it cannot meet the demands of high-precision, real-time localization. To address these problems, an SRP-PHAT algorithm based on second-order cross-correlation is proposed. The autocorrelation and cross-correlation results of the signals from two groups of channels in the array are cross-correlated a second time, and the improved SRP-PHAT function is then computed from the generalized cross-correlation with phase transform (GCC-PHAT) function; a peak search over this function yields the source location with improved accuracy. To reduce computation, the microphone array is divided into reference channels and sound source channels, and correlations are computed only between the two groups. This avoids the redundancy of the conventional pairwise computation across all channels and greatly reduces the computational load. Both the conventional SRP-PHAT algorithm and the second-order cross-correlation SRP-PHAT algorithm were ported to a self-developed 128-element multi-spiral-arm array sound source localization system, and indoor localization experiments were conducted at four source frequencies (10 to 25 kHz). The results show that, across the four source frequencies, the improved algorithm reduces the azimuth estimation error by 2.5° on average and the elevation (pitch) estimation error by 2° on average, and improves the spatial resolution of the localization by 45.78% on average. Compared with the conventional algorithm, the improved algorithm raises localization accuracy while substantially reducing computation, providing an effective solution for real-time sound source localization with SRP-PHAT on multi-microphone arrays.
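
The two ideas in the abstract, correlating only between a reference group and a source group, and applying a second cross-correlation before the PHAT weighting, can be illustrated with a minimal sketch. It is not the authors' implementation. The abstract gives no formulas, so the sketch assumes the classical second-correlation form (the autocorrelation of the reference channel cross-correlated with the reference/source cross-correlation), circular correlation over a single frame, and a hypothetical count of 8 reference channels. All function names (`pair_counts`, `dft`, `second_order_gcc_phat`, `estimate_delay`, `demo_signals`) are illustrative.

```python
import cmath
import math
import random


def pair_counts(n_mics, n_ref):
    """Channel pairs to correlate: all pairs vs. reference-to-source pairs only."""
    all_pairs = n_mics * (n_mics - 1) // 2
    cross_group_pairs = n_ref * (n_mics - n_ref)
    return all_pairs, cross_group_pairs


def dft(x, inverse=False):
    """Naive O(N^2) discrete Fourier transform; adequate for a short demo frame."""
    n = len(x)
    sign = 1.0 if inverse else -1.0
    out = []
    for k in range(n):
        acc = sum(x[t] * cmath.exp(sign * 2j * math.pi * k * t / n) for t in range(n))
        out.append(acc / n if inverse else acc)
    return out


def second_order_gcc_phat(x_ref, x_src):
    """PHAT-weighted second cross-correlation between two equal-length frames.

    Assumed formulation (not taken from the paper):
      G11 = conj(X1) * X1   auto-spectrum of the reference channel
      G12 = conj(X1) * X2   cross-spectrum of reference and source channels
      G2  = conj(G11) * G12 spectrum of the correlation of R11 with R12
    """
    X1, X2 = dft(x_ref), dft(x_src)
    g11 = [a.conjugate() * a for a in X1]
    g12 = [a.conjugate() * b for a, b in zip(X1, X2)]
    g2 = [p.conjugate() * q for p, q in zip(g11, g12)]
    eps = 1e-12  # guards against division by zero in empty frequency bins
    weighted = [g / (abs(g) + eps) for g in g2]  # PHAT: keep phase, drop magnitude
    return [v.real for v in dft(weighted, inverse=True)]


def estimate_delay(x_ref, x_src):
    """Lag (in samples) of the correlation peak; positive if x_src lags x_ref."""
    r = second_order_gcc_phat(x_ref, x_src)
    n = len(r)
    peak = max(range(n), key=lambda i: r[i])
    return peak if peak <= n // 2 else peak - n  # map circular index to signed lag


def demo_signals(n, delay, noise_std=0.0, seed=0):
    """White-noise reference frame and a circularly delayed copy (synthetic data)."""
    rng = random.Random(seed)
    x_ref = [rng.gauss(0.0, 1.0) for _ in range(n)]
    x_src = [x_ref[(t - delay) % n] + rng.gauss(0.0, noise_std) for t in range(n)]
    return x_ref, x_src
```

For a 128-element array, `pair_counts(128, 8)` returns 8128 pairs for the all-pairs scheme against 960 for the assumed 8-channel reference group, and `estimate_delay(*demo_signals(64, 5))` recovers the 5-sample delay in the noise-free case. A full SRP-PHAT search would sum such correlation functions over the selected pairs at the delays implied by each candidate direction, which requires the array geometry and is left out here.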

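Cite this article

Xi Xugang, Wang Chen, Li Wenguo, Ding Yi, Ma Bingyu (席旭刚, 王晨, 李文国, 丁一, 马秉宇). Research on sound source localization based on the second-order cross-correlation SRP-PHAT algorithm[J]. Chinese Journal of Scientific Instrument (仪器仪表学报), 2025, 46(4): 251-259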

History
  • Published online: 2025-06-23