Enhancement of speech detected by laser coherent detection method based on spectral feature adaptation
DOI:
CSTR:
Author:
Affiliation:

1.State Key Laboratory of Precision Measurement Technology and Instrument,Tianjin University, Tianjin 300072, China; 2.Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, China; 3.University of Chinese Academy of Sciences, Beijing 100049, China

Clc Number:

TH741TN911.7

Fund Project:

  • Article
  • |
  • Figures
  • |
  • Metrics
  • |
  • Reference
  • |
  • Related
  • |
  • Cited by
  • |
  • Materials
  • |
  • Comments
    Abstract:

    To address the issue of slowly varying broadband background noise and channel effects caused by vibration of the target in laser-coherent speech detection, this paper proposes a speech enhancement method for specific speakers based on an analysis-re-synthesis framework. This method first extracts the features from the observed signal: pitch, voiced speech probability, and MCEP coefficients, where MCEP coefficients represent the spectral envelope features which can capture the shape of the spectral envelope. A GMM trained by speech features of the corresponding speaker is used to help estimate the spectral envelope features of the clean speech from the spectral envelope features of the observed speech, and then the speech signal is resynthesized by combining it with pitch and voiced speech probability estimated from the observed speech to achieve speech enhancement. The estimation of noise and channel parameters is achieved by adaptation, which maximize the posterior probability of the observed speech′s spectral envelope features, and then the estimation of the clean speech spectral envelope features is obtained by MMSE estimation. Both synthesized signal experiments and actual signal acquisition experiments verify the denoising and equalization capabilities of the algorithm in laser coherent speech detection scenarios.When the speech signal is detected by laser doppler vibrometer, there will be slowly varying broadband background noise and channel effects caused by the vibration measurement target. Thus, we propose a speech enhancement method for a specific speaker based on the analysis-resynthesis framework to enhance that distorted signal. This method first extracts the features from the observed signal: pitch, voiced speech probability, and MCEP coefficients, where MCEP coefficients are the spectral envelope features which can capture the shape of the spectral envelope. A GMM trained by speech features of the corresponding person is used to help estimate the spectral envelope features of the clean speech from the spectral envelope features of the observed speech, and then the speech signal is resynthesized by combining it with pitch and voiced speech probability estimated from the observed speech to achieve speech enhancement. The estimation of noise and channel parameters is achieved by adaptation, which maximize the posterior probability of the observed speech′s spectral envelope features, and then the estimation of the clean speech spectral envelope features is obtained by MMSE estimation. Synthetic signal experiments and actual signal acquisition experiments verify the denoising and equalization capabilities of the algorithm in laser coherent speech detection scenario.

    Reference
    Related
    Cited by
Get Citation
Share
Article Metrics
  • Abstract:
  • PDF:
  • HTML:
  • Cited by:
History
  • Received:
  • Revised:
  • Adopted:
  • Online: December 17,2024
  • Published:
Article QR Code