MFCC-based real-time speech reproduction and recognition using distributed acoustic sensing technology
DOI:
Author:
Affiliation:

Hebei University of Technology

CLC Number:

Fund Project:

The National Natural Science Foundation of China (General Program, Key Program, Major Research Plan)

    Abstract:

    Distributed acoustic sensing (DAS) technology was used for real-time speech reproduction and recognition, with the voiceprint extracted by the Mel-frequency cepstral coefficient (MFCC) method. A classic ancient Chinese poem, “You Zi Yin” (also known as “A Traveler’s Song”), was analyzed in both the time and frequency domains, and its real-time reproduction was achieved with a time delay of 116.91 ms. The smaller-scale MFCC0, at 1/12 the size of the full MFCC matrix, was taken as the feature vector of each line against the ambient noise, enabling recognition via cross-correlation among the six pairs of original and recovered verses. The average cross-correlation coefficient of the matching pairs is 0.5806, higher than the 0.1883 of the non-matching pairs, which points to an accurate and fast method for real-time speech reproduction and recognition over a passive optical fiber.
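
The matching step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes each verse has already been reduced to an equal-length MFCC0 feature vector, it uses the zero-lag Pearson coefficient as the cross-correlation measure, and it substitutes synthetic random vectors for real DAS recordings. The names `correlation_coefficient` and `match_lines` are hypothetical.

```python
import numpy as np


def correlation_coefficient(a, b):
    """Zero-lag normalized cross-correlation (Pearson) of two equal-length vectors."""
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    a = a - a.mean()
    b = b - b.mean()
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    # A constant vector has no variance; treat it as uncorrelated.
    return float(a @ b / denom) if denom > 0 else 0.0


def match_lines(original_feats, recovered_feats):
    """Score every recovered line against every original line.

    Returns the score matrix (rows: recovered, columns: original) and,
    for each recovered line, the index of the best-scoring original line.
    """
    scores = np.array([
        [correlation_coefficient(o, r) for o in original_feats]
        for r in recovered_feats
    ])
    return scores, scores.argmax(axis=1)


# Synthetic stand-ins for the MFCC0 vectors of six verses (assumption:
# real features would come from an MFCC front end, not random numbers).
rng = np.random.default_rng(0)
n_lines, n_frames = 6, 200
original = rng.standard_normal((n_lines, n_frames))
# "Recovered" features: the originals plus additive noise of equal power.
recovered = original + rng.standard_normal((n_lines, n_frames))

scores, best = match_lines(original, recovered)
matched_mean = scores.diagonal().mean()
unmatched_mean = scores[~np.eye(n_lines, dtype=bool)].mean()
print("best match per recovered line:", best.tolist())
print("mean matched / unmatched score:", matched_mean, unmatched_mean)
```

In this sketch a recovered verse is assigned to the original verse with the highest coefficient, and the gap between the mean diagonal and mean off-diagonal scores plays the role of the matched versus non-matched averages reported above. Verses of different durations would first need to be aligned or resampled to a common length, which is left out here.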

History
  • Received: August 18, 2023
  • Revised: October 13, 2023
  • Adopted: November 07, 2023
  • Online:
  • Published: