MFCC based real-time speech reproduction and recognition using distributed acoustic sensing technology
Abstract:
Distributed acoustic sensing technology was used for real-time speech reproduction and recognition, with the voiceprint extracted by the Mel frequency cepstral coefficient (MFCC) method. A classic ancient Chinese poem, “You Zi Yin” (also called “A Traveler’s Song”), was analyzed in both the time and frequency domains, and its real-time reproduction was achieved with a time delay of 116.91 ms. The smaller-scaled MFCC0, at 1/12 of the MFCC matrix, was taken as the feature vector of each line against the ambient noise, which provides a recognition method based on cross-correlation among the 6 original and recovered verse pairs. The averaged cross-correlation coefficient of the matching pairs is calculated to be 0.5806, higher than the 0.1883 of the nonmatching pairs, promising an accurate and fast method for real-time speech reproduction and recognition over a passive optical fiber.
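The recognition step described above can be illustrated with a minimal sketch. It is not the authors' implementation: the feature vectors are synthetic random stand-ins for the per-line MFCC0 vectors, the vector length of 200 and the noise level are arbitrary assumptions, and the zero-lag normalized correlation used here is one plausible reading of "cross-correlation coefficient". The function names are hypothetical.

```python
import numpy as np


def correlation_coefficient(a, b):
    """Zero-lag normalized cross-correlation of two feature vectors."""
    a = np.asarray(a, dtype=float) - np.mean(a)
    b = np.asarray(b, dtype=float) - np.mean(b)
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float(np.dot(a, b) / denom) if denom > 0 else 0.0


def correlation_matrix(originals, recovered):
    """Entry [i, j] compares original line i with recovered line j."""
    return np.array([[correlation_coefficient(o, r) for r in recovered]
                     for o in originals])


def recognize(matrix):
    """For each recovered line (column), pick the best-matching original."""
    return np.argmax(matrix, axis=0)


# Synthetic stand-ins (assumption): 6 "original" feature vectors and
# noisy "recovered" copies, in place of real MFCC0 vectors.
rng = np.random.default_rng(0)
originals = rng.normal(size=(6, 200))
recovered = originals + rng.normal(scale=1.0, size=originals.shape)

matrix = correlation_matrix(originals, recovered)
off_diagonal = ~np.eye(6, dtype=bool)
matched_mean = float(np.mean(np.diag(matrix)))       # matching pairs
unmatched_mean = float(np.mean(matrix[off_diagonal]))  # nonmatching pairs
predicted = recognize(matrix)

print("matched mean:", round(matched_mean, 4))
print("unmatched mean:", round(unmatched_mean, 4))
print("predicted line indices:", predicted.tolist())
```

A clear gap between the diagonal (matching) and off-diagonal (nonmatching) averages is what allows each recovered line to be assigned to its original; the values printed here come from synthetic data and are not comparable to the coefficients reported in the abstract.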
ZHOU Ran, ZHAO Shuai, LUO Mingming, MENG Xin, MA Jie, LIU Jianfei. MFCC based real-time speech reproduction and recognition using distributed acoustic sensing technology[J]. Optoelectronics Letters, 2024, 20(4): 222-227.