MFCC based real-time speech reproduction and recognition using distributed acoustic sensing technology.
Article
Figures
Metrics
Preview PDF
Reference
Related
Cited by
Materials
Abstract:
The distributed acoustic sensing technology was used for real-time speech reproduction and recognition, in which the voiceprint can be extracted by the Mel Frequency Cepstral Coefficients (MFCC) method. A classic ancient Chinese poem “You Zi Yin”, also called “A Traveler’s Song”, was analyzed both in time and frequency domain, where its real-time reproduction was achieved with a 116.91 ms time-delay. The smaller scaled MFCC0 at 1/12 of MFCC matrix was taken as a feature vector of each line against the ambient noise, which provides a recognition method via cross-correlation among the six original and recovered verse pairs. The averaged cross-correlation coefficient of the matching pairs is calculated 0.5806 higher than 0.1883 of the nonmatched pairs, promising an accurate and fast method for real-time speech reproduction and recognition over a passive optical fiber.
Keywords:
Project Supported:
The National Natural Science Foundation of China (General Program, Key Program, Major Research Plan)