MFCC based real-time speech reproduction and recognition using distributed acoustic sensing technology.

MFCC based real-time speech reproduction and recognition using distributed acoustic sensing technology.
DOI:
                        
                    
CSTR:
                        [cstr]
                    
Author:
                        Zhou RanZhou Ran
Hebei University of Technoloogy
Find this author on All Journals
Find this author on BaiDu
Search for this author on this site
Zhao ShuaiZhao Shuai
Hebei University of Technoloogy
Find this author on All Journals
Find this author on BaiDu
Search for this author on this site
Luo MingmingLuo Mingming
Hebei University of Technoloogy
Find this author on All Journals
Find this author on BaiDu
Search for this author on this site
Meng XinMeng Xin
Hebei University of Technoloogy
Find this author on All Journals
Find this author on BaiDu
Search for this author on this site
Ma JieMa Jie
Hebei University of Technoloogy
Find this author on All Journals
Find this author on BaiDu
Search for this author on this site
Liu JianfeiLiu Jianfei
Hebei University of Technoloogy
Find this author on All Journals
Find this author on BaiDu
Search for this author on this site

                    
Affiliation:Hebei University of Technoloogy
Clc Number:
Fund Project:The National Natural Science Foundation of China (General Program, Key Program, Major Research Plan)

Article

Figures

Metrics

Reference

Cited by

Materials

Comments

Abstract:

The distributed acoustic sensing technology was used for real-time speech reproduction and recognition, in which the voiceprint can be extracted by the Mel Frequency Cepstral Coefficients (MFCC) method. A classic ancient Chinese poem “You Zi Yin”, also called “A Traveler’s Song”, was analyzed both in time and frequency domain, where its real-time reproduction was achieved with a 116.91 ms time-delay. The smaller scaled MFCC0 at 1/12 of MFCC matrix was taken as a feature vector of each line against the ambient noise, which provides a recognition method via cross-correlation among the six original and recovered verse pairs. The averaged cross-correlation coefficient of the matching pairs is calculated 0.5806 higher than 0.1883 of the nonmatched pairs, promising an accurate and fast method for real-time speech reproduction and recognition over a passive optical fiber.

Key words:distributed acoustic sensing technology; speech reproduction and recognition; MFCC extraction.

Get Citation

Copy

Article Metrics

Abstract:
PDF:
HTML:
Cited by:

History

Received:August 18,2023
Revised:October 13,2023
Adopted:November 07,2023
Online:
Published:

Home

About us

Authors

Editors

News

Contents

Contact us

Get Citation

Share

Article Metrics

History

Article QR Code