›› 2015, Vol. 17 ›› Issue (4): 100-106.DOI: 10.13304/j.nykjdb.2015.176

Previous Articles     Next Articles

Robust Speech Recognition Method for Agricultural Product Market Information Collection

XU Jin-pu1,2, XU Feng-juan3, ZHU Ye-ping1*, LIU Sheng-ping1, YUE Hui-li1, LIU Dan1   

  1. (1.Institute of Agricultural Information, Chinese Academy of Agricultural Sciences, Beijing 100081|2.Animation and Media College, Qingdao Agricultural University, Shandong Qingdao 266109|3.Beijing Xinyuan Electronic Information Technology CO., Beijing 100040, China)
  • Received:2015-03-30 Revised:2015-05-06 Online:2015-08-15 Published:2015-08-15

农产品市场信息采集的语音识别鲁棒性方法

许金普1,2,许丰娟3,诸叶平1*,刘升平1,岳慧丽1,刘丹1   

  1. (1.中国农业科学院农业信息研究所, 北京 100081|2.青岛农业大学动漫与传媒学院, 山东 青岛 266109;3.北京市信源电子信息技术公司, 北京 100040)
  • 通讯作者: 诸叶平,研究员,博士生导师,研究方向为智能农业信息技术。E-mail:zhuyeping@caas.cn
  • 作者简介:许金普|讲师|博士研究生|研究方向为智能农业信息技术和语音识别。E-mail:xjp@qau.edu.cn。
  • 基金资助:

    国家自然科学基金项目(61271364);中国农业科学院创新工程项目(ASTIP-2015-AⅡ-03)资助。

Abstract:

At present, many portable devices of information collection were designed and applied to agricultural product market information acquisition. Speech recognition has made great progress, and voice interface can be applied to these devices, but the general robust recognition algorithms are too complicated. In order to solve this problem, this paper proposed a robust speech recognition method, suitable for the devices operation environment. At first, MMSE amplitude estimator was used to enhance the noisy speech signal and improve signal-to-noise ratio (SNR) of the input signal; and then, for the speech distortion and residual noise after enhancement, the cepstrum mean and variance normalization (CMVN) methods were used to compensate. The experimental results showed that the combined algorithm could effectively improve the system correct recognition rate, especially under low SNR (0~10 dB) environment.

Key words: agricultural market information, speech recognition, minimum mean square error, feature compensation

摘要:

当前便携式农产品市场信息采集设备缺少语音接口,且通用领域的识别算法又过于复杂,为此提出一种适用于该设备作业环境的语音识别鲁棒性方法。首先利用MMSE估计器对带噪信号进行增强,以提高输入信号的信噪比;对增强后产生的语音失真和残留噪声,再利用倒谱均值方差归一化(CMVN)方法进行补偿。实验结果表明,该联合后的算法能有效的提高系统的识别率,特别是在低信噪比(0~10 dB)环境下更为有效。

关键词: 农产品市场信息;语音识别;最小均方误差;特征补偿

CLC Number: