Electronic Science and Technology (电子科技), 2019, Vol. 32, Issue (5): 32-37. doi: 10.16180/j.cnki.issn1007-7820.2019.05.007


Marine Mammal Sound Recognition Based on Feature Fusion

ZHONG Mingtuo (钟鸣拓), CAI Wenyu (蔡文郁)

  1. School of Electronic Information, Hangzhou Dianzi University, Hangzhou 310018, China
  • Received: 2018-05-16 Online: 2019-05-15 Published: 2019-05-06
  • About the authors: ZHONG Mingtuo (b. 1994), male, master's student. Research interests: speech recognition, embedded development. | CAI Wenyu (b. 1979), male, Ph.D., associate professor. Research interests: embedded systems.
  • Supported by:
    Zhejiang Provincial Natural Science Foundation of China (LY18F030006)

Abstract:

To improve the recognition rate and robustness of marine mammal sound recognition, this paper proposes a method that fuses Mel-frequency cepstral coefficients (MFCC), linear-frequency cepstral coefficients (LFCC), and temporal features into a single set of feature parameters. Fusing the two kinds of cepstral coefficients strengthens the representation of different frequency bands, and adding temporal features describes the sound more completely. Each continuous marine mammal recording is first preprocessed for the marine environment and segmented into individual syllables. Cepstral coefficients and temporal features are then computed from each syllable, and the fused features are classified with a support vector machine. Whereas traditional algorithms target only one or a few species, this method was tested on a sample database containing the sounds of 61 marine mammal species. The proposed algorithm improved the recognition rate by 5.5% over traditional MFCC features and performed better in low-SNR marine environments.

Key words: feature fusion, marine mammals, sound recognition, support vector machine, cepstral coefficients, temporal features
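
The feature-fusion step described in the abstract can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the frame length, hop size, number of filters, number of cepstral coefficients, the choice of temporal features (energy, zero-crossing rate, duration), and the averaging of coefficients over frames are all assumptions, since the abstract does not specify them. All function names are hypothetical.

```python
import numpy as np


def frame_signal(x, frame_len=512, hop=256):
    """Split a 1-D syllable into overlapping Hamming-windowed frames.

    Assumes len(x) >= frame_len.
    """
    n_frames = 1 + (len(x) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    return x[idx] * np.hamming(frame_len)


def filterbank(n_filters, n_fft, sr, mel=True):
    """Triangular filters spaced on the mel scale (MFCC) or linearly (LFCC)."""
    if mel:
        to_scale = lambda f: 2595.0 * np.log10(1.0 + f / 700.0)
        from_scale = lambda m: 700.0 * (10.0 ** (m / 2595.0) - 1.0)
    else:
        to_scale = from_scale = lambda f: f
    # Filter edge frequencies in Hz, equally spaced on the chosen scale.
    edges = from_scale(np.linspace(to_scale(0.0), to_scale(sr / 2.0), n_filters + 2))
    freqs = np.linspace(0.0, sr / 2.0, n_fft // 2 + 1)
    fb = np.zeros((n_filters, len(freqs)))
    for i in range(n_filters):
        lo, mid, hi = edges[i], edges[i + 1], edges[i + 2]
        rising = (freqs - lo) / (mid - lo)
        falling = (hi - freqs) / (hi - mid)
        fb[i] = np.clip(np.minimum(rising, falling), 0.0, None)
    return fb


def cepstral_coeffs(frames, fb, n_ceps=13):
    """Log filterbank energies followed by a DCT-II, one row per frame."""
    n_fft = 2 * (fb.shape[1] - 1)
    power = np.abs(np.fft.rfft(frames, n=n_fft, axis=1)) ** 2
    log_energy = np.log(power @ fb.T + 1e-10)  # small offset avoids log(0)
    n = fb.shape[0]
    k = np.arange(n_ceps)[:, None]
    m = np.arange(n)[None, :]
    dct = np.cos(np.pi * k * (2 * m + 1) / (2 * n))  # unnormalized DCT-II basis
    return log_energy @ dct.T


def temporal_features(x, sr):
    """Assumed temporal descriptors: mean energy, zero-crossing rate, duration (s)."""
    energy = np.mean(x ** 2)
    zcr = np.mean(np.signbit(x[1:]) != np.signbit(x[:-1]))
    return np.array([energy, zcr, len(x) / sr])


def fused_features(x, sr, n_filters=26, n_ceps=13, frame_len=512, hop=256):
    """Concatenate frame-averaged MFCC, frame-averaged LFCC, and temporal features."""
    frames = frame_signal(x, frame_len, hop)
    mfcc = cepstral_coeffs(frames, filterbank(n_filters, frame_len, sr, mel=True), n_ceps)
    lfcc = cepstral_coeffs(frames, filterbank(n_filters, frame_len, sr, mel=False), n_ceps)
    return np.concatenate([mfcc.mean(axis=0), lfcc.mean(axis=0), temporal_features(x, sr)])
```

With these defaults each syllable yields a 29-dimensional vector (13 MFCC, 13 LFCC, 3 temporal values). The vectors for all syllables would then be stacked and passed to a support vector machine classifier; that step is omitted here, and in practice the features would likely need to be standardized first because their scales differ widely.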

CLC number:

  • TN912.34