西安电子科技大学学报

• 研究论文 • 上一篇    下一篇

面向复杂工业大数据的实时特征提取方法

孔宪光;章雄;马洪波;常建涛;牛萌   

  1. (西安电子科技大学 工业大数据技术研究中心,陕西 西安  710071)
  • 收稿日期:2015-08-13 出版日期:2016-10-20 发布日期:2016-12-02
  • 通讯作者: 孔宪光
  • 作者简介:孔宪光(1974-),男,副教授,博士,E-mail: kongxg@vip.sina.com.
  • 基金资助:

    中央高校基本科研业务费大数据群资助项目(BDY231423);国家自然科学基金资助项目(51505357);陕西省国际科技合作与交流计划资助项目(2016-KW-048)

Real time feature extraction method for complex industrial big data

KONG Xianguang;ZHANG Xiong;MA Hongbo;CHANG Jiantao;NIU Meng   

  1. (Industrial Big Data Technology Research Center, Xidian Univ., Xi'an  710071, China)
  • Received:2015-08-13 Online:2016-10-20 Published:2016-12-02
  • Contact: KONG Xianguang

摘要:

工业大数据具有大体量、多源性、连续采样和价值密度低等特点,造成其数据复杂度高、实时性强和异常数据多.而传统的特征提取方法已无法满足复杂工业大数据的实时性要求,同时工业大数据的处理方法不同于基于互联网的数据流处理方法,其对精度要求较高.针对该问题,提出一种鲁棒的增量在线特征提取方法,即鲁棒增量主成分分析,采用滑动窗口动态更新数据,过滤窗口内的异常数据点;然后对窗口内数据进行增量主成分分析,从而满足工业大数据处理的精度及实时性要求.实验结果表明,该方法可有效对数据流进行实时的特征提取,并达到一定的精度要求.

关键词: 工业大数据, 实时性与鲁棒性, 滑动窗口, 主成分分析, 离群点检测, 特征提取

Abstract:

Industrial big data have the traits of big volume, multi-sources, continuous sampling and low value density, which results in high complexity, real-time and high abnormality. Traditional feature extraction methods cannot meet the real-time requirements of complex industrial big data. In addition, the processing method for industrial big data is different from the internet data stream processing method, which has a higher accuracy requirement. Therefore, this paper proposes a robust incremental on-line feature extraction method as the Robust Incremental Principal Component Analysis. It uses the sliding window to update new coming data dynamically and filter the abnormal data in windows, then the incremental principal component analysis is implemented on data in windows in order to meet the accuracy and real-time requirements of industrial big data processing. Experimental results show that the proposed method can effectively extract the data stream in real time with high accuracy.

Key words: industrial big data, real-time and robustness, sliding window, principal component analysis, outlier detection, feature extraction

Baidu
map