Cite this article: | XU Yonggang, ZHANG Jianye, GONG Xiaogang, et al. A method of real-time traffic classification in secure access of the power enterprise based on improved random forest algorithm[J]. Power System Protection and Control, 2016, 44(24): 82-89. |
|
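Abstract: |
To classify the growing volume of real-time traffic in the secure access process of power business systems more effectively, and to speed up business processing in the power system, a real-time traffic classification method for power business based on an improved random forest algorithm is proposed. Building on an analysis of the characteristics of real-time traffic in secure access of power business, the traditional random forest algorithm is improved in two ways: the forest is pruned using classification-margin weighting to improve real-time performance, and data editing is applied to new sample data to improve classification accuracy. A real-time traffic classification process for secure access of power business is then designed on top of the improved algorithm. Finally, the accuracy and real-time performance of the proposed method are validated using the secure-access real-time traffic classification of a provincial power company as a case study. |
Keywords: random forest; data editing; classification margin; power business; traffic classification |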
DOI:10.7667/PSPC152144 |
Received: 2015-12-09; Revised: 2016-02-05 |
Funding: |
|
A method of real-time traffic classification in secure access of the power enterprise based on improved random forest algorithm |
XU Yonggang,ZHANG Jianye,GONG Xiaogang,JIANG Ke,ZHOU Huan,YIN Jiying |
(Beijing China Power Information Technology Co., Ltd., Beijing 100085, China; State Grid Xinjiang Electric Power Co., Wulumuqi 830018, China; State Grid Zhejiang Electric Power Co., Hangzhou 310007, China; School of Control and Computer Engineering, North China Electric Power University, Beijing 102206, China; State Development Investment Co., Beijing 100034, China)
Abstract: |
This paper aims to classify the growing volume of real-time traffic in the secure access process of the power business system more effectively and to improve the business processing speed of the power system. A real-time traffic classification method for power business based on an improved random forest algorithm is proposed. Based on an analysis of the characteristics of real-time traffic in secure access of the power business, the traditional random forest algorithm is improved: the random forest is pruned based on margin weighting to improve the real-time performance of classification, and data editing is applied to new sample data to improve classification accuracy. Based on this improved algorithm, a process for real-time traffic classification in secure access of the power business is designed. Finally, a case study of real-time traffic classification in secure access at a provincial power company is used to validate the feasibility and efficiency of the proposed method.
Key words: random forests; data editing; classification margin; power business; traffic classification
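
The abstract names two ideas, margin-weighted pruning of the forest and data editing of new samples, without giving formulas. The sketch below is only one plausible reading of each, not the authors' algorithm. It assumes that each tree's predictions on a validation set are already available, that a tree's score is its accuracy weighted toward low-margin (hard) samples, and that data editing means dropping a new sample whose label disagrees with the majority of its k nearest reference samples. The function names `ensemble_margins`, `prune_by_margin`, and `edit_new_samples` are hypothetical.

```python
import math
from collections import Counter


def ensemble_margins(tree_preds, labels):
    """Per-sample margin: (votes for true class - most votes for one wrong class) / n_trees."""
    n_trees = len(tree_preds)
    margins = []
    for i, y in enumerate(labels):
        votes = Counter(preds[i] for preds in tree_preds)
        true_votes = votes.get(y, 0)
        wrong_votes = max((c for cls, c in votes.items() if cls != y), default=0)
        margins.append((true_votes - wrong_votes) / n_trees)
    return margins


def prune_by_margin(tree_preds, labels, keep):
    """Return sorted indices of the `keep` trees with the best margin-weighted accuracy.

    Assumption: a sample with ensemble margin m gets weight 1 - m, so samples
    the forest finds hard count more toward a tree's score.
    """
    margins = ensemble_margins(tree_preds, labels)
    weights = [1.0 - m for m in margins]
    total = sum(weights) or 1.0
    scored = []
    for t, preds in enumerate(tree_preds):
        score = sum(w for p, y, w in zip(preds, labels, weights) if p == y) / total
        scored.append((score, t))
    # Highest score first; ties go to the lower tree index.
    scored.sort(key=lambda st: (-st[0], st[1]))
    return sorted(t for _, t in scored[:keep])


def edit_new_samples(reference, new_samples, k=3):
    """Keep a new (features, label) pair only if its label matches the
    majority label among its k nearest reference samples (Euclidean distance)."""
    kept = []
    for x, y in new_samples:
        neighbours = sorted(reference, key=lambda r: math.dist(r[0], x))[:k]
        majority = Counter(lbl for _, lbl in neighbours).most_common(1)[0][0]
        if majority == y:
            kept.append((x, y))
    return kept
```

A smaller forest means fewer trees to evaluate per flow, which is where the real-time gain claimed in the abstract would come from. Editing removes new samples that look mislabeled before they are used for training.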