引用本文: | 方晓洁,黄伟琼,叶东华,等.分布式并行FP-growth算法在二次设备缺陷监测中的应用[J].电力系统保护与控制,2021,49(8):160-167.[点击复制] |
FANG Xiaojie,HUANG Weiqiong,YE Donghua,et al.Application of a distributed parallel FP-growth algorithm in secondary device defects monitoring[J].Power System Protection and Control,2021,49(8):160-167[点击复制] |
|
摘要: |
智能变电站设备监控数据存储分散,主站获取设备缺陷特征的难度大,有必要通过分布式数据挖掘的方法发现设备缺陷和信号之间的关系。分布式并行频繁模式树(FP-growth)算法采用Hadoop框架和Mapreduce算法,能够快速有效地发现信号间的强关联关系。针对二次设备的缺陷特征,建立异常模型,提出遥信历史数据准备和清洗方法,滤除复归、抖动等噪声信号,并将字符串数据转换为以关键字为标识的事务数据项集。在此基础上采用分布式并行FP-growth算法挖掘各变电站历史数据库异常信号的频繁项集和强关联关系。应用结果表明,该方法能够有效地发现二次设备的频发异常,找到诱发异常的缺陷,为家族性缺陷的认定提供数据基础。 |
关键词: 分布式并行频繁模式树 数据挖掘 关联规则 频发异常 家族性缺陷 |
DOI:DOI: 10.19783/j.cnki.pspc.200715 |
投稿时间:2020-06-23修订日期:2020-11-11 |
基金项目:国家重点研发计划专项资助“物联网终端评测平台关键技术研究及标准化”(2018YFB21002) |
|
Application of a distributed parallel FP-growth algorithm in secondary device defects monitoring |
FANG Xiaojie,HUANG Weiqiong,YE Donghua,HUANG Yubai |
(Zhangzhou Power Supply Company, State Grid Fujian Electric Power Co., Ltd., Zhangzhou 363000, China) |
Abstract: |
The monitoring data of smart substation devices is stored in a decentralized way. It is difficult to extract equipment defect features, so it is necessary to determine the association rules between device abnormal signals and defects by data distributed mining. Using a Hadoop framework and the MapReduce algorithm, a distributed parallel FP-growth algorithm can quickly and effectively find the strong correlation between signals. Given the defect characteristics of the secondary devices, an abnormal model is established. A method of preparing and cleaning the historical data of remote signals is proposed. This can filter out noise signals such as reset and jitter, and convert the string data into the key words data item set. The distributed parallel FP-growth algorithm is used to mine the frequent item sets and find strong correlation of abnormal signals in the historical database of each substation. The application results show that this method can effectively determine frequent abnormalities in the secondary devices and find the defects, providing the data basis for the identification of family defects.
This work is supported by the National Key Research and Development Program of China “Key Technologies Research and Standardization of IoT Terminal Evaluation Platform” (No. 2018YFB21002). |
Key words: distributed parallel FP-growth data mining association rules frequent abnormality family defect |