Reactor voiceprint clustering method based on deep adaptive K-means++ algorithm
DOI:10.19783/j.cnki.pspc.240502
Key Words:750 kV reactor  voiceprint clustering  adaptive clustering algorithm  sparse autoencoder  DAKCA
Author NameAffiliation
MIN Yongzhi1 1. School of Automation and Electrical Engineering, Lanzhou Jiaotong University, Lanzhou 730070, China
2. School of Electrical Engineering and Automation, Wuhan University, Wuhan 430072, China 
HAO Dayu1 1. School of Automation and Electrical Engineering, Lanzhou Jiaotong University, Lanzhou 730070, China
2. School of Electrical Engineering and Automation, Wuhan University, Wuhan 430072, China 
WANG Guo1 1. School of Automation and Electrical Engineering, Lanzhou Jiaotong University, Lanzhou 730070, China
2. School of Electrical Engineering and Automation, Wuhan University, Wuhan 430072, China 
HE Yigang2 1. School of Automation and Electrical Engineering, Lanzhou Jiaotong University, Lanzhou 730070, China
2. School of Electrical Engineering and Automation, Wuhan University, Wuhan 430072, China 
HE Jianshan1 1. School of Automation and Electrical Engineering, Lanzhou Jiaotong University, Lanzhou 730070, China
2. School of Electrical Engineering and Automation, Wuhan University, Wuhan 430072, China 
Hits: 1593
Download times: 252
Abstract:In high-voltage shunt reactor voiceprint signal monitoring systems, the high-dimensional non-stationarity of long-term, large-scale unlabeled voiceprint data make feature extraction difficult and reduce the adaptability of unsupervised clustering. To address this, a 750 kV reactor voiceprint clustering method based on deep adaptive K-means++ clustering algorithm (DAKCA) is proposed. First, the improved stacked sparse autoencoder (SSAE), fine-tuned using a two-stage unsupervised strategy, is used to extract the 32-dimensional depth features from the normalized frequency domain data obtained via fast Fourier transform. Then, an adaptive K-means++ clustering algorithm is developed using clustering validation index based on the nearest neighbor (CVNN), and a reactor voiceprint clustering model which can adaptively determine the optimal number of clusters is constructed. Finally, the method is validated using real measured voiceprint data from a 750 kV reactor in Northwest China. The results demonstrate that the DAKCA algorithm can stably extract 32-dimensional depth features from unlabeled voiceprint data under varying sample balance conditions and achieve optimal clustering, providing a reference for the direct and efficient use of unlabeled reactor voiceprint data.
View Full Text  View/Add Comment  Download reader