|
Reactor voiceprint clustering method based on deep adaptive K-means++ algorithm |
DOI:10.19783/j.cnki.pspc.240502 |
Key Words:750 kV reactor voiceprint clustering adaptive clustering algorithm sparse autoencoder DAKCA |
Author Name | Affiliation | MIN Yongzhi1 | 1. School of Automation and Electrical Engineering, Lanzhou Jiaotong University, Lanzhou 730070, China
2. School of Electrical Engineering and Automation, Wuhan University, Wuhan 430072, China | HAO Dayu1 | 1. School of Automation and Electrical Engineering, Lanzhou Jiaotong University, Lanzhou 730070, China
2. School of Electrical Engineering and Automation, Wuhan University, Wuhan 430072, China | WANG Guo1 | 1. School of Automation and Electrical Engineering, Lanzhou Jiaotong University, Lanzhou 730070, China
2. School of Electrical Engineering and Automation, Wuhan University, Wuhan 430072, China | HE Yigang2 | 1. School of Automation and Electrical Engineering, Lanzhou Jiaotong University, Lanzhou 730070, China
2. School of Electrical Engineering and Automation, Wuhan University, Wuhan 430072, China | HE Jianshan1 | 1. School of Automation and Electrical Engineering, Lanzhou Jiaotong University, Lanzhou 730070, China
2. School of Electrical Engineering and Automation, Wuhan University, Wuhan 430072, China |
|
Hits: 1593 |
Download times: 252 |
Abstract:In high-voltage shunt reactor voiceprint signal monitoring systems, the high-dimensional non-stationarity of long-term, large-scale unlabeled voiceprint data make feature extraction difficult and reduce the adaptability of unsupervised clustering. To address this, a 750 kV reactor voiceprint clustering method based on deep adaptive K-means++ clustering algorithm (DAKCA) is proposed. First, the improved stacked sparse autoencoder (SSAE), fine-tuned using a two-stage unsupervised strategy, is used to extract the 32-dimensional depth features from the normalized frequency domain data obtained via fast Fourier transform. Then, an adaptive K-means++ clustering algorithm is developed using clustering validation index based on the nearest neighbor (CVNN), and a reactor voiceprint clustering model which can adaptively determine the optimal number of clusters is constructed. Finally, the method is validated using real measured voiceprint data from a 750 kV reactor in Northwest China. The results demonstrate that the DAKCA algorithm can stably extract 32-dimensional depth features from unlabeled voiceprint data under varying sample balance conditions and achieve optimal clustering, providing a reference for the direct and efficient use of unlabeled reactor voiceprint data. |
View Full Text View/Add Comment Download reader |
|
|
|