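Chinese description (translated): The data is reduced in dimensionality with PCA and then clustered with k-means to assign each sample a class. The clustered data is plotted, showing both the data points and the centroid positions. The samples are then split into subsets, and the kNN algorithm is used for supervised classification. In testing, this gave fairly good classification results.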
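The steps described above (PCA, k-means, plotting, sample split, kNN) can be sketched roughly as follows. This is only an illustrative sketch using scikit-learn, not the project's actual code: the synthetic `make_blobs` data, the function names `run_pipeline` and `plot_clusters`, and all parameter values (2 components, 3 clusters, 5 neighbours, a 70/30 split) are assumptions chosen for the example.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.decomposition import PCA
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier


def run_pipeline(X, n_components=2, n_clusters=3, n_neighbors=5, seed=0):
    """Hypothetical helper: PCA -> k-means labels -> train/test split -> kNN."""
    # 1. Reduce dimensionality with PCA.
    X_reduced = PCA(n_components=n_components).fit_transform(X)

    # 2. Cluster the reduced data; the cluster index serves as the class label.
    kmeans = KMeans(n_clusters=n_clusters, n_init=10, random_state=seed)
    labels = kmeans.fit_predict(X_reduced)

    # 3. Split the labelled samples into training and test sets.
    X_train, X_test, y_train, y_test = train_test_split(
        X_reduced, labels, test_size=0.3, random_state=seed, stratify=labels
    )

    # 4. Train kNN on the training set and score it on the held-out set.
    knn = KNeighborsClassifier(n_neighbors=n_neighbors).fit(X_train, y_train)
    accuracy = knn.score(X_test, y_test)
    return X_reduced, labels, kmeans.cluster_centers_, accuracy


def plot_clusters(X_reduced, labels, centroids):
    """Scatter the first two components, coloured by cluster, with centroids marked."""
    import matplotlib.pyplot as plt  # imported here so the pipeline runs without it

    plt.scatter(X_reduced[:, 0], X_reduced[:, 1], c=labels, s=15)
    plt.scatter(centroids[:, 0], centroids[:, 1], c="red", marker="x", s=100)
    plt.xlabel("PC 1")
    plt.ylabel("PC 2")
    plt.show()


# Synthetic stand-in data (an assumption; the original dataset is not specified).
X, _ = make_blobs(n_samples=300, n_features=6, centers=3, random_state=0)
X_reduced, labels, centroids, accuracy = run_pipeline(X)
print(f"kNN accuracy on held-out k-means labels: {accuracy:.3f}")
```

Calling `plot_clusters(X_reduced, labels, centroids)` afterwards draws the data points and centroid positions. Note that in this sketch the kNN accuracy is measured against the k-means cluster assignments, so it reflects how well kNN reproduces the clustering rather than agreement with any external ground truth.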
English Description:
After PCA dimensionality reduction, k-means clustering is used to determine each sample's category, and the clustered data is plotted, including the data points and the centroid positions,