应用错误收集

我一直在阅读sklearn中的PCA，特别是功能和组件之间的关系。我特别想确定相对于几个主要组件的功能重要性。但是，我发现一些帖子说的不同。

例如，在this post中的3个答案中，讨论了特征向量和载荷。特别地，pca.components_.T * np.sqrt(pca.explained_variance_)是mentioned显示功能部件的组件加载。为什么在这里使用sqrt？以及为什么要使用该产品？

但是，在this answer中，abs(pca.components_)指出了每个组件中功能的重要性。这似乎与上面指出的相矛盾，是吗？ This blog post还指示pca.components_是每个功能的组件加载。

此外，我想了解this如何回答问题："I think what you call the "loadings" is the result of the projection for each sample into the vector space spanned by the components. Those can be obtained by calling pca.transform(X_train) after calling pca.fit(X_train)."，但这是不正确的：负载与主要组件上每个功能的系数有关，而不是样品。同意？

在此非常感谢您进行澄清。

使用Sklearn阐明“负载”的含义

0 个答案: