TransWikia.com

How to automatically choose the number of components for PCA?

Cross Validated Asked by foobar on January 6, 2021

For PCA, we can print out the number of components vs % variance explained, like in the following picture:

[Figure: number of components vs. % variance explained]

And as human practitioners, we're typically instructed to choose the number of components at the inflection point (the "elbow") of the curve, where it comes close to explaining all the variance.

Is there an algorithm that looks at the variance explained and automatically chooses where that inflection point should be?

One Answer

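Parallel analysis (PA) is a widely used way to choose the number of components algorithmically. It builds a sampling distribution for each eigenvalue from random data with the same dimensions as yours, then performs a series of hypothesis tests, keeping the components whose observed eigenvalues are larger than chance alone would produce.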

Note that this does not match your framing exactly, because PA works on the eigenvalues rather than on the proportion of variance explained.
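
To make the idea concrete, here is a minimal sketch of parallel analysis in Python with NumPy. It is one common variant, not a reference implementation: it assumes the eigenvalues come from the correlation matrix, that the null data are independent Gaussian noise of the same shape, and that components are kept until the first one that fails to exceed the chosen quantile of the null distribution. The function name `parallel_analysis` and the parameters `n_iter` and `quantile` are made up for illustration, and the demo data are synthetic.

```python
import numpy as np


def parallel_analysis(X, n_iter=200, quantile=0.95, seed=0):
    """Sketch of parallel analysis (hypothetical helper, not a library API).

    Returns (number of components to keep, observed eigenvalues, thresholds).
    """
    rng = np.random.default_rng(seed)
    n, p = X.shape

    # Eigenvalues of the observed correlation matrix, largest first.
    observed = np.linalg.eigvalsh(np.corrcoef(X, rowvar=False))[::-1]

    # Null distribution: eigenvalues of uncorrelated Gaussian noise
    # with the same number of rows and columns as X (an assumption).
    null = np.empty((n_iter, p))
    for i in range(n_iter):
        noise = rng.standard_normal((n, p))
        null[i] = np.linalg.eigvalsh(np.corrcoef(noise, rowvar=False))[::-1]

    # One threshold per eigenvalue position.
    threshold = np.quantile(null, quantile, axis=0)

    # Keep leading components up to the first one that does not beat its threshold.
    exceeds = observed > threshold
    n_components = p if exceeds.all() else int(np.argmin(exceeds))
    return n_components, observed, threshold


# Synthetic demo: 10 variables driven by 2 latent factors plus noise.
rng = np.random.default_rng(42)
latent = rng.standard_normal((500, 2))
loadings = np.zeros((2, 10))
loadings[0, :5] = 1.0   # variables 0-4 load on the first factor
loadings[1, 5:] = 1.0   # variables 5-9 load on the second factor
X = latent @ loadings + 0.5 * rng.standard_normal((500, 10))

k, observed, threshold = parallel_analysis(X)
print("components retained:", k)
```

Raising `quantile` makes the test stricter. Using the mean of the null eigenvalues instead of a quantile is another variant you will see, and it tends to retain more components.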

Answered by LambdaPsi on January 6, 2021
