Explained variance of Y in splsda

MaxBladen · October 16, 2022, 11:53pm

After doing some thinking and some reading, I think I’ve determined the cause of the explained variance equaling 1 for the first Y component.

In your scenario, the Y dataframe is a represented by a single variable (0 or 1 for each class). As far as the method is concerned, this is considered its own “block” - in the same way your various X blocks are treated.

Components generated for the X blocks use a combination of all the input features. For example, if you have three features, the loadings for the first component might be 0.3, 0.8 and 0.5. Using these weights in a linear combination of the input features allows us to represent all three features “simultaneously” with the one component.

Now when we try to do the same for the Y block, there is only a single variable to generate a component from. Therefore, it just uses this component as is (sometimes flipping the sign), so the resulting loading will just be 1 (or -1). Hence, when calculating the explained variance, the original Y data and the first Y component are essentially identical, meaning the proportion of explained variance is equal to 1.

When calculating the explained variance for subsequent Y components, the process is a little more complicated and subject to few different requirements. Hence, the second (and further) components are not identical to the Y vector, resulting in an explained variance value lower than 1.

Hope this clarifies things a bit

Topic		Replies	Views
Help with the % variance explained in block splsda (diablo) Analysis	6	526	October 17, 2022
Variance explained in PLS-DA in X and Y Analysis	5	136	December 11, 2024
Explained Variance	3	2008	March 4, 2020
Block PLS: variance explained by components Analysis	2	417	September 12, 2023
Block.plsda explained variance Analysis	0	309	July 20, 2021

Explained variance of Y in splsda

Related topics