Principal Components in plsda()

Athalberht · May 16, 2024, 2:19pm

Hi, I’m Alberto and this is my first topic in this forum.

Recently, I’ve discovered mixOmics because I needed to calculate a PLS-DA. But I think I’m doing something wrong.

I used the following command:
df3 <- mixOmics::plsda(df2, df1$disease, ncomp = 2, scale = TRUE)

But in the plot, the first PC explains 2.83% of the variance while the second explains 3.06%. This seems odd to me, because with PCA or PCoA the first component always has the highest value. So I ran a test with the first 100 components. I attach a screenshot with two data frames (only the first 10 rows are shown). The first lists the components in the order they appear in the analysis. The second lists them sorted by their percentage value.

Can someone explain why this happens, and how I can fix it?

Thanks!

kimanh.lecao · May 16, 2024, 9:52pm

Hi @Athalberht,

You are confusing PCA with PLS-DA. PCA aims to maximise the variance of the data, so each successive component explains as much of the remaining variance as possible. PLS-DA instead maximises the covariance between the components and the outcome, so the explained variance, while interpretable (i.e. the outcome can be explained by xx% of the variance in the data), is not what is being maximised and need not decrease from one component to the next.
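To make the difference concrete, here is a minimal base R sketch on hypothetical toy data. It does not call mixOmics and is not its implementation: `pls_scores()` is a simplified single-outcome PLS with deflation, and `share()` is an assumed definition of "explained variance" (the fraction of X's sum of squares reproduced by regressing X on a component's scores). All names (`y`, `alt`, `X`, `share`, `pls_scores`) are made up for illustration.

```r
# Toy data (hypothetical, not from this thread): 8 samples, 2 classes, 2 variables.
y   <- rep(c(-1, 1), each = 4)            # class labels coded -1 / +1
alt <- rep(c(1, -1), times = 4)           # pattern unrelated to the classes
X   <- cbind(signal = y,                  # low variance, separates the classes
             bulk   = 3 * alt + 0.1 * y)  # high variance, barely related to y
# Both columns already have mean zero, so no extra centring is needed.

# Assumed measure: share of X's total sum of squares reproduced by
# regressing X on one score vector.
share <- function(X, score) {
  sum(crossprod(X, score)^2) / (sum(score^2) * sum(X^2))
}

# Simplified PLS for a single outcome: each weight vector points in the
# direction of maximal covariance with y, then X is deflated.
pls_scores <- function(X, y, ncomp) {
  scores <- matrix(0, nrow(X), ncomp)
  for (h in seq_len(ncomp)) {
    w   <- crossprod(X, y)                 # covariance direction with y
    w   <- w / sqrt(sum(w^2))              # unit-norm weights
    t_h <- X %*% w                         # component scores
    p_h <- crossprod(X, t_h) / sum(t_h^2)  # loadings: regress X on the scores
    X   <- X - t_h %*% t(p_h)              # deflate before the next component
    scores[, h] <- t_h
  }
  scores
}

pca_share <- apply(prcomp(X)$x, 2, function(s) share(X, s))
pls_share <- apply(pls_scores(X, y, 2), 2, function(s) share(X, s))

shares <- rbind(PCA = unname(pca_share), PLS = pls_share)
colnames(shares) <- c("comp1", "comp2")
print(round(shares, 3))
```

In this toy setting the PCA row is in decreasing order, while the PLS row is not: the first PLS component follows the low-variance `signal` column because it covaries most with `y`, and the bulk of the variance is left for the second component.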

You can have a look at the mixOmics handbook that gives more details on this.

Kim-Anh
