Variable importance in sPLS-DA

sophia · March 8, 2021, 7:24am

I’m utilizing sparse SPLS-DA to try to calssify my samples into 2 groups. The error rate is lowest with 1 single component. I have 800 variables, and the tuning results from tune.splsda advises me to utlize 790 of them for the component. My main interest is to decipher which variables are meaningful to classify groups. Is there a good way to decide a cutoff point to choose the variables?

Neystale · March 10, 2021, 4:07am

Hey sophia what is your sample size? Because it sounds like you might be running into a really flat grid search. Perhaps that means you need to try tuning criteria two from “A novel approach for biomarker selection and the integration of repeated measures experiments from two assays”.

sophia · March 10, 2021, 7:58am

Hi my sample size is 100. Also the group separation is not very good, my error rate is around 40% even after utilizing sPLS-DA. Will take a look at the paper you mentioned!

Neystale · March 10, 2021, 2:29pm

The tuning criteria two in that paper is for very small sample sizes when cross validation is not practical so I’m not sure useful it would be.

Maybe an alternative method will provide better results, such as elastic net.

christoa · March 11, 2021, 10:49am

Hi @sophia, what does your list.keepX look like? (It seems exaggerated to test 800 variables). Did you use Mfold or leave-one-out cross-validation? If Mfold was used, what is the nrepeat and how many folds did you use?

It would be a little easier to help you, if you would share the perf and tune outputs and the code that you have used.

Christopher

Topic		Replies	Views
Number of variables in final sPLS-DA Analysis	1	89	May 2, 2024
How to select the optimal number of variables for sPLS-DA and comparison with Selbal Analysis	4	690	July 1, 2020
Difference between PLS-DA and sPLS-DA Analysis	3	4056	December 21, 2020
The number of variables selected in a sPLS-DA should be similar? Analysis	5	307	September 20, 2022
Splsda error rate Analysis	1	772	September 7, 2020

Variable importance in sPLS-DA

Related topics