Help understanding high error rate using PLS-DA

kimanh.lecao · August 26, 2019, 7:01am

Hi Fan,
thank you for using mixOmics!
What the performance plot shows might be a case of overfitting. On the training data it looks fine (plotIndiv), but as soon as you use cross-validation, the PLS-DA model does not generate well. A few tips to improve performance:

consider using sparse PLS-DA to select only the best discriminant metabolites to explain our outcome. It means you need to tune the number of metabolites to select (we provide some examples in our book down vignette to tune sPLS-DA)
also increase the number of repeats to at least 1000 for more accurate estimations when using perf on the splsda object.

Let us know if that helps!

Kim-Anh

Topic		Replies	Views
Help deciding the number of components in PLS-DA Analysis	3	392	June 27, 2024
Error for perf.plsda Analysis	3	1826	September 16, 2021
PLSDA on small sample size, and OPLSDA Analysis	1	591	June 23, 2023
Pls-da classification error rate Analysis	3	1939	June 4, 2020
Splsda difficulties Analysis	3	852	December 21, 2020

Help understanding high error rate using PLS-DA

Related topics