DIABLO: Handling high dimensionality and tuning keepX

Yuqing · March 1, 2022, 8:18am

Thanks for your suggestion. I’ll try to increase nrepeat and see if the final model still suggest 3 components. If so, I will end up having 200~300 proteins selected by 3 components. I tried to tune the keeps parameter using multistages you suggested in another post: I started from {20,30,40,50}, and 50 had the lowest BER, then I tried {50,60,70} and 70 had the lowest BER, so I then tried {70,80,90,100}, and 100 had the lowest BER. Every time the model would gave me the upper limit of the choices. I’m currently at 100 and don’t want to go higher. Any suggestions on it?

Another question is that even if I stay with 100, I will end up with ~300 proteins from 3 components. My aim is to look into the biological mechanism, but I’m afraid 300 would still be too many.

Topic		Replies	Views
Generic questions about DIABLO: perf, keepX and no variable selection Support	5	1511	December 11, 2022
DIABLO data transformation and tuning Analysis	1	539	February 28, 2022
Analytical issues using DIABLO Analysis	2	786	April 13, 2022
Using DIABLO Output for ML Training Analysis	1	118	June 13, 2025
How to deal with varying number of features and high feature correlation in DIABLO? Support	2	232	February 29, 2024

DIABLO: Handling high dimensionality and tuning keepX

Related topics