Hello, how can we determine test.keepX for DIABLO analysis where I plan to integrate metabolome and microbiome data for 100 adults and would like to have robust results at the end. I read in another thread that when you would like to identify a minimal signature for prediction only, a small grid is obviously preferred (up to 10 or maybe 20), but when you want to interpret biology (e.g. by looking into protein-protein interactions), use a larger grid (up to 50, 100 or even 150 in rare cases). Based on this, do I go with the first option below if I want to interpret biology and also because fine grids are suggested to provide very precise results? Also, is this related to in any way sample size or number of feature in the two omics datase, if so, what would you advise if my sample size is 100 and I have 100 metabolites and 300 microbiome features? Thanks!
test.keepX = list (microbiome = seq(1:100),
metabolome = seq(1:100))
OR
test.keepX ā list (microbiome = c(5:9, seq(10, 18, 2), seq(20,30,5)),
metabolome = c(5:9, seq(10, 18, 2), seq(20,30,5)))