cimDiablo number of genes on the x axis

Seth · July 27, 2021, 9:50pm

Hi mixOmics team!

First thanks for producing such a neat tool. I have a couple of questions - one specific, one general.

First, after producing the cimDiablo plot I notice that there are fewer gene ids than there are branches. Is there a way to get all the gene ids printed or exported so that I can look at it manually? I was able to export a matrix from the cimDiablo object but I don’t know that they are in the same order as the plot (below).

Second, I remain a bit unclear about how one chooses the input subsets (test.keepX) and why and how that corresponds to nrepeat in tune.block.splsda . Can you please point me to guidance on choosing the best or at least sensible test.keepX and when to use different values of nrepeat?

Again, thanks!

aljabadi · July 29, 2021, 1:13am

Hi @Seth,

Thanks for reporting this.

This is an issue with the visualiser that we’re trying to fix (cimDiablo plot size affects variables shown · Issue #142 · mixOmicsTeam/mixOmics · GitHub)

In the meantime, you can simply expand the plot width in RStudio and it will show all the features.

Hope it helps

Al

aljabadi · July 29, 2021, 1:46am

Hi @Seth,

You can now do :

BiocManager::install('aljabadi/mixOmics@cimdiablo-plotdims')

And then save your plot output with a wider width following the examples at cimDiablo plot size affects variables shown · Issue #142 · mixOmicsTeam/mixOmics · GitHub

christoa · July 29, 2021, 8:48am

Hi @Seth,

In addition to what @aljabadi wrote, you can also change the size of the labels (col.cex and row.cex arguments). This often solves the problem for me.

If you are using an updated mixOmics version, the cimDiablo will be saved as list object. The col.names vector herein contains the column names from left to right, and the row.names contains the rownames from bottom to top.

You choose the test.keepX based on your research question. If you looking for a minimal signature of variables to predict an outcome, then you should not set test.keepX too high. If you are interested in retaining alot information for some reason, you can go higher, as long as you are able to interpret the results. Another thing to consider, is how percise you want your results to be? If you want very precise results, you can choose a fine grid (e.g. c(5:50)), but if this is not of vital importance, then you can choose a coarse grid (e.g. c(5:9, seq(10,49,5))). Increasing the number of variables to test, increases the computational time/demand of the tuning step, and so does the number of nrepeats (e.g. how many time should the cross-validation be repeated). If you are looking for very precise and reproducible results, you can increase the nrepeat to above 50, given that you have the computational requirements and/or patience.

Christopher

Seth · August 11, 2021, 12:22am

Thanks for the answers both!

Topic		Replies	Views
[DIABLO]: cimDiablo - export matrix Suggestions for improvement	8	1577	April 15, 2021
Question regarding the similarity matrix Analysis	1	243	July 27, 2023
cimDiablo function Support	1	193	September 7, 2023
Assistance Required with MixOmics cimDiablo Function Error	3	76	July 21, 2024
mixOmics::cimDiablo problems with group annotations Support	1	265	May 10, 2022

cimDiablo number of genes on the x axis

Related topics