What does the node size in the network() output represent?

l.bel · June 15, 2023, 8:19am

Dear mixOmics community,

I am new to your package and want to congratulate you on this easily accessible bioinformatical tool box!
I ran an rCCA model (N = 38) using the shrinkage method after NIPALS-imputation on a microbial data set (206 variables) and a sample data set (25 variables) - however, I have two questions that I was not able to answer yet:

when running the network() function, the output generates differently sized nodes (annotated as “size” when importing to Cytoscape). What does the node size represent?
I have tested the overall model for signficance by extracting the canonical variates and performaing a spearman calculation via cor.test(), yielding a p-value of < 2.2e-16. Is there a method for testing each canonical correlation for significance? I have tried using the p.asym() function of the CCP package, but this only returns NaNs as p-values.

Thanks a lot for your help in advance!

Best,
Luise

kimanh.lecao · June 15, 2023, 11:31pm

hi @l.bel,

when running the network() function, the output generates differently sized nodes (annotated as “size” when importing to Cytoscape). What does the node size represent?

Nothing it is coming from the iGraph package and the size of the node depends on the length of the variable name.

I have tested the overall model for signficance by extracting the canonical variates and performaing a spearman calculation via cor.test(), yielding a p-value of < 2.2e-16. Is there a method for testing each canonical correlation for significance? I have tried using the p.asym()function of the CCP package, but this only returns NaNs as p-values.

rCCA (assuming you are using a regularisation parameter given than P+Q >> N) will always lead very high correlations no matter the test you use. It’s best to either look at whether the graphical outputs make sense (i.e sample plots - do the sample cluster together according to your biological assumption; variable plots - do the clusters of bacteria and sample make sense?). rCCA also does not perform variable selection so it become very trick to make sense of the results. You should consider a sparse sPLS canonical mode, the outputs are similar, and you can calculate the correlation between variates. You can have variable selection in this method to extract interesting associations between bacteria and samples.

Kim-Anh

l.bel · June 16, 2023, 6:52am

Hi Kim-Anh,
thank you so much for your quick help - i will follow your suggenstion and code an sPLS model
have a great day!
best,
Luise

Topic		Replies	Views
Issues with network() command from an rCCA result	1	36	July 25, 2024
Selection of number of CVs from rCCA for downstream investigation Analysis	1	198	August 18, 2023
Questions on rCCA Functionality in mixOmics Support	1	45	December 11, 2024
Reduce the size of nodes in mixOmics	1	150	October 19, 2023
Error performing rCCA Support	3	783	February 22, 2022

What does the node size in the network() output represent?

Related topics