DIABLO: Error while shutting down parallel unable to terminate some child processes

Yuntao_Yang · January 11, 2021, 4:23pm

Hi,

I am using DIABLO to do a multi-omics analysis using multiple CPUs. I have 14 samples in total. Here is the error message.

$DEG
[1] 14 914

$DMR
[1] 14 677

$Metabolite
[1] 14 54

AA EA
8 6
DEG DMR Metabolite
DEG 0.0 0.5 0.1
DMR 0.5 0.0 0.1
Metabolite 0.1 0.1 0.0
Design matrix has changed to include Y; each block will be
linked to Y.
max.dist centroids.dist mahalanobis.dist
Overall.ER 1 1 1
Overall.BER 1 1 1

You have provided a sequence of keepX of length: 905 for block DEG and 668 for block DMR and 45 for block Metabolite.
This results in 27204300 models being fitted for each component and each nrepeat, this may take some time to run, be patient!
Error in checkForRemoteErrors(val) :
50 nodes produced errors; first error: unable to fork, possible reason: Cannot allocate memory
Calls: tune.block.splsda … clusterApply -> staticClusterApply -> checkForRemoteErrors
Execution halted
Error while shutting down parallel: unable to terminate some child processes

For the DEG dataset, there are 914 differential expression genes.
For the DMR dataset, there are 677 differential methylation regions.
For the Metabolite dataset, there are 54 differential metabolites.

For the KeepX, I set a wide range for all datasets as the demand from my collaborator. This results in 27,204,300 models. To accelerate the process, I ran this script on a server using 100 CPUs. But I got this error. I am not sure how to set up “cpus” and “nrepeat” to run DIABLO in this situation.

Thank you very much!

Best,
Yuntao Yang

aljabadi · January 12, 2021, 12:27am

Hi @Yuntao_Yang ,

Thanks for using mixOmics.

I understand that you wish to test every possible combination of test.keepX but as you can see this will lead to memory exhaustion due to the high number of models to test. I recommend you start with a coarse grid (e.g. c(1:10, seq(15, 95, 5), seq(100, 900, 50) for DEGs) to first find the right neighbourhood with the superior performance. You can then specify a fine grid in that neighbourhood.

Hope it helps,

Al

Yuntao_Yang · January 12, 2021, 5:44pm

Hi AI,

Thank you for your help! I will follow your suggestion.

Best,
Yuntao

Topic		Replies	Views
Perf diablo error Bugs	1	545	June 30, 2020
DIABLO: Handling high dimensionality and tuning keepX Analysis	10	989	December 11, 2022
Generic questions about DIABLO: perf, keepX and no variable selection Support	5	1379	December 11, 2022
Observing sample varialbe in Comp 1 and Comp2 Analysis	3	419	January 31, 2021
Analytical issues using DIABLO Analysis	2	737	April 13, 2022

DIABLO: Error while shutting down parallel unable to terminate some child processes

Related topics