Firstly, thank you so much for developing such an interesting algorithm. I would like to use DIABLO to integrate proteomics and methylation data. However, prior to doing that, I have a few questions on which I would like to get your expert opinion.
I have preprocessed the datasets individually and taken around 5000 most abundant proteins and M-values for methylation data for 122 samples. Would you consider 122 to be a small sample size? I read a thread previously where you have mentioned a few things to keep in mind while integrating small sample size data, however, if not, I would like to know whether I should combine the data and then divide it into a train and test set or should I first divide it into train and test and then combine the training data together?
Thank you so much in advance and I hope to hear back from you,