Integration of 2 data sets with DIABLO

mixOmics_user · April 21, 2020, 1:55am

[from Hao]

I have a question on using DIABLO on transcriptome and metabolome data.

We have gotten a transcriptome data for three conditions years ago and we have currently done a metabolome analysis for the same conditions. I am wondering whether I can use DIABLO for integration of those two omics ? And if not, can those two datasets be integrated to reveal the correlation between genes and metabolites ?

Thanks, we will be very appreciate for receiveing your kindly response.

kimanh.lecao · April 21, 2020, 1:58am

Dear Hao,

We recommend reading these two posts How to link data in DIABLO and Choosing Diablo Design Matrix.

You could consider first using a PLS or sparse PLS method on your two data sets before going to a DIABLO analysis. Also consider applying PLS-DA and sPLS-DA on each data set individually for a discriminant analysis. This will then help you understand the correlation structure (PLS) and discriminative power (PLS-DA) of you data before you move to DIABLO. See the bookdown vignette for some examples.

Kim-Anh

wh960823 · April 21, 2020, 4:06am

Thanks for your response! I have read the two posts and to understand how to use DIABLO method. However, I still have a confusion.

For my datasets, a transcriptome data are in 5 conditions, each in 3 replicates. And the metabolome data for the same 5 conditions, each in 6 replicates. However, although the treatment is the same, the cell used is not the same for two omics. If this could affect on the analysis and could we randomly match the transcriptome sample and metabolome sample in the same condition ? (For example, transcriptome-condition1-rep1 matches metabolome-condition1-rep2)

Or I should do some random sampling on the distribution of the omics data?

wh960823 · April 21, 2020, 4:08am

@kimanh.lecao thanks for your response

kimanh.lecao · April 22, 2020, 12:20am

hi @wh960823,
We do assume that the samples are matching in most of our methods, except MINT. If you are using cells you could violate this assumption but randomly sampling 3 replicates out of the 6 in the metabolome and assess whether the results are similar when you compare with other random samples of 3 (that comparison could be based for example by calculating the correlation between the variates of your PLS models from one subsample to another subsample).

Kim-Anh

Topic		Replies	Views
DIABLO without outcome variables? Analysis	1	48	May 9, 2025
Using DIABLO with unmatched samples in one dataset, ideas? Analysis	3	81	October 24, 2024
How to link data in DIABLO Analysis	1	693	March 22, 2020
DIABLO Exploratory Analysis Analysis	1	756	November 23, 2020
Using DIABLO to integrate multiple metabacording datasets Analysis	2	454	September 6, 2021

Integration of 2 data sets with DIABLO

Related topics