sPLSDA - Categorical predictors and dealing with confounders

mattvel · February 11, 2025, 4:50pm

a) Dealing with confounders: I know that one of the risk factors for the disease that I am working on is age. In addition, certain RNA transcripts increase with age. So I will not be sure if I find a top candidate that is increasing in expression as a result of the age or disease. Is there a way to deal with it?

b) Integration of clinical data with RNA-Seq: I have a dataset that contains categorical variables, e.g. diabetics (yes/no), and also continuous variables, e.g. White blood cell count. What is the recommended way to integrate such data with RNA-seq expression data from the same samples?

evahamrud · February 25, 2025, 3:35am

Hello @mattvel,

a) mixOmics models are not able to account for confounding effects such as batch effects at this time. If you find there is a strong age effect in your data which confounds your biological effect of interest (you can test this using mixOmics and putting age as your Y variable), you may need to correct for this upstream of mixOmics. See this post for some suggestions.

b) Mixing categorical and continuous variables is challenging, we recommend separating them into two matrices, and for the categorical variables (like diabetics yes/no) make them into a dummy numeric variable using the function map(). Check out these posts where people have had similar questions answered - here and here. Assuming you run a DIABLO model to integrate your RNA-seq and clinical data with disease as outcome, this should identify the relevant variables (e.g. diabetic) which explain the disease outcome.

Hope that helps!
Cheers,
Eva

Topic		Replies	Views
Continuous response variable Y in DIABLO? Analysis	4	1067	March 2, 2023
Confounding variables in mixOmics DIABLO Analysis	2	157	March 11, 2024
Input categorical variables	3	1355	September 18, 2019
N-integration with smaller datasets (few predictors) Support	3	542	July 4, 2019
Some doubts about the Case Study of DIABLO with Breast TCGA Dataset	7	163	April 18, 2024

sPLSDA - Categorical predictors and dealing with confounders

Related topics