CCA analysis (cross validation vs shrinkage method)

thkapell · August 11, 2025, 2:27pm

Hi all,

I wish to N-integrate two datasets from the same human patients; host transcriptome data (single-cell data aggregate to cluster level) and microbiome data (16S rRNA) to understand how the host affects the microbiome and vice versa along a numerical variable of disease progression. I am using the CCA method and wanted to ask if there are any guidelines as to whether one should use the regularized as opposed to the shrinkage method. My ultimate goal would be to validate these interactions experimentally with intervention studies. My cohort has 13 samples so it is on the low end of size. Apologies if this basic question has been answered elsewhere.

kimanh.lecao · October 23, 2025, 11:55pm

Hi @thkapell,

If you focus is on validating potential interactions experimentally, I would advise you use sPLS rather than CCA, as it will allow for variable selection. Have a look at our website for examples, and also screen previous post on how you should pre-filter the data beforehand to perhaps 5k variables per data set max since your number of samples is quite small.

Kim-Anh

Topic		Replies	Views
Question concerning rCCA analysis	5	1315	July 2, 2020
Can I use rCCA to analyze my data? Analysis	1	86	October 24, 2025
Selection of number of CVs from rCCA for downstream investigation Analysis	1	217	August 18, 2023
Extracting Highly Correlated Genes from rCCA or sPLS Objects Analysis	1	52	December 11, 2024
Sparce CCA paired DATA Support	1	399	March 29, 2021

CCA analysis (cross validation vs shrinkage method)

Related topics