Hi,
I am trying to integrate 5 data sets. I have filtered the data frames to matching samples across all 5 datasets and made a list. I am able to run, block.splsda(X = data, Y = Y, ncomp = 5, design = design) with no issues and get the following output:
Call:
block.splsda(X = data, Y = Y, ncomp = 5, design = design)
sGCCA with 5 components on block 1 named RNAseq
sGCCA with 5 components on block 2 named Flow
sGCCA with 5 components on block 3 named Lipoproteins
sGCCA with 5 components on block 4 named Small_metabolites
sGCCA with 5 components on block 5 named Amino_acids
sGCCA with 5 components on the outcome Y
Dimension of block 1 is 259 14736
Dimension of block 2 is 259 26
Dimension of block 3 is 259 100
Dimension of block 4 is 259 41
Dimension of block 5 is 259 37
Outcome Y has 19 levels
Selection of 14736 14736 14736 14736 14736 variables on each of the sGCCA components on the block 1
Selection of 26 26 26 26 26 variables on each of the sGCCA components on the block 2
Selection of 100 100 100 100 100 variables on each of the sGCCA components on the block 3
Selection of 40 40 40 40 40 variables on each of the sGCCA components on the block 4
Selection of 37 37 37 37 37 variables on each of the sGCCA components on the block 5
Main numerical outputs:
loading vectors: see object$loadings
variates: see object$variates
variable names: see object$names
Functions to visualise samples:
plotIndiv, plotArrow, cimDiablo, plotDiablo
Functions to visualise variables:
plotVar, plotLoadings, network, circosPlot
Other functions:
selectVar, perf, auc
perf.diablo = perf(sgccda.res, validation = ‘Mfold’, folds = 10, nrepeat = 10)
However when I run:
perf.diablo = perf(sgccda.res, validation = ‘Mfold’, folds = 10, nrepeat = 10)
I get the following error:
Error in Check.entry.single(newdata[[q]], ncomp[q], q = q) : samples should have a unique identifier/rowname
There are no duplicate rownames and all my rownames match
Thank you