Hello,
I have used the PLSDA-batch package to perform batch correction on two datasets that were sequenced using different Illumina chemistries. After running the pipeline, the final output is a matrix with CLR-transformed and batch-corrected data.
My understanding is that for differential abundance analysis using packages like ALDEx and ANCOM (which seem to be the most recommended), I would typically need a count table. Given that my data is now CLR-transformed, I’m unsure how to proceed.
Is there a way to transform the CLR data back to positive integer counts? Should that be feasible, is such a transformation even recommended for this type of analysis?
What would be the best approach for performing differential abundance analysis on this dataset? Should I consider using the mixOmics package to identify discriminative variables between treatments, as it might be a suitable alternative to ALDEx/ANCOM in this context?
How could I visualize compositional data and alpha diversity using this batch-corrected CLR-transformed data? I noticed that in the paper that introduced me to PLSDA-batch, the authors did not use the batch-corrected data for compositional plots or alpha diversity calculations but rather the data prior to correction. Would you recommend a similar approach?
I am sorry if my question is outside the scope of the forum, but I have been struggling with those issues for some time now. So, any guidance or suggestions will be greatly appreciated!
Best regards,
Adriana