How does tune.rcc handle NAs

blueskypie · June 29, 2020, 1:42am

While running the following line

tune.rcc(X, Y, grid1 = grid1, grid2 = grid2, validation = "Mfold")

I got the following warning:

Calls: tune.rcc → apply → FUN → Mfold → rcc → explained_variance
Warning in explained_variance(result$Y, result$variates$Y, ncomp) :
NA values put to zero, results will differ from PCA methods
used with NIPALS

does it mean the NAs were filled with zero?

aljabadi · June 29, 2020, 5:08am

Hi @blueskypie,

Thanks for getting in touch regarding your question.

The missing values are replaced by 0 only for calculating the explained variance of the components. Missing values are ignored in the iterative algorithm when deriving the said components. The explained variance calculations centre the data matrices so technically the missing values are replaced by the mean of columns to disregard the missing values in this calculation as well - although the calculated mean may in fact be affected by the missing/unknown values.

That being said, our warning message used to only consider the PCA calculations which has been fixed in the latest development version (it applied to all explained variance calculations where there are missing values).

Hope it helps.

Al

blueskypie · June 29, 2020, 2:12pm

Thank you so much for the quick response! I really appreciate! So “put NAs to zero” actually ignores the missing values in computing the variance since the mean will be zero after centering. Then is it correct to say that missing values are not imputed and actually ignored in tune.rcc?

aljabadi · June 30, 2020, 2:20am

Hi @blueskypie,

This is correct. But it is not due to how the explained variance calculations handle missing values (the explained variance of a component is not what rcc is trying to optimise). Explained variance of component is only calculated as a statistic after the rcc algorithm extracts the components. Basically, the following steps:

i) Extract the component while ignoring missing values
THEN
ii) Calculate the explained variance of the component while ignoring the missing values

Hope it helps.

Al

Topic		Replies	Views
Explained variance PCA plot not monotonic	1	300	April 14, 2021
"not positive definite" error in rcc() Support	3	491	July 2, 2020
PCA bug in the latest devel Bugs	1	628	October 22, 2020
Error performing rCCA Support	3	799	February 22, 2022
plotVar components are not orthogonal with NA values Analysis	1	348	September 12, 2019

How does tune.rcc handle NAs

Related topics