estim_delta: Estimates delta for Iterated Stable Autoencoder

Description

This function uses cross-validation to estimate delta for the Iterated Stable Autoencoder when considering Binomial noise. delta is the probability of deletion of each cell of the data matrix

Usage

estim_delta(X, delta = seq(0.1, 0.9, length.out = 9), nbsim = 10,
  noise = "Binomial", transformation = c("None", "CA"), pNA = 0.1,
  maxiter = 1000, threshold = 1e-08)

Arguments

a data frame or a matrix with count

delta

vector, a sequence of values for the probability of deletion of each cell of the data matrix

nbsim

number of times that pNA values are inserted and predicted in the data

noise

noise model assumed for the data. By default and only available "Binomial"

transformation

estimates a transformation of the original matrix; currently, only correspondence analysis CA is available

pNA

percentage of missing values added in the data set

maxiter

integer, maximum number of iterations of the iterative imputation algorithm

threshold

for assessing convergence of the iterative imputation algorithm (difference between two successive iterations)

Value

msep, matrix with the MSEP obtained for each simulation and each value of delta

delta, value giving in average the smallest MSEP over the nbsim simulations

Details

For each value delta, repeated learning cross-validation consists in inserting pNA percentage of missing values in the data set and predicting them with the Iterative Stable Autoencoder. More precisely, the prediction is obtained using the iterative imputation algorithm (imputecount) which alternates steps of imputation of the missing entries and estimation of the low-rank signal. This process is repeated nbsim times for all the deltas. The mean squared error of prediction is kept for each simulation and value of delta. The value of delta leading to the smallest MSEP on average over the simulations is given.

Examples

Run this code

# NOT RUN {
 # A regularized Correspondence Analysis 
 
# }
# NOT RUN {
library(FactoMineR)
 perfume <-  read.table("http://factominer.free.fr/docs/perfume.txt",header=TRUE,
 sep="\t",row.names=1)
 rownames(perfume)[4] <- "Cinema"
 
 isa.delt <- estim_delta(perfume, nbsim = 10, transformation = "CA")
 
 isa.ca <- ISA(perfume, delta = isa.delt$delta, noise = "Binomial", transformation = "CA")
 rownames(isa.ca$mu.hat) <- rownames(perfume)
 colnames(isa.ca$mu.hat) <- colnames(perfume)
 res.isa.ca <- CA(isa.ca$mu.hat, graph = FALSE)
 plot(res.isa.ca, title = "Regularized CA", cex = 0.6, selectCol = "contrib 20")
# }