Mean centers each column of an input data matrix so that it has a mean of zero.
Scales the entire matrix so that the largest absolute value of the centered matrix is equal to unity.
Usage
normalize_input(X)
Value
A numeric matrix of the same dimensions as X, centered by column and scaled so that its largest absolute value is 1.
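The computation described above can be sketched in base R. This is a minimal illustration and not the package's own source; the name normalize_sketch is hypothetical, and the sketch assumes a numeric matrix with at least one non-constant column.

```r
# Hypothetical re-implementation for illustration only.
normalize_sketch <- function(X) {
  X <- as.matrix(X)
  # Subtract each column's mean so that every column has mean zero.
  centered <- sweep(X, 2, colMeans(X), FUN = "-")
  # Divide the whole matrix by its single largest absolute entry.
  centered / max(abs(centered))
}

X <- matrix(c(1, 2, 3, 10, 20, 60), ncol = 2)
Y <- normalize_sketch(X)
colMeans(Y)   # both approximately zero
max(abs(Y))   # 1
```

Because one scaling factor is applied to the whole matrix, the relative distances between observations are preserved. The sketch does not guard against a matrix whose columns are all constant, where the divisor would be zero.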
Arguments
X
matrix; Input data matrix with rows as observations and columns as variables/dimensions.
Author
Aaron Lun
Details
Normalization avoids numerical problems when the coordinates (and thus the distances between observations) are very large.
Distances computed directly on this scale may cause underflow in the probabilities calculated by the t-SNE algorithm.
Rescaling the input values mitigates these problems to some extent.
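The underflow can be illustrated with a simplified Gaussian kernel. This sketch assumes a fixed precision of 1, whereas t-SNE tunes a bandwidth for each point; the helper gauss_affinity is hypothetical, and only the scaling step is shown (no centering).

```r
# Illustrative only: unnormalized Gaussian affinity with precision fixed at 1.
gauss_affinity <- function(a, b) exp(-sum((a - b)^2))

a <- c(0, 0)
b <- c(300, 400)                 # Euclidean distance of 500 from a

# exp(-250000) is far below the smallest representable double, so it is 0.
raw <- gauss_affinity(a, b)

# Rescale both points by the largest absolute coordinate.
s <- max(abs(c(a, b)))
scaled <- gauss_affinity(a / s, b / s)

raw == 0      # TRUE
scaled > 0    # TRUE
```

With the raw coordinates, the affinity is indistinguishable from zero, so any later normalization or logarithm of these values would lose information. After rescaling, the same pair of points yields a usable positive value.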