This function computes the spectral information divergence/dissimilarity between spectra based on the kullback-leibler divergence algorithm (see details).
sid(Xr, Xu = NULL,
mode = "density",
center = FALSE, scale = FALSE,
kernel = "gaussian",
n = if(mode == "density") round(0.5 * ncol(Xr)),
bw = "nrd0",
reg = 1e-04,
...)
a list
with the following components:
sid
: if only "X"
is specified (i.e. Xu = NULL
),
a square symmetric matrix of SID distances between all the components in
"X"
. If both "X"
and "Xu"
are specified, a matrix
of SID distances between the components in "X"
and the components
in "Xu"
) where the rows represent the objects in "X"
and the
columns represent the objects in "Xu"
Xr
: the (centered and/or scaled if specified) spectral
X
matrix
Xu
: the (centered and/or scaled if specified) spectral
Xu
matrix
densityDisXr
: if mode = "density"
, the computed
density distributions of Xr
densityDisXu
: if mode = "density"
, the computed
density distributions of Xu
a matrix containing the spectral (reference) data.
an optional matrix containing the spectral data of a second set of observations.
the method to be used for computing the spectral information
divergence. Options are "density"
(default) for computing the divergence
values on the density distributions of the spectral observations, and
"feature"
for computing the divergence vales on the spectral variables.
See details.
a logical indicating if the computations must be carried out on
the centred X
and Xu
(if specified) matrices. If
mode = "feature"
centring is not carried out since this option does
not accept negative values which are generated after centring the matrices.
Default is FALSE. See details.
a logical indicating if the computations must be carried out on
the variance scaled X
and Xu
(if specified) matrices. Default
is TRUE.
if mode = "density"
a character string indicating the
smoothing kernel to be used. It must be one of "gaussian"
(default),
"rectangular"
, "triangular"
, "epanechnikov"
,
"biweight"
, "cosine"
or "optcosine"
. See the
density
function of the stats
package.
if mode = "density"
a numerical value indicating the number
of equally spaced points at which the density is to be estimated. See the
density
function of the stats
package for further
details. Default is round(0.5 * ncol(X))
.
if mode = "density"
a numerical value indicating the
smoothing kernel bandwidth to be used. Optionally the character string
"nrd0"
can be used, it computes the bandwidth using the bw.nrd0
function of the stats
package (see bw.nrd0
). See the
density
and the bw.nrd0
functions for more
details. By default "nrd0"
is used, in this case the bandwidth is
computed as bw.nrd0(as.vector(X))
, if Xu
is specified the
bandwidth is computed as bw.nrd0(as.vector(rbind(X, Xu)))
.
a numerical value larger than 0 which indicates a regularization parameter. Values (probabilities) below this threshold are replaced by this value for numerical stability. Default is 1e-4.
additional arguments to be passed to the
density
function of the base package.
Leonardo Ramirez-Lopez
This function computes the spectral information divergence (distance)
between spectra.
When mode = "density"
, the function first computes the probability
distribution of each spectrum which result in a matrix of density
distribution estimates. The density distributions of all the observations in
the datasets are compared based on the kullback-leibler divergence algorithm.
When mode = "feature"
, the kullback-leibler divergence between all
the observations is computed directly on the spectral variables.
The spectral information divergence (SID) algorithm (Chang, 2000) uses the
Kullback-Leibler divergence (KLKL) or relative entropy
(Kullback and Leibler, 1951) to account for the vis-NIR information provided
by each spectrum. The SID between two spectra (x_ix_i and
x_jx_j) is computed as follows:
sid(x_i,x_j) = KL(x_i | | x_j) + KL(x_j | | x_i)sid(x_i,x_j) = KL(x_i |\ | x_j) + KL(x_j |\ | x_i)
sid(x_i,x_j) = _l=1^k p_l \ log(p_lq_l) + _l=1^k q_l \ log(q_lp_l)sid(x_i,x_j) = _l=1^k p_l \ log(p_l/q_l) + _l=1^k q_l \ log(q_l/p_l)
where kk represents the number of variables or spectral features, pp and qq are the probability vectors of x_ix_i and x_ix_j respectively which are calculated as:
p = x_i_l=1^k x_i,lp = x_i/_l=1^k x_i,l
q = x_j_l=1^k x_j,lq = x_j1/_l=1^k x_j,l
From the above equations it can be seen that the original SID algorithm
assumes that all the components in the data matrices are nonnegative.
Therefore centering cannot be applied when mode = "feature"
. If a
data matrix with negative values is provided and mode = "feature"
,
the sid
function automatically scales the matrix as follows:
X_s = X-min(X)max(X)-min(X)X_s = X-min(X)/max(X)-min(X)
or
X_s = X-min(X, Xu)max(X, Xu)-min(X, Xu)X_s = X-min(X, Xu)/max(X, Xu)-min(X, Xu)
Xu_s = Xu-min(X, Xu)max(X, Xu)-min(X, Xu)Xu_s = Xu-min(X, Xu)/max(X, Xu)-min(X, Xu)
if Xu
is specified. The 0 values are replaced by a regularization
parameter (reg
argument) for numerical stability.
The default of the sid
function is to compute the SID based on the
density distributions of the spectra (mode = "density"
). For each
spectrum in X
the density distribution is computed using the
density
function of the stats
package.
The 0 values of the estimated density distributions of the spectra are
replaced by a regularization parameter ("reg"
argument) for numerical
stability. Finally the divergence between the computed spectral histogramas
is computed using the SID algorithm. Note that if mode = "density"
,
the sid
function will accept negative values and matrix centering
will be possible.
Chang, C.I. 2000. An information theoretic-based approach to spectral variability, similarity and discriminability for hyperspectral image analysis. IEEE Transactions on Information Theory 46, 1927-1932.
# \donttest{
library(prospectr)
data(NIRsoil)
Xu <- NIRsoil$spc[!as.logical(NIRsoil$train), ]
Yu <- NIRsoil$CEC[!as.logical(NIRsoil$train)]
Yr <- NIRsoil$CEC[as.logical(NIRsoil$train)]
Xr <- NIRsoil$spc[as.logical(NIRsoil$train), ]
Xu <- Xu[!is.na(Yu), ]
Xr <- Xr[!is.na(Yr), ]
# Example 1
# Compute the SID distance between all the observations in Xr
xr_sid <- sid(Xr)
xr_sid
# Example 2
# Compute the SID distance between the observations in Xr and the observations
# in Xu
xr_xu_sid <- sid(Xr, Xu)
xr_xu_sid
# }
Run the code above in your browser using DataLab