datE.Admixture
provides the expression values from a mixture of cell types (admixed
population) and you want to estimate the proportion of each pure cell type in the mixed samples (rows of
datE.Admixture
). The function allows you to do this as long as you provide a data frame
MarkerMeansPure
that reports the mean expression values of markers in each of the pure cell types.proportionsInAdmixture(
MarkerMeansPure,
datE.Admixture,
calculateConditionNumber = FALSE,
coefToProportion = TRUE)
datE.Admixture
could
correspond to thousands of genes. The rows of datE.Admixture
correspond to the admixed samples for
which the function estimates the proportions of pukappa
function to calculates the condition number of the matrix MarkerMeansPure[,-1]
. This allows one to
determine whether the linear model for estimating the proportPredictedProportions
correspond to the admixed samples, i.e. the rows of datE.Admixture
. The columns of PredictedProportions
correspond to the pure populations, i.e. the columns of MarkerMeansPure[,-1].
PredictedProportions
. In general, datCoef
will only be different from PredictedProportions
if coefToProportion=TRUE
. See the description of coefToProportion
kappa
function. See the description of calculateConditionNumber.MarkerMeansPure
) that match column names of datE.Admixture
and that contain non-missing pure mean values.datE.MarkersAdmixtureTranspose
and MarkerMeansPure
are comparable. When dealing with Affymetrix data: we have successfully used it on untransformed MAS5 data.
For statisticians: To estimate the proportions, we use the coefficients
of a linear model. Specifically:
datCoef= t(lm(datE.MarkersAdmixtureTranspose ~MarkerMeansPure[,-1])$coefficients[-1,])
where datCoef
is a matrix whose rows correspond to the mixed samples (rows of datE.Admixture
) and the columns correspond to pure populations (e.g. cell types), i.e. the columns of MarkerMeansPure[,-1]
.
More details can be found in Abbas et al (2009).lm
, kappa