resample
. Using multinomial distribution, compute the number of occurences for each cloneset, than remove zero-number clonotypes and
return resulting data frame. Probabilities for rmultinom
for each cloneset is a percentage of this cloneset in
the .col
column. It's a some sort of simulation of how clonotypes are chosen from the organisms. For now it's not working
very well, so use downsample
instead.
downsample
. Choose .n
clones (not clonotypes!) from the input repertoires without any probabilistic simulation, but
exactly computing each choosed clones. Its output is same as for resample
(repertoires), but is more consistent and
biologically pleasant.
prop.sample
. Choose the first N clonotypes which occupies .perc
percents of overall UMIs / reads.