Loads files generated by the sally tool (see
http://www.mlsec.org/sally/) and represents the data as a binary
token/n-gram by document matrix. After loading, statistical tests are
applied to find features that are neither volatile nor
constant. Co-occurring features are grouped to further compact the
data. See system.file("extdata","sallyPreprocessing.py",
package="PRISMA") for a Python script that generates the
corresponding .fsally file from a .sally file, which reduces the
loading time via loadPrismaData considerably.
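
The filtering and grouping steps described above can be illustrated with a minimal base R sketch. It is not the package's implementation: the toy matrix, the feature names and the fixed frequency cut-offs (0.2 and 0.8) are assumptions made for illustration, and the cut-offs merely stand in for the statistical tests that loadPrismaData applies.

```r
# Toy binary feature x document matrix (hypothetical data, not sally output).
m <- matrix(c(1, 1, 1, 1, 1, 1,   # "GET": present in every document (constant)
              1, 0, 1, 0, 1, 0,   # "foo"
              1, 0, 1, 0, 1, 0,   # "bar": same occurrence pattern as "foo"
              0, 0, 0, 1, 0, 0,   # "baz": occurs only once (treated as volatile)
              0, 1, 1, 0, 0, 1),  # "qux"
            nrow = 5, byrow = TRUE,
            dimnames = list(c("GET", "foo", "bar", "baz", "qux"),
                            paste0("doc", 1:6)))

# Fraction of documents that contain each feature.
freq <- rowMeans(m)

# Assumed stand-in for the statistical tests: keep features whose
# document frequency lies strictly between two fixed cut-offs.
keep <- freq > 0.2 & freq < 0.8
filtered <- m[keep, , drop = FALSE]

# Features with identical occurrence patterns always co-occur,
# so they can share a single row.
pattern <- apply(filtered, 1, paste, collapse = "")
groups <- split(rownames(filtered), pattern)

# Keep one representative row per pattern and label it with all members.
compact <- filtered[!duplicated(pattern), , drop = FALSE]
rownames(compact) <- unname(vapply(groups[unique(pattern)], paste,
                                   character(1), collapse = " "))

print(compact)
```

In this sketch, "foo" and "bar" collapse into one row because they occur in exactly the same documents, while "GET" and "baz" are dropped by the frequency cut-offs.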