Sparsemax activation module.
nn_contrib_sparsemax(dim = -1)
dim: The dimension over which to apply the sparsemax function. Defaults to -1.
The sparsemax activation is described in 'From Softmax to Sparsemax: A Sparse Model of Attention and Multi-Label Classification'. The implementation is based on aced125/sparsemax.
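
For orientation, the function defined in the cited paper can be summarized as below. This is a sketch of the mathematical definition only, not a description of how this module computes it internally. The symbols (z for the input scores along dim, K for its length, p for the output, tau for the threshold) are notation assumed here for illustration.

```latex
% Sparsemax: Euclidean projection of the scores z onto the probability simplex
\operatorname{sparsemax}(z) = \operatorname*{arg\,min}_{p \in \Delta^{K-1}} \lVert p - z \rVert_2^2,
\qquad
\Delta^{K-1} = \{\, p \in \mathbb{R}^K : \mathbf{1}^\top p = 1,\ p \ge 0 \,\}

% Closed form: subtract a threshold and clip at zero
\operatorname{sparsemax}_i(z) = [\, z_i - \tau(z) \,]_+,
\qquad
\text{where } \tau(z) \text{ is chosen so that } \sum_{i=1}^{K} [\, z_i - \tau(z) \,]_+ = 1

% Worked example with z = (1.0, 0.5, -1.0):
% the two largest entries stay in the support, so
\tau(z) = \frac{(1.0 + 0.5) - 1}{2} = 0.25,
\qquad
\operatorname{sparsemax}(z) = (0.75,\ 0.25,\ 0)
```

Unlike softmax, the output can contain exact zeros, as the third entry of the worked example shows.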