The Smirnov similarity measure was presented in (Smirnov, 1968).
The measure assigns high similarity to a match when the matching value occurs rarely and the other values of the variable occur frequently (Boriah et al., 2008).
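
As an illustration of this behaviour, the following is a minimal Python sketch of the per-variable Smirnov similarity. It assumes the form of the measure summarized in Boriah et al. (2008), where N is the number of observations and f(q) is the frequency of category q: a match on x scores 2 + (N - f(x))/f(x) plus the sum of f(q)/(N - f(q)) over all other categories q, and a mismatch between x and y scores the sum of f(q)/(N - f(q)) over categories other than x and y. The function name and the toy data are hypothetical, and the weighting across variables is left out.

```python
from collections import Counter


def smirnov_attribute_similarity(column, x, y):
    """Per-variable Smirnov similarity between categories x and y.

    Sketch under the assumed formula described in the lead-in.
    `column` holds all observed values of one categorical variable;
    x and y are assumed to occur in it, and the variable is assumed
    to have at least two observed categories.
    """
    n = len(column)
    freq = Counter(column)

    def ratio(q):
        # Frequency of q relative to the frequency of everything else.
        return freq[q] / (n - freq[q])

    if x == y:
        # Match: the rarer the matching value, the larger (n - f(x)) / f(x).
        others = sum(ratio(q) for q in freq if q != x)
        return 2 + (n - freq[x]) / freq[x] + others

    # Mismatch: only categories other than x and y contribute.
    return sum(ratio(q) for q in freq if q not in (x, y))


# Toy variable: "a" is frequent, "b" and "c" are rare.
values = ["a", "a", "a", "a", "b", "c"]
rare_match = smirnov_attribute_similarity(values, "b", "b")
frequent_match = smirnov_attribute_similarity(values, "a", "a")
mismatch = smirnov_attribute_similarity(values, "a", "b")
```

In this toy variable a match on the rare value "b" scores higher than a match on the frequent value "a", and both score higher than the mismatch between "a" and "b".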
References
Smirnov E.S. (1968). On exact methods in systematics.
Systematic Zoology, 17(1), 1-13.
Boriah S., Chandola V., Kumar V. (2008). Similarity measures for categorical data: A comparative evaluation.
In: Proceedings of the 8th SIAM International Conference on Data Mining, SIAM, pp. 243-254.