The Gambaryan similarity measure was presented in (Gambaryan, 1964).
The measure assigns the highest weight to matches where the matching value occurs in about half of the dataset, i.e., when it is neither frequent nor rare; see (Boriah et al., 2008).
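The weighting behaviour described above can be illustrated with a minimal sketch. It assumes the per-variable form reported in (Boriah et al., 2008), in which a match on a value with relative frequency p is scored by the binary entropy of p and a mismatch is scored 0. The function name gambaryan_match_weight and the sample frequencies are hypothetical, chosen only for illustration; how the per-variable scores are combined into an overall similarity is not shown here.

```python
import math


def gambaryan_match_weight(p):
    """Score for a match on a value with relative frequency p (0 <= p <= 1).

    Assumed form: the binary entropy -[p*log2(p) + (1-p)*log2(1-p)].
    The limits p = 0 and p = 1 are treated as 0 to avoid log2(0).
    """
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -(p * math.log2(p) + (1.0 - p) * math.log2(1.0 - p))


# Illustrative frequencies: a rare value, a value seen in half of the
# observations, and a very frequent value.
for p in (0.05, 0.5, 0.95):
    print(p, gambaryan_match_weight(p))
```

Under this assumption the score peaks at p = 0.5 and falls off symmetrically towards both rare and frequent values, which is the behaviour the description refers to.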
References
Gambaryan P. (1964). A mathematical model of taxonomy.
SSR, 17(12), 47-53.
Boriah S., Chandola V., Kumar V. (2008). Similarity measures for categorical data: A comparative evaluation.
In: Proceedings of the 8th SIAM International Conference on Data Mining, SIAM, pp. 243-254.