The default population size corresponds to the subset of the Wackypedia corpus from which
the simulation parameters were obtained. This excludes all articles with extreme type-token
statistics (very short, very long, extremely long words, etc.).
Article lengths are sampled from a lognormal distribution which is scaled so that the
central 95% of the values fall into the range specified by the length
argument.
The simulated data are surprising close to the original Wackypedia statistics.