If f.min
and f.max
are not specified, but the list is
marked as incomplete (with incomplete=TRUE
), they are
automatically determined from the frequency vector f
(making
the assumption that all types in this frequency range are listed).
Explicit specification of either f.min
or f.max
implies
an incomplete list. In this case, all types outside the specified
range will be deleted from the list. If incomplete=FALSE
is
explicitly given, N
and V
will be determined
automatically from the input data (which is assumed to be complete),
but the resulting type frequency list will still be incomplete.
If you just want to remove types with \(f=0\) without marking the
type frequency list as incomplete, use the option
delete.zeros=TRUE
.
A tfl
object is a data frame with the following variables:
k
integer type ID \(k\)
f
corresponding type frequency \(f_k\)
type
optional: character vector with type
representations used for printing
The data frame always has to be sorted with respect to the k
column (ascending order). If a type
column is present,
rownames are set to the types and can be used for character indexing.
The following attributes are used to store additional information
about the frequency spectrum:
N, V
sample size \(N\) and vocabulary size \(V\)
corresponding to the type frequency list. For a complete list,
these values could easily be determined from the f
variable, but they are essential for an incomplete list.
incomplete
if TRUE
, the type frequency list is
incomplete, i.e. it lists only types in the frequency range given
by f.min
and f.max
f.min
, f.max
range of type frequencies
represented in the list (should be ignored unless the
incomplete
flag is set)
hasTypes
indicates whether or not the type
variable is present