corpus_data_dir()
will return the data directory (class fs_path
)
where the binary files of a corpus are kept (a directory also known as
'home' directory).
corpus_info_file()
will return the path to the info file for a
corpus (class fs_path
object). If info file does not exist or INFO line
is missing in the registry file, NA
is returned.
corpus_full_name()
will return the full name of the corpus defined
in the registry file.
corpus_p_attributes()
returns a character
vector with the
positional attributes of a corpus.
corpus_s_attributes()
returns a character
vector with the
structural attributes of a corpus.
corpus_properties()
returns a character
vector with the corpus
properties defined in the registry file. If the corpus cannot be located,
NA
is returned.
corpus_property()
returns the value of a corpus property defined
in the registry file, or NA
if the corpus does not exist, is not loaded
of if the property requested is undefined.
corpus_get_registry()
will extract the registry directory with the
registry file defining a corpus from the internal C representation of
loaded corpora. The character
vector that is returned may be > 1 if there
are several corpora with the same id defined in registry files in different
(registry) directories. If the corpus is not found, NA
is returned.