Below are the R options and environment variables that are used by the future package and packages enhancing it.
WARNING: Note that the names and the default values of these options may change in future versions of the package. Please use with care until further notice.
Several functions have been moved to the parallelly package:
The options and environment variables controlling those have been adjusted
accordingly to have different prefixes.
For example, option future.fork.enable has been renamed to
parallelly.fork.enable and the corresponding environment variable
R_FUTURE_FORK_ENABLE
has been renamed to
R_PARALLELLY_FORK_ENABLE
.
For backward compatibility reasons, the parallelly package will
support both versions for a long foreseeable time.
See the parallelly::parallelly.options page for the settings.
(character string or future function) Default future strategy plan used unless otherwise specified via plan()
. This will also be the future plan set when calling plan("default")
. If not specified, this option may be set when the future package is loaded if command-line option --parallel=ncores
(short -p ncores
) is specified; if ncores > 1
, then option future.plan is set to multisession
otherwise sequential
(in addition to option mc.cores being set to ncores
, if ncores >= 1
). If system environment variable R_FUTURE_PLAN
is set, then that overrides the future plan set by the command-line option. (Default: sequential
)
(numeric) Maximum allowed total size (in bytes) of global variables identified. Used to prevent too large exports. If set of +Inf
, then the check for large globals is skipped. (Default: 500 * 1024 ^ 2
= 500 MiB)
(character string) Controls whether the identified globals should be scanned for so called references (e.g. external pointers and connections) or not. It is unlikely that another R process ("worker") can use a global that uses a internal reference of the master R process - we call such objects non-exportable globals.
If this option is "error"
, an informative error message is produced if a non-exportable global is detected.
If "warning"
, a warning is produced, but the processing will continue; it is likely that the future will be resolved with a run-time error unless processed in the master R process (e.g. plan(sequential)
and plan(multicore)
).
If "ignore"
, no scan is performed.
If the option is not set, then environment variable R_FUTURE_GLOBALS_ONREFERENCE
is used.
(Default: "ignore"
but may change)
(integer) An integer specifying the maximum recursive depth to which futures should be resolved. If negative, nothing is resolved. If 0
, only the future itself is resolved. If 1
, the future and any of its elements that are futures are resolved, and so on. If +Inf
, infinite search depth is used. (Default: 0
)
(character string) If random numbers are used in futures, then parallel (L'Ecuyer-CMRG) RNG should be used in order to get statistical sound RNGs. The defaults in the future framework assume that no random number generation (RNG) is taken place in the future expression because L'Ecuyer-CMRG RNGs come with an unnecessary overhead if not needed. To protect against mistakes, the future framework attempts to detect when random numbers are used despite L'Ecuyer-CMRG RNGs are not in place. If this is detected, and future.rng.onMisuse = "error"
, then an informative error message is produced. If "warning"
, then a warning message is produced. If "ignore"
, no check is performed.
If the option is not set, then environment variable R_FUTURE_RNG_ONMISUSE
is used. (Default: "warning"
)
(numeric) Maximum waiting time (in seconds) for a free worker before a timeout error is generated. If the option is not set, then environment variable R_FUTURE_WAIT_TIMEOUT
is used. (Default: 30 * 24 * 60 * 60
(= 30 days))
(numeric) Initial interval (in seconds) between polls. If the option is not set, then environment variable R_FUTURE_WAIT_INTERVAL
is used. (Default: 0.2
= 0.2 seconds)
(numeric) Positive scale factor used to increase the interval after each poll. If the option is not set, then environment variable R_FUTURE_WAIT_ALPHA
is used. (Default: 1.01
)
(logical) If TRUE
, extensive debug messages are generated. (Default: FALSE
)
(character vector or a logical) Specifies zero of more future startup scripts to be sourced when the future package is attached. It is only the first existing script that is sourced. If none of the specified files exist, nothing is sourced - there will be neither a warning nor an error.
If this option is not specified, environment variable R_FUTURE_STARTUP_SCRIPT
is considered, where multiple scripts may be separated by either a colon (:
) or a semicolon (;
). If neither is set, or either is set to TRUE
, the default is to look for a .future.R
script in the current directory and then in the user's home directory. To disable future startup scripts, set the option or the environment variable to FALSE
. Importantly, this option is always set to FALSE
if the future package is loaded as part of a future expression being evaluated, e.g. in a background process. In order words, they are sourced in the main R process but not in future processes. (Default: TRUE
in main R process and FALSE
in future processes / during future evaluation)
(character vector) Overrides commandArgs()
when the future package is loaded.
(logical) Enable or disable multi-threading while using forked parallel processing. If FALSE
, different multi-thread library settings are overridden such that they run in single-thread mode, which requires that the RhpcBLASctl package is installed. If not specified, this option is set according to environment variable R_FUTURE_FORK_MULTITHREADING_ENABLE
. If TRUE
, or not set (the default), multi-threading is allowed. Parallelization via multi-threaded processing (done in native code by some packages and external libraries) while at the same time using forked (aka "multicore") parallel processing is known to unstable. Note that this is not only true when using plan(multicore)
but also when using, for instance, mclapply()
of the parallel package.
See also parallelly::parallelly.options.
(integer) Either a named list of mandelbrot()
arguments or an integer in {1, 2, 3} specifying a predefined Mandelbrot region. (Default: 1L
)
(integer) Number of rows and columns of tiles. (Default: 3L
)
The following options exists only for troubleshooting purposes and must not be used in production. If used, there is a risk that the results are non-reproducible if processed elsewhere. To lower the risk of them being used by mistake, they are marked as deprecated and will produce warnings if set.
(character string) Action to take when non-existing global variables ("globals" or "unknowns") are identified when the future is created. If "error"
, an error is generated immediately. If "ignore"
, no action is taken and an attempt to evaluate the future expression will be made. The latter is useful when there is a risk for false-positive globals being identified, e.g. when future expression contains non-standard evaluation (NSE). (Default: "ignore"
)
(character string) Method used to identify globals. For details, see globalsOf()
. (Default: "ordered"
)
(logical) If TRUE
, globals that are Future
objects (typically created as explicit futures) will be resolved and have their values (using value()
) collected. Because searching for unresolved futures among globals (including their content) can be expensive, the default is not to do it and instead leave it to the run-time checks that assert proper ownership when resolving futures and collecting their values. (Default: FALSE
)
To set R options when R starts (even before the future package is loaded), see the Startup help page. The startup package provides a friendly mechanism for configurating R's startup process.
# NOT RUN {
# Set an R option:
options(future.rng.onMisuse = "ignore")
# Set an environment variable:
Sys.setenv(R_FUTURE_RNG_ONMISUSE = "ignore")
# }
Run the code above in your browser using DataLab