Function arguments are defused into quosures that keep track of the environment of the defused expression.
quo(1 + 1)
#> <quosure>
#> expr: ^1 + 1
#> env: global
You might have noticed that when constants are supplied, the quosure tracks the empty environment instead of the current environmnent.
quos("foo", 1, NULL)
#> <list_of<quosure>>
#>
#> [[1]]
#> <quosure>
#> expr: ^"foo"
#> env: empty
#>
#> [[2]]
#> <quosure>
#> expr: ^1
#> env: empty
#>
#> [[3]]
#> <quosure>
#> expr: ^NULL
#> env: empty
The reason for this has to do with compilation of R code which makes it impossible to consistently capture environments of constants from function arguments. Argument defusing relies on the promise mechanism of R for lazy evaluation of arguments. When functions are compiled and R notices that an argument is constant, it avoids creating a promise since they slow down function evaluation. Instead, the function is directly supplied a naked constant instead of constant wrapped in a promise.
We can observe this optimisation by calling into the C-level findVar()
function to capture promises.
# Return the object bound to `arg` without triggering evaluation of
# promises
f <- function(arg) {
rlang:::find_var(current_env(), sym("arg"))
}# Call `f()` with a symbol or with a constant
g <- function(symbolic) {
if (symbolic) {
f(letters)
} else {
f("foo")
}
}
# Make sure these small functions are compiled
f <- compiler::cmpfun(f)
g <- compiler::cmpfun(g)
When f()
is called with a symbolic argument, we get the promise object created by R.
g(symbolic = TRUE)
#> <promise: 0x7ffd79bac130>
However, supplying a constant to "f"
returns the constant directly.
g(symbolic = FALSE)
#> [1] "foo"
Without a promise, there is no way to figure out the original environment of an argument.
Data-masking APIs in the tidyverse are intentionally designed so that they don't need an environment for constants.
Data-masking APIs should be able to interpret constants. These can arise from normal argument passing as we have seen, or by injection with !!
. There should be no difference between dplyr::mutate(mtcars, var = cyl)
and dplyr::mutate(mtcars, var = !!mtcars$cyl)
.
Data-masking is an evaluation idiom, not an introspective one. The behaviour of data-masking function should not depend on the calling environment when a constant (or a symbol evaluating to a given value) is supplied.