Learn R Programming

dbplyr (version 2.4.0)

arrange.tbl_lazy: Arrange rows by column values

Description

This is an method for the dplyr arrange() generic. It generates the ORDER BY clause of the SQL query. It also affects the window_order() of windowed expressions in mutate.tbl_lazy().

Note that ORDER BY clauses can not generally appear in subqueries, which means that you should arrange() as late as possible in your pipelines.

Usage

# S3 method for tbl_lazy
arrange(.data, ..., .by_group = FALSE)

Value

Another tbl_lazy. Use show_query() to see the generated query, and use collect() to execute the query and return data to R.

Arguments

.data

A lazy data frame backed by a database query.

...

<data-masking> Variables, or functions of variables. Use desc() to sort a variable in descending order.

.by_group

If TRUE, will sort first by grouping variable. Applies to grouped data frames only.

Missing values

Unlike R, most databases sorts NA (NULLs) at the front. You can can override this behaviour by explicitly sorting on is.na(x).

Examples

Run this code
library(dplyr, warn.conflicts = FALSE)

db <- memdb_frame(a = c(3, 4, 1, 2), b = c(5, 1, 2, NA))
db %>% arrange(a) %>% show_query()

# Note that NAs are sorted first
db %>% arrange(b)
# override by sorting on is.na() first
db %>% arrange(is.na(b), b)

Run the code above in your browser using DataLab