The living atlases store content on hundreds of different fields, and users
often require thousands or millions of records at a time. To reduce time taken
to download data, and limit complexity of the resulting data.frame
, it is
sensible to restrict the fields returned by atlas_occurrences()
.
This function allows easy selection of fields, or commonly-requested groups
of columns, following syntax shared with dplyr::select()
.
galah_select(..., group = c("basic", "event", "assertions"))
zero or more individual column names to include
string
: (optional) name of one or more column groups to
include. Valid options are "basic"
, "event"
and
"assertion"
An object of class data.frame
and galah_select
specifying the name and type of each column to include in the
call to atlas_counts()
or atlas_occurrences()
.
Download occurrence records of Perameles taken in 2001, only returning scientific name and event date
galah_config(email = "your-email@email.com") galah_call() |> galah_identify("perameles")|> galah_filter(year == 2001) |> galah_select(scientificName, eventDate) |> atlas_occurrences()
Download occurrence record of Perameles taken in 2001, returning the "basic" group of columns plus the Basis of Record
galah_call() |> galah_identify("perameles") |> galah_filter(year == 2001) |> galah_select(group = "basic", basisOfRecord) |> atlas_occurrences()
Calling the argument group = "basic"
returns the following columns:
decimalLatitude
decimalLongitude
eventDate
scientificName
taxonConceptID
recordID
dataResourceName
Using group = "event"
returns the following columns:
eventRemarks
eventTime
eventID
eventDate
samplingEffort
samplingProtocol
Using group = "assertions"
returns all quality assertion-related
columns. The list of assertions is shown by search_fields(type = "assertions")
.
search_taxa()
, galah_filter()
and
galah_geolocate()
for other ways to restrict the information returned
by atlas_occurrences()
and related functions; atlas_counts()
for how to get counts by levels of variables returned by galah_select
.