Hive partitioning embeds field names and values in path segments, such as "/year=2019/month=2/data.parquet".
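To make the path layout concrete, here is a minimal base R sketch of how field names and values can be recovered from such a path. The helper parse_hive_path() is hypothetical and is not part of the arrow package; it assumes that every segment containing "=" is a partition segment and that the file name itself contains no "=".

```r
# Hypothetical helper (not part of arrow): split a Hive-style path
# into a named list of partition values, all kept as character strings.
parse_hive_path <- function(path) {
  segments <- strsplit(path, "/", fixed = TRUE)[[1]]
  # Keep only "key=value" segments; empty pieces and the file name are dropped
  segments <- segments[grepl("=", segments, fixed = TRUE)]
  keys <- sub("=.*$", "", segments)        # text before the first "="
  values <- sub("^[^=]*=", "", segments)   # text after the first "="
  as.list(setNames(values, keys))
}

parts <- parse_hive_path("/year=2019/month=2/data.parquet")
str(parts)
```

The values come back as strings; converting them to the declared data types is a separate step that this sketch leaves out.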
hive_partition(..., null_fallback = NULL, segment_encoding = "uri")
...: named list of data types, passed to schema()
null_fallback: character string to be used in place of missing values (NA or NULL) in partition columns. Default is "__HIVE_DEFAULT_PARTITION__", which is what Hive uses.
segment_encoding: how to decode partition segments after splitting paths. Default is "uri" (URI-decode segments). May also be "none" (leave as-is).
A HivePartitioning, or a HivePartitioningFactory if calling hive_partition() with no arguments.
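The effect of null_fallback and segment_encoding on a single partition value can be sketched in base R as follows. The helper decode_partition_value() is hypothetical and only illustrates the documented behavior; arrow's own implementation is not shown here, and the order in which it applies decoding and the fallback check is an assumption.

```r
# Hypothetical helper (not part of arrow): interpret the value part of one
# "key=value" segment the way the two arguments are documented.
decode_partition_value <- function(value,
                                   null_fallback = "__HIVE_DEFAULT_PARTITION__",
                                   segment_encoding = c("uri", "none")) {
  segment_encoding <- match.arg(segment_encoding)
  if (segment_encoding == "uri") {
    # URI-decode the segment, e.g. "%20" becomes a space
    value <- utils::URLdecode(value)
  }
  # The fallback string stands for a missing value (assumed to be checked
  # after decoding)
  if (identical(value, null_fallback)) NA_character_ else value
}

decoded <- decode_partition_value("New%20York")
raw     <- decode_partition_value("New%20York", segment_encoding = "none")
missing <- decode_partition_value("__HIVE_DEFAULT_PARTITION__")
```

With segment_encoding = "none" the segment is returned exactly as it appears in the path, which matters when values legitimately contain percent signs.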
Because fields are named in the path segments, order of fields passed to hive_partition() does not matter.
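A short base R sketch shows why the order is irrelevant: a field can be looked up by its name wherever it appears in the path. The helper partition_value() is hypothetical, not an arrow function, and assumes the field name contains no regular-expression metacharacters.

```r
# Hypothetical helper (not part of arrow): find one partition field by name.
partition_value <- function(path, field) {
  # Match "field=value" at the start of the path or right after a "/"
  pattern <- paste0("(^|/)", field, "=([^/]*)")
  m <- regmatches(path, regexec(pattern, path))[[1]]
  # m is c(full match, group 1, group 2); empty when the field is absent
  if (length(m) == 0) NA_character_ else m[3]
}

a <- "/year=2019/month=2/data.parquet"
b <- "/month=2/year=2019/data.parquet"

year_a <- partition_value(a, "year")
year_b <- partition_value(b, "year")
```

Both lookups yield the same value because the name, not the position, identifies the field.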
library(arrow)
# Declare the data types of the partition fields named in the path segments
hive_partition(year = int16(), month = int8())