⚠️ There's a newer version (0.6.3) of this package.

padr

padr is an R package that assists with preparing time series data. It provides two main functions that quickly get the data into the format you want. When the data are observed at too low a level (for example, individual timestamps when daily totals are needed), thicken adds a column of a higher interval to the data frame, after which the user can apply the appropriate aggregation. When records are missing for time points at which nothing was observed, pad automatically inserts them. A number of fill_ functions then help to fill the resulting missing values.

Usage

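library(padr)
library(dplyr)  # provides %>%, group_by() and summarise()

# Four coffee purchases on three days; nothing was bought on 2016-07-08
coffee <- data.frame(
  time_stamp = as.POSIXct(c(
    '2016-07-07 09:11:21', '2016-07-07 09:46:48',
    '2016-07-09 13:25:17',
    '2016-07-10 10:45:11'
  )),
  amount = c(3.14, 2.98, 4.11, 3.14)
)

coffee %>%
  thicken('day') %>%                              # adds the column time_stamp_day
  dplyr::group_by(time_stamp_day) %>%
  dplyr::summarise(day_amount = sum(amount)) %>%  # one row per observed day
  pad() %>%                                       # inserts the missing 2016-07-08
  fill_by_value(day_amount, value = 0)            # replaces its NA with 0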
## # A tibble: 4 × 2
##   time_stamp_day day_amount
##           <date>      <dbl>
## 1     2016-07-07       6.12
## 2     2016-07-08       0.00
## 3     2016-07-09       4.11
## 4     2016-07-10       3.14
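
To make the three steps in the pipeline above more concrete, here is a rough base R sketch of the same idea: derive a day column, aggregate, build the complete day sequence, and fill the gap. This is only an illustration under simplifying assumptions (a fixed UTC time zone, a daily interval chosen by hand); it is not how padr is implemented, and the names daily, all_days and padded are made up for this example.

```r
# Same data as the coffee example, with the time zone fixed to UTC (an assumption)
coffee <- data.frame(
  time_stamp = as.POSIXct(c(
    "2016-07-07 09:11:21", "2016-07-07 09:46:48",
    "2016-07-09 13:25:17", "2016-07-10 10:45:11"
  ), tz = "UTC"),
  amount = c(3.14, 2.98, 4.11, 3.14)
)

# "Thicken" by hand: add a day-level column next to the timestamp
coffee$time_stamp_day <- as.Date(coffee$time_stamp, tz = "UTC")

# Aggregate to one row per observed day
daily <- aggregate(amount ~ time_stamp_day, data = coffee, FUN = sum)
names(daily)[2] <- "day_amount"

# "Pad" by hand: build every day between the first and last observation,
# then left-join the observed days onto that complete sequence
all_days <- data.frame(
  time_stamp_day = seq(min(daily$time_stamp_day),
                       max(daily$time_stamp_day),
                       by = "day")
)
padded <- merge(all_days, daily, by = "time_stamp_day", all.x = TRUE)

# "Fill" by hand: the inserted day has NA, replace it with 0
padded$day_amount[is.na(padded$day_amount)] <- 0
padded
```

The sketch hard-codes the interval and the column names. thicken and pad take over that bookkeeping in the pipeline above, which is why the padr version is so much shorter.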

More information

See the general introduction Vignette for more examples. The implementation details Vignette describes how padr handles different time zones and daylight saving time.

Install

install.packages('padr')

Monthly Downloads

17,904

Version

0.5.3

License

MIT + file LICENSE

Maintainer

Edwin Thoen

Last Published

September 12th, 2020

Functions in padr (0.5.3)

thicken: Add a variable of a higher interval to a data frame
pad_int: Pad the integer column of a data frame
pad_cust: Pad with a custom spanning
span_time: Wrapper around seq.POSIXct
subset_span: Subset a spanned datetime vector
span_date: Wrapper around seq.Date
span_around: Span an equally spaced vector around a datetime variable
pad: Pad the datetime column of a data frame
thicken_cust: Thicken with a custom spanning
fill_by_prevalent: Fill missing values by the most prevalent nonmissing value
format_interval: Make a period character vector
emergency: Emergency Calls for Montgomery County, PA
center_interval: Shift to the middle of each interval
fill_by_function: Fill missing values by a function of the nonmissings
fill_by_value: Fill missing values by a single value
get_interval: Get the interval of a datetime variable
coffee: Coffee Data Set
closest_weekday: Retrieve the closest given weekday
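
As a small illustration of how a few of the functions listed above fit together, the following sketch pads an already aggregated daily series and fills the gap with the mean of the observed values instead of a fixed value. The data frame daily and its column names are invented for this example, and the interval is passed explicitly so that nothing depends on automatic detection.

```r
library(padr)

# Hypothetical daily totals with one day (2016-07-08) missing
daily <- data.frame(
  day = as.Date(c("2016-07-07", "2016-07-09", "2016-07-10")),
  day_amount = c(6.12, 4.11, 3.14)
)

# Ask padr which interval it detects for the date column
get_interval(daily$day)

# Insert the missing day; its day_amount is NA after padding
padded <- pad(daily, interval = "day")

# Fill that NA with the mean of the nonmissing values
filled <- fill_by_function(padded, day_amount, fun = mean)
filled
```

Swapping fill_by_function for fill_by_value or fill_by_prevalent changes only how the inserted row is filled; the padding step stays the same.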