powered by
Calculates a random subset of the data based on repeated values from a specified column.
randomSubset(df, colName, seed = NA)
data frame. Must include a column named by the argument colName.
column name to check for duplicates
integer value. Defaults to NA, which will not change the current seed. Setting the seed to any given value can be used to create repeatable output.
df <- data.frame(Julian = c(1,2,2,3,4,4,4,6), y = 1:8) df df_random <- randomSubset(df, "Julian") df_random
Run the code above in your browser using DataLab