Invoke distinct on a Spark DataFrame
sdf_distinct(x, ..., name)
A Spark DataFrame.
Optional variables to use when determining uniqueness. If there are multiple rows for a given combination of inputs, only the first row will be preserved. If omitted, will use all variables.
A name to assign this table. Passed to [sdf_register()].
Other Spark data frames:
sdf_copy_to()
,
sdf_random_split()
,
sdf_register()
,
sdf_sample()
,
sdf_sort()
,
sdf_weighted_sample()