This function tests for duplicate entries in identifiers. Duplicated identifiers can be checked by study segment, or the check can be restricted to selected segments.
util_int_duplicate_ids_dataframe(
  level = c("dataframe"),
  id_vars_list,
  identifier_name_list,
  repetitions,
  meta_data = NULL
)
Returns a list with:
DataframeData: data frame with the results of the quality check for duplicated identifiers.
DataframeTable: data frame with selected results of the duplicated identifiers check, used for the data quality report.
Duplicates: vector with the row indices of duplicated identifiers, if any; otherwise NULL.
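To illustrate what a vector of row indices of duplicated identifiers can look like, here is a minimal base R sketch. It is not the package's implementation: the data frame, the column names `id` and `visit`, and the choice to report the first occurrence together with its repeats are all assumptions made for illustration.

```r
# Hypothetical study data; `id` and `visit` are made-up identifier columns.
study_data <- data.frame(
  id    = c("A1", "A2", "A1", "A3", "A2"),
  visit = c(1, 1, 1, 1, 2),
  value = c(5.1, 4.8, 5.1, 6.0, 4.9),
  stringsAsFactors = FALSE
)
id_vars <- c("id", "visit")

# Flag every row whose identifier combination occurs more than once,
# counting both the first occurrence and its later repeats (an assumption).
ids <- study_data[, id_vars, drop = FALSE]
is_dup <- duplicated(ids) | duplicated(ids, fromLast = TRUE)

# Row indices of duplicated identifiers, or NULL if there are none.
dup_rows <- if (any(is_dup)) which(is_dup) else NULL
```

In this sketch only the combination `id = "A1"`, `visit = 1` repeats, so `dup_rows` holds the positions of those two rows. Whether the real function also reports the first occurrence is not stated here.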
level: character. A character vector indicating whether the assessment should be conducted at the study level (level = "dataframe") or at the segment level (level = "segment").
id_vars_list: list. The id variable names for each segment or data frame.
identifier_name_list: vector. The names of the segments or data frames being assessed.
repetitions: vector. An integer vector indicating the number of allowed repetitions in the id_vars. Currently, no repetitions are supported.
meta_data: data.frame. The data frame that contains metadata attributes of the study data. Mandatory.
Other integrity_indicator_functions: util_int_duplicate_content_dataframe(), util_int_duplicate_content_segment(), util_int_duplicate_ids_segment(), util_int_unexp_records_set_dataframe(), util_int_unexp_records_set_segment()