Learn R Programming

The package provides functionality to check variables against a set of basic format rules. This is useful in data migration or data cleaning to quickly check assumptions about data types and content.

See the file in vignettes/index.md for a complete example on how to use.

You can download release versions from the CRAN repository following standard menu options in R or RStudio.

The latest development version can be obtained from this repository in github using:


if (!require('devtools')) install.packages('devtools')
devtools::install_github('c5sire/datacheck')

Copy Link

Version

Install

install.packages('datacheck')

Monthly Downloads

50

Version

1.2.2

License

MIT + file LICENSE

Issues

Pull Requests

Stars

Forks

Maintainer

Last Published

April 18th, 2015

Functions in datacheck (1.2.2)

as.rules

Converts a vector of lines into a rules data frame
runDatacheck

Presents the packages graphical user interface
pkg_version

Get the current version of a package
heatmap_quality

Draws a heatmap based on data quality scores
is.onlyLowers

Tests if a string has only lower case letters
is.datadict.profile

is.datadict.profile
is.withinRange

Tests if a numeric value is between a minimal and maximum value. Serves as convenience function.
heatmap.quality

Draws a heatmap based on data quality scores
scoreSum

Line chart of cumulative sum of rule scores.
is.properName

Tests if string is like a proper name with inital letter in upper case
read_rules

Reads a file containing rules in data dictionary format.
datadict_profile

Create a data quality profile (main function)
has_rule_errors

Quick check if a rule profile on a table has any errors.
datacheck-package

Check a table against a set of constraints or rules defined in R.
score_sum

Line chart of cumulative sum of rule scores.
is_only_lowers

Tests if a string has only lower case letters
prep4rep

Prepares a summary table for display in a 'printed' report.
as_rules

Converts a vector of lines into a rules data frame
run_datacheck

Presents the packages graphical user interface
has.punct

Tests for presence of most common punctuation characters
pkg.version

Get the current version of a package
is_one_of

Tests if a string or 'factor level' is one of a pre-defined set
rule_coverage

Dotchart of rules per variable
is.oneOf

Tests if a string or 'factor level' is one of a pre-defined set
ruleCoverage

Dotchart of rules per variable
short_summary

Produces a tabular summary of descriptive statistics using the 'Hmisc::describe' function from the Hmisc package.
has.ruleErrors

Quick check if a rule profile on a table has any errors.
is_proper_name

Tests if string is like a proper name with inital letter in upper case
read.rules

Reads a file containing rules in data dictionary format.
is_datadict_profile

is.datadict.profile
shortSummary

Produces a tabular summary of descriptive statistics using the 'Hmisc::describe' function from the Hmisc package.
datadict.profile

Create a data quality profile (main function)
has_punct

Tests for presence of most common punctuation characters
is_within_range

Tests if a numeric value is between a minimal and maximum value. Serves as convenience function.