The dataset R Package

The aim of the dataset package is to make tidy datasets easier to release, exchange and reuse. It organizes and formats R data frame objects into well-referenced, well-described, interoperable datasets that are ready for release and reuse. The package pursues this aim in three ways:

  1. Offering a way to make better use of utils::bibentry bibliographic entry objects by extending them with the fields of the Dublin Core and DataCite standards, and by making them detachable from the data. This extension aims to work with a data.frame or an inherited tibble, tsibble or data.table. For more information, see the Bibentry for FAIR datasets vignette.
  2. Extending the haven_labelled class of the tidyverse into the package's defined class, which provides consistently labelled categorical variables with linked (standard) definitions and units of measure.
  3. Offering a new data frame format, dataset_df, that extends tibbles with semantically rich metadata, ready to be shared on open data exchange platforms and in data repositories. This S3 class is aimed at developers, and several packages are in development that provide interoperability with SDMX statistical data exchange platforms, Wikidata, or the EU Open Data portal. Read more in the Create Datasets that are Easy to Share Exchange and Extend vignette.
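
The idea behind the first point, a bibliographic entry that travels with a data frame but can be detached from it, can be sketched with base R alone. This is a minimal illustration and not the package's own implementation: it uses only utils::bibentry() and attr(), and the attribute name "dataset_bibentry", the example values and the author are assumptions made up for this sketch.

```r
# Base R only: bibentry() and person() come from the utils package.
library(utils)

# A small data frame standing in for a tidy dataset (made-up values).
gdp <- data.frame(
  geo  = c("AD", "LI"),
  year = c(2020L, 2020L),
  gdp  = c(2354.8, 5097.0)
)

# Describe the dataset with a bibliographic entry. The "Misc" type has
# no required fields; extra fields such as publisher are stored as given.
citation_entry <- bibentry(
  bibtype   = "Misc",
  title     = "Example GDP Dataset",
  author    = person("Jane", "Doe"),
  year      = "2024",
  publisher = "Example Publisher"
)

# Attach the entry to the data frame as an attribute.
# The attribute name is an assumption chosen for this sketch.
attr(gdp, "dataset_bibentry") <- citation_entry

# Detach it again: keep the metadata in its own object and
# return the data frame to a plain state.
detached_entry <- attr(gdp, "dataset_bibentry")
attr(gdp, "dataset_bibentry") <- NULL

# The detached entry is still a regular bibentry object.
print(detached_entry, style = "text")
```

Keeping the entry in an attribute leaves the columns untouched, so the data frame can be used as usual while the metadata is carried alongside it. The package's dublincore() and datacite() functions, listed in the function index, create bibentry objects with the Dublin Core and DataCite fields.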

Install

install.packages('dataset')

Monthly Downloads: 6,899

Version: 0.3.4

License: GPL (>= 3)

Maintainer: Daniel Antal

Last Published: December 23rd, 2024

Functions in dataset (0.3.4)

publisher: Get/set the Publisher of the object.
var_label: Get/set a variable label
xsd_convert: Convert to XML Schema Definition (XSD) types
var_definition: Get/set a definition for a vector or a dataset
subject: Create/add/retrieve a subject
rights: Get/set the Rights of the object.
var_unit: Get/set a unit of measure
vec_cast_named: From haven
identifier: Get/set the Identifier of the object.
var_namespace: Get/set a namespace of measure
as_numeric: Coerce a defined vector to numeric
dataset_df: Create a new dataset_df object
creator: Get/set the Creator of the object.
dataset_title: Get/set the title of a dataset
description: Get/set the Description of the object.
as_character: Coerce to character vector
defined: Create a semantically well-defined, labelled vector
dataset_to_triples: Dataset to triples (three columns)
dublincore: Add or get Dublin Core metadata
datacite: Create a bibentry object with DataCite metadata fields
iris_dataset: Edgar Anderson's Iris Data
n_triples: Create N-Triples
n_triple: Create an N-Triple
language: Get/set the primary language of the dataset
provenance: Get or update provenance information
publication_year: Get/set the publication_year of the object.
get_bibentry: Get/set the Bibentry of the object.
geolocation: Get/set the Geolocation of the object.
id_to_column: Add identifier to columns