Learn R Programming

modeldata (version 1.3.0)

hepatic_injury_qsar: Predicting hepatic injury from chemical information

Description

A quantitative structure-activity relationship (QSAR) data set to predict when a molecule has risk associated with liver function.

Arguments

Value

hepatic_injury_qsar

a tibble

Details

This data set was used to develop a model for predicting compounds' probability of causing hepatic injury (i.e. liver damage). This data set consisted of 281 unique compounds; 376 predictors were measured or computed for each. The response was categorical (either "none", "mild", or "severe"), and was highly unbalanced.

This kind of response often occurs in pharmaceutical data because companies steer away from creating molecules that have undesirable characteristics. Therefore, well-behaved molecules often greatly outnumber undesirable molecules. The predictors consisted of measurements from 184 biological screens and 192 chemical feature predictors. The biological predictors represent activity for each screen and take values between 0 and 10 with a mode of 4. The chemical feature predictors represent counts of important sub-structures as well as measures of physical properties that are thought to be associated with hepatic injury.

Columns:

  • class: ordered and factor (levels: 'none', 'mild', and 'severe')

  • bio_assay_001 - bio_assay_184: numeric

  • chem_fp_001 - chem_fp_192: numeric

Examples

Run this code
data(hepatic_injury_qsar)
str(hepatic_injury_qsar)

Run the code above in your browser using DataLab