Distribution coefficients at pH 7.4 (logD7.4) dataset from Wang et, al.
Usage
data(logd1k)
Arguments
Format
A list with 2 components:
x - data frame with 1,000 rows (samples) and 80 columns (predictors)
y - numeric vector of length 1,000 (response)
The first 1000 compounds in the original dataset were selected.
Details
This dataset contains distribution coefficients at pH 7.4 (logD7.4)
for 1,000 compounds, and 80 molecular descriptors computed with RDKit.
References
Jian-Bing Wang, Dong-Sheng Cao, Min-Feng Zhu, Yong-Huan Yun, Nan Xiao,
and Yi-Zeng Liang. "In silico evaluation of logD7.4 and comparison with
other prediction methods."
Journal of Chemometrics 29, no. 7 (2015): 389--398.