riboflavinv100: Riboflavin Production Data (Top 100 Genes)
Description
This dataset is a subset of the riboflavin production data by Bacillus subtilis, containing \(n = 71\) observations. It includes the response variable (log-transformed riboflavin production rate) and the 100 genes with the largest empirical variances from the original dataset.
Usage
data(riboflavinv100)
Arguments
Format
y
Log-transformed riboflavin production rate (original name: q_RIBFLV). This is a continuous variable indicating the efficiency of riboflavin production by the bacterial strain.
x
A matrix of dimension \(71 \times 100\) containing the logarithm of the expression levels of the 100 genes with the largest empirical variances.
Details
This dataset is derived from the original riboflavin dataset, which contains 4088 gene expressions. The riboflavinV100 dataset is created for ease of reproduction in examples and contains only the 100 genes with the largest empirical variances. It is commonly used in statistical research for high-dimensional data analysis.
# Load the riboflavinv100 datasetdata(riboflavinv100)
# Display the dimensions of the datasetprint(dim(riboflavinv100$x))
print(length(riboflavinv100$y))