CTD: an information-theoretic method to interpret multivariate perturbations in the context of graphical models with applications in metabolomics and transcriptomics
Our novel network-based approach, CTD, “connects the dots” between metabolite perturbations observed in individual metabolomics profiles and a given disease state by calculating how connected those metabolites are in the context of a disease-specific network.
Using CTD in R.
Installation
We are now a CRAN package! Install on R 4.0+ with install.packages("CTD").
Alternatively, particularly if you have an earlier version of R installed, you can install using devtools: require(devtools) install_github(“BRL-BCM/CTD”).
Look at the package Rmd vignette.
Located in /inst/doc/CTD_Lab-Exercise.Rmd. It will take you across all the stages in the analysis pipeline, including:
- Background knowledge graph generation.
- The encoding algorithm: including generating node permutations using a network walker, converting node permutations into bitstrings, and calculating the minimum encoding length between k codewords.
- Calculate the probability of a node subset based on the encoding length.
- Calculate similarity between two node subsets, using a metric based on mutual information.
References
Thistlethwaite L.R., Petrosyan V., Li X., Miller M.J., Elsea S.H., Milosavljevic A. (2021). CTD: an information-theoretic method to interpret multivariate perturbations in the context of graphical models with applications in metabolomics and transcriptomics. Plos Comput Biol, 17(1):e1008550. https://doi.org/10.1371/journal.pcbi.1008550.