learn_graph: Learn principal graph from the reduced dimension space using reversed graph embedding

Description

Monocle3 aims to learn how cells transition through a biological program of gene expression changes in an experiment. Each cell can be viewed as a point in a high-dimensional space, where each dimension describes the expression of a different gene. Identifying the program of gene expression changes is equivalent to learning a trajectory that the cells follow through this space. However, the more dimensions there are in the analysis, the harder the trajectory is to learn. Fortunately, many genes typically co-vary with one another, and so the dimensionality of the data can be reduced with a wide variety of different algorithms. Monocle3 provides two different algorithms for dimensionality reduction via reduce_dimension (UMAP and tSNE). Both take a cell_data_set object and a number of dimensions allowed for the reduced space. You can also provide a model formula indicating some variables (e.g. batch ID or other technical factors) to "subtract" from the data so it doesn't contribute to the trajectory. The function learn_graph is the fourth step in the trajectory building process after preprocess_cds, reduce_dimension, and cluster_cells. After learn_graph, order_cells is typically called.

Usage

learn_graph(
  cds,
  use_partition = TRUE,
  close_loop = TRUE,
  learn_graph_control = NULL,
  verbose = FALSE
)

Arguments

cds

the cell_data_set upon which to perform this operation

use_partition

logical parameter that determines whether to use partitions calculated during cluster_cells and therefore to learn disjoint graph in each partition. When use_partition = FALSE, a single graph is learned across all partitions. Default is TRUE.

close_loop

logical parameter that determines whether or not to perform an additional run of loop closing after estimating the principal graphs to identify potential loop structure in the data space. Default is TRUE.

learn_graph_control

NULL or a list of control parameters to be passed to the reversed graph embedding function. Default is NULL. A list of potential control parameters is provided in details.

verbose

Whether to emit verbose output during graph learning.

Value

an updated cell_data_set object

Optional <code>learn_graph_control</code> parameters

euclidean_distance_ratio:: The maximal ratio between the euclidean distance of two tip nodes in the spanning tree and the maximum distance between any connecting points on the spanning tree allowed to be connected during the loop closure procedure. Default is 1.
geodesic_distance_ratio:: The minimal ratio between the geodesic distance of two tip nodes in the spanning tree and the length of the diameter path on the spanning tree allowed to be connected during the loop closure procedure. (Both euclidean_distance_ratio and geodesic_distance_ratio need to be satisfied to introduce the edge for loop closure). Default is 1/3.
minimal_branch_len:: The minimal length of the diameter path for a branch to be preserved during graph pruning procedure. Default is 10.
orthogonal_proj_tip:: Whether to perform orthogonal projection for cells corresponding to the tip principal points. Default is FALSE.
prune_graph:: Whether or not to perform an additional round of graph pruning to remove small insignificant branches. Default is TRUE.
scale:
ncenter:
rann.k:: Maximum number of nearest neighbors to compute in the reversed graph embedding. Set rann.k=NULL to let learn_graph estimate rann.k. Default is 25.
maxiter:
eps:
L1.gamma:
L1.sigma: