Learn R Programming

misha

The misha package is a toolkit for analysis of genomic data. it implements an efficient data structure for storing genomic data, and provides a set of functions for data extraction, manipulation and analysis.

Installation

You can install the released version of misha from CRAN with:

install.packages("misha")

And the development version from GitHub with:

remotes::install_github("tanaylab/misha")

Usage

See the Genomes vignette for instructions on how to create a misha database for common genomes.

See the user manual for more usage details.

Running scripts from old versions of misha (< 4.2.0)

Starting in misha 4.2.0, the package no longer stores global variables such as ALLGENOME or GROOT. Instead, these variables are stored in a special environment called .misha. This means that scripts written for older versions of misha will no longer work. To run such scripts, either add a prefix of .misha$ to all those variables (.misha$ALLGENOME instead of ALLGENOME), or run the following command before running the script:

ALLGENOME <<- .misha$ALLGENOME
GROOT <<- .misha$GROOT
ALLGENOME <<- .misha$ALLGENOME
GINTERVID <<- .misha$GINTERVID
GITERATOR.INTERVALS <<- .misha$GITERATOR.INTERVALS
GROOT <<- .misha$GROOT
GWD <<- .misha$GWD
GTRACKS <<- .misha$GTRACKS

Copy Link

Version

Install

install.packages('misha')

Monthly Downloads

169

Version

4.3.6

License

MIT + file LICENSE

Issues

Pull Requests

Stars

Forks

Maintainer

Aviezer Lifshitz

Last Published

March 6th, 2025

Functions in misha (4.3.6)

gdb.create_genome

Create and Load a Genome Database
gdir.cwd

Returns the current working directory in Genomic Database
gintervals.2d.all

Returns 2D intervals that cover the whole genome
gdir.create

Creates a new directory in Genomic Database
gintervals.diff

Calculates difference of two intervals sets
gintervals.chrom_sizes

Returns number of intervals per chromosome
gintervals.2d.band_intersect

Intersects two-dimensional intervals with a band
gintervals

Creates a set of 1D intervals
gintervals.exists

Tests for a named intervals set existence
gintervals.quantiles

Calculates quantiles of a track expression for intervals
gintervals.liftover

Converts intervals from another assembly
gintervals.ls

Returns a list of named intervals sets
gintervals.force_range

Limits intervals to chromosomal range
gdb.set_readonly_attrs

Sets read-only track attributes
gintervals.union

Calculates a union of two sets of intervals
gquantiles

Calculates quantiles of a track expression
gextract

Returns evaluated track expression
gintervals.save

Creates a named intervals set
gintervals.is.bigset

Tests for big intervals set
gintervals.import_genes

Imports genes and annotations from files
gintervals.neighbors

Finds neighbors between two sets of intervals
gintervals.mapply

Applies a function to values of track expressions
gdir.rm

Deletes a directory from Genomic Database
grevcomp

Get reverse complement of DNA sequence
gintervals.all

Returns 1D intervals that cover the whole genome
gintervals.summary

Calculates summary statistics of track expression for intervals
gintervals.intersect

Calculates an intersection of two sets of intervals
gtrack.create_dirs

Create directories needed for track creation
gtrack.array.extract

Returns values from 'Array' track
gtrack.array.get_colnames

Returns column names of array track
gsample

Returns samples from the values of track expression
gscreen

Finds intervals that match track expression
gpartition

Partitions the values of track expression
gtrack.var.get

Returns value of a track variable
glookup

Returns values from a lookup table based on track expression
gtrack.create_pwm_energy

Creates a new track from PSSM energy function
gintervals.rbind

Combines several sets of intervals
gtrack.2d.import

Creates a 2D track from tab-delimited file
gtrack.var.ls

Returns a list of track variables for a track
gtrack.2d.import_contacts

Creates a track from a file of inter-genomic contacts
gtrack.array.import

Creates an array track from array tracks or files
gintervals.canonic

Converts intervals to canonic form
gintervals.update

Updates a named intervals set
%>%

Pipe operator
gtrack.array.set_colnames

Sets column names of array track
gintervals.load

Loads a named intervals set
gsegment

Divides track expression into segments
gintervals.rm

Deletes a named intervals set
gtrack.liftover

Imports a track from another assembly
gseq.extract

Returns DNA sequences
gtrack.lookup

Creates a new track from a lookup table based on track expression
gintervals.load_chain

Loads assembly conversion table from a chain file
gtrack.attr.import

Imports track attributes values
gtrack.attr.export

Returns track attributes values
gtrack.attr.get

Returns value of a track attribute
gtrack.create_sparse

Creates a 'Sparse' track from intervals and values
gtrack.ls

Returns a list of track names
giterator.cartesian_grid

Creates a cartesian-grid iterator
giterator.intervals

Returns iterator intervals
gsummary

Calculates summary statistics of track expression
gtrack.attr.set

Assigns value to a track attribute
gtrack.2d.create

Creates a 'Rectangles' track from intervals and values
gtrack.exists

Tests for a track existence
gvtrack.array.slice

Defines rules for a single value calculation of a virtual 'Array' track
gtrack.convert

Converts a track to the most current format
gtrack.modify

Modifies track contents
gvtrack.create

Creates a new virtual track
gtrack.var.rm

Deletes a track variable
gtrack.create

Creates a track from a track expression
gtrack.var.set

Assigns value to a track variable
gvtrack.iterator

Defines modification rules for a one-dimensional iterator in a virtual track
gvtrack.ls

Returns a list of virtual track names
gtrack.import

Creates a track from WIG / BigWig / BedGraph / tab-delimited file
gvtrack.rm

Deletes a virtual track
gwget

Downloads files from FTP server
gtrack.import_mappedseq

Creates a track from a file of mapped sequences
gvtrack.info

Returns the definition of a virtual track
gtrack.import_set

Creates one or more tracks from multiple WIG / BigWig / BedGraph / tab-delimited files on disk or FTP
gvtrack.iterator.2d

Defines modification rules for a two-dimensional iterator in a virtual track
gtrack.info

Returns information about a track
gtrack.rm

Deletes a track
gtrack.smooth

Creates a new track from smoothed values of track expression
gwilcox

Calculates Wilcoxon test on sliding windows over track expression
misha-package

Toolkit for analysis of genomic data
gcis_decay

Calculates distribution of contact distances
gdb.reload

Reloads database from the disk
gbins.summary

Calculates summary statistics of a track expression for bins
gdist

Calculates distribution of track expressions
gintervals.2d

Creates a set of 2D intervals
gcluster.run

Runs R commands on a cluster
.misha

An environment for storing the package global variables
gdb.create

Creates a new Genomic Database
gdb.get_readonly_attrs

Returns a list of read-only track attributes
gdir.cd

Changes current working directory in Genomic Database
gcompute_strands_autocorr

Computes auto-correlation between the strands for a file of mapped sequences
gsetroot

Initializes connection with Genomic Database
gbins.quantiles

Calculates quantiles of a track expression for bins