Learn R Programming

GeneBreak R package

Breakpoints of copy number aberrations (CNA) indicate underlying DNA breaks and thereby regions involved in structural variants. The availability of large copy number data cohorts enables to identify genes recurrently affected by structural variants. This package allows you to systematically identify recurrent CNA-associated breakpoint genes. Breakpoint detection and a tailored annotation approach for breakpoint-to-gene mapping are implemented, which takes the gene position in relation to breakpoint interval into account. Furthermore, statistics with multiple testing correction is incorporated to obtain recurrent breakpoint events. The method can be applied both on copy number data obtained from microarrays, such as array Comparative Genomic Hybridization (CGH), and on whole genome sequencing.

Installation

devtools::install_github( "stefvanlieshout/GeneBreak" )

This requires package "devtools":

install.packages("devtools")

Sample workflow

Output from the Copy Number analysis workflows of CGHcall for cgh data and QDNAseq for NGS data can serve as input for the analysis of GeneBreak.

The test-data used in this example contains data from one chromosome for a total of 200 samples.

library( "GeneBreak" )

# read vignette for more explanation about the workflow
vignette( "GeneBreak")

# explore built-in data
data( package="GeneBreak" )

# get more information about built-in data
help( "copynumber.data.chr20" )

# load built-in dataset (object from CGHcall)
data( "copynumber.data.chr20" )

# load built-in gene annotation dataset (hg19 and hg38 are also availabe)
data( "ens.gene.ann.hg18" )

# setup the breakpoint data
breakpoints <- getBreakpoints( data = copynumber.data.chr20 )

# print some information about the object
breakpoints

# take a peek at the data access options
accessOptions( breakpoints )

# optionally filter the data
breakpointsFiltered <- bpFilter( breakpoints )

# add/setup the gene data 
breakpointsFiltered <- addGeneAnnotation( breakpointsFiltered, ens.gene.ann.hg18 )

# perform gene analysis
breakpointGenes <- bpGenes( breakpointsFiltered )

# get recurrent breakpoints
breakpointStatistics <- bpStats( breakpointGenes )

# print object information to screen
breakpointStatistics

# print some information of top 5 recurrently affected genes
head( recurrentGenes( breakpointStatistics ) )
#  A total of 14 recurrent breakpoint genes (at FDR < 0.1)
#           Gene sampleCount featureTotal        pvalue           FDR
# 25468   PCMTD2          64            4 1.350385e-103 8.899035e-101
# 25481 C20orf69          33            3  5.522293e-44  1.819595e-41
# 14751    BFSP1           8            5  3.941447e-07  8.658045e-05
# 16066   ABHD12          10            9  5.756361e-05  9.483605e-03
# 15305 C20orf26           7           18  2.748743e-04  3.622843e-02
# 3493      HAO1           5            5  6.528175e-04  3.961727e-02

# plot breakpoint frequencies of one chromosome
bpPlot( breakpointStatistics, plot.chr=c(20) )

More information or help

Contact us at...

Background

More background...

Copy Link

Version

Version

1.2.0

License

GPL-2

Issues

Pull Requests

Stars

Forks

Last Published

February 15th, 2017

Functions in GeneBreak (1.2.0)

ens.gene.ann.hg38

Gene Annotation hg38
featureChromosomes,CopyNumberBreakPoints-method

Access Object featureChromosomes. This method returns a vector with feature chromosomes.
bpPlot,CopyNumberBreakPoints-method

bpPlot
copynumber.data.chr21

CGHcall Example copynumber data hg18 chr21
GeneBreak

GeneBreak: A package for gene breakpoint detection on copy number abberation data
bpFilter,CopyNumberBreakPoints-method

bpFilter
bpGenes,CopyNumberBreakPointGenes-method

bpGenes
ens.gene.ann.hg18

Gene Annotation hg18
addGeneAnnotation,CopyNumberBreakPoints-method

addGeneAnnotation
getBreakpoints

getBreakpoints
accessOptions,CopyNumberBreakPoints-method

Access Object Data. This method lists possible functions to access the data of the object.
breakpointData,CopyNumberBreakPoints-method

Access Object breakpointData. This method returns a dataframe with breakpoint values per feature.
copynumber.data.chr20

CGHcall Example copynumber data hg18 chr20
breakpointsPerGene,CopyNumberBreakPointGenes-method

Access Object breakpointsPerGene. This method returns a dataframe with breakpoints per gene.
geneChromosomes,CopyNumberBreakPointGenes-method

Access Object geneChromosomes. This method returns a vector with gene chromosomes.
geneInfo,CopyNumberBreakPointGenes-method

Access Options geneInfo. This method returns a dataframe with gene annotations.
featureInfo,CopyNumberBreakPoints-method

Access Options featureInfo. This method returns a dataframe with feature annotations.
namesFeatures,CopyNumberBreakPoints-method

Access Object namesFeatures. This method returns a vector with feature names.
CopyNumberBreakPointGenes-class

An S4 class to represent a CopyNumberBreakPointGenes object
featuresPerGene,CopyNumberBreakPointGenes-method

Access Object featuresPerGene. This method returns a vector with gene-related features for a particular gene.
segmentData,CopyNumberBreakPoints-method

Access Object segmentData. This method returns a dataframe with segment values.
recurrentGenes,CopyNumberBreakPointGenes-method

Access Options recurrentGenes. This method returns a dataframe that contains genes that are recurrently affected across samples based on a FDR threshold.
ens.gene.ann.hg19

Gene Annotation hg19
copynumber.data.chr18

CGHcall Example copynumber data hg18 chr18
sampleNames,CopyNumberBreakPoints-method

Access Object sampleNames. This method returns a vector with sample names.
callData,CopyNumberBreakPoints-method

Access Object callData. This method returns a dataframe with feature call values.
CopyNumberBreakPoints-class

An S4 class to represent a CopyNumberBreakPoints object.
bpStats,CopyNumberBreakPoints-method

bpStats