Learn R Programming

phyclust

  • License:
  • Download:
  • Author: Wei-Chen Chen and Karin Dorman

Phylogenetic clustering (phyloclustering) is an evolutionary Continuous Time Markov Chain model-based approach to identify population structure from molecular data without assuming linkage equilibrium. The package phyclust (Chen 2011) provides a convenient implementation of phyloclustering for DNA and SNP data, capable of clustering individuals into subpopulations and identifying molecular sequences representative of those subpopulations. It is designed in C for performance, interfaced with R for visualization, and incorporates other popular open source programs including ms (Hudson 2002), seq-gen (Rambaut and Grassly 1997), Hap-Clustering (Tzeng 2005) and PAML baseml (Yang 1997, 2007), for simulating data, additional analyses, and searching the best tree. See the phyclust website for more information, documentations and examples.

Installation

phyclust requires

  • R version 3.0.0 or higher.
  • R package ape.

The package can be installed from the CRAN via the usual install.packages("phyclust"), or via the devtools package:

library(devtools)
install_github("snoweye/phyclust")

Copyright

See phyclust/inst/Documents/ for files in src/msdir/, src/seq-gen/, src/paml_baseml, and R/ttzeng-*.r.

Copy Link

Version

Install

install.packages('phyclust')

Monthly Downloads

1,867

Version

0.1-34

License

GPL (>= 2)

Maintainer

Last Published

September 6th, 2023

Functions in phyclust (0.1-34)

bootstrap.seq

Bootstrap Sequences from a Fitted Model and Star Tree.
RRand

Rand Index and Adjusted Rand Index
bootstrap.seq.data

Bootstrap a seq.data from a Fitted Model.
.label.method

Label Method
.init.procedure

Initialization Procedures for EM Algorithms
as.star.tree

Coerce a Rooted Tree to a Star Tree in Class phylo
file.write

Write Data to Files by Formats
data.phylip.pony

Great Pony 524 EIAV rev Dataset in the phylip Format
.substitution.model

Substitution Models for Mutation Processes
data.fasta.pony

Great Pony 625 EIAV rev Dataset in the Fasta Format
find.best

Find the Best Solution of phyclust
standard.code

Standard Codes and ids for Nucleotides, SNPs, Codon, Amino Acid and Genetic Code
file.read

Read Data from Files by Formats and Return a seq.data Object
.se.model

Sequencing Error Model
code2nid

Transfer Codes (A, G, C, T, -) and nids (0, 1, 2, 3, 4)
.show.option

Show Available Options
ms

Generating Samples under a Wright-Fisher Neutral Model of Genetic Variation
nid.aid.cid

Transfer nids (0, 1, ..., 4) , aids (0, 1, ..., 21) and cids (0, 1, ..., 64)
code2snp

Transfer Nucleotide Codes / nids and SNPs / sids
haplo.post.prob

Tzeng's Method: Haplotype Grouping for SNP Sequences
gen.star.tree

Generate a Rooted Tree with a Star Shape
gen.seq

Generate Sequences Given a Rooted Tree.
getcut.fun

Tzeng's Method: Finding the Best Number of Clusters
bootstrap.star.trees

Bootstrap a Star Tree from a Fitted Model.
bootstrap.star.trees.seq

Bootstrap Sequences from a Fitted Model.
phyclust

The Main Function of phyclust
phyclust.e.step

One E-Step of phyclust
gen.unit.K

Generate Comprehensive Trees.
get.rooted.tree.height

Get a Rooted Tree Height
phyclust.se

The Main Function of phyclust for Sequencing Error Models
data.phylip.crohn

Crohn's Disease SNP Dataset in the phylip Format
phyclust.se.update

Update phyclust Results by the Sequencing Error Model
paml.baseml

Phylogenetic Analysis by Maximum Likelihood for Nucleotide Sequences
phyclust.edist

Evolution Distance of Sequences
print.object

Functions for Printing or Summarizing Objects According to Classes
find.consensus

Find the Consensus Sequence
prune.Mu

Prune the Center Sequences Mu
phyclust.logL

Log-Likelihood of phyclust
phyclust.m.step

One M-Step of phyclust
read.seqgen

Read seqgen's Results and Return a seq.data
plotdots

Dots Plots of Sequences for Visual Comparisons
phyclust.em.step

One EM-step of phyclust
phyclust.update

Update phyclust Results
seq.data

A Toy Dataset in Class seq.data
snp2sid

Transfer SNP codes (1, 2, -) and sids (0, 1, 2)
plothist

Plot Histogram to Compare Number of Mutations.
plotstruct

Struct Plots of Observations Based on Posterior Probabilities
rescale.rooted.tree

Rescale a Rooted Tree's Height
plotnj

Plot an Unrooted Trees.
plotgaps

Gaps Plots of Sequences for Visual Comparisons
gen.equal.star.anc.dec

Generate Comprehensive Trees.
seqgen

Seq-Gen
all.internal

All Internal Functions of phyclust
all.internal.tzeng

All Internal Functions of Tzeng's Methods
phyclust.Pt

Transition Probabilities of phyclust Given Time
.Color

Colors for Identifying Clusters in Plots
.identifier

Identifiers for Evolution Models
.EMC

EM Control
.em.method

EM Methods and Algorithms
phyclust-package

Phyloclustering -- Phylogenetic Clustering
.edist.model

Evolution Distance Model
.init.method

Initialization Methods for EM Algorithms
.code.type

Code Types of Dataset and Substitution Models
.boundary.method

Boundary Methods for Population Proportions
.EMControl

EM Control Generator