Learn R Programming

PhyloMeasures (version 2.1)

mpd.pvalues: Computes the p-values of the Mean Pairwise Distance measure

Description

Calculates the p-values of the mean pairwise distance (MPD) measure for sets of tips on a phylogeny. The p-values can be calculated under two different null models which maintain species richness.

Usage

mpd.pvalues(tree, matrix, null.model="uniform", abundance.weights, reps=1000, seed)

Arguments

tree
A phylo tree object
matrix
A matrix with binary (0/1) values, where each row represents a tip set. Each column name in the matrix must match a tip label on the input tree. If not all values in the matrix are binary, we consider two cases; if the matrix contains only non-negative values, all values are coerced to binary ones and a warning message is printed. If the matrix contains at least one negative value, the function throws an error.
null.model
A character vector (string) that defines which null model is used for computing the p-values of the measure. There are two possible null models that can be used for computing the p-values: these are "uniform" and "sequential". Both models maintain species richness. More specifically, the available models are defined as follows:

  • "uniform" considers samples with equal (uniform) probability among all possible tip samples of the same richness.

  • "sequential" is an abundance-weighted null model where species samples are chosen based on the same method as R's sample function. Unlike the other model (which is computed analytically), this model uses Monte-Carlo randomization.
  • This argument is optional, and its default value is set to "uniform".

    abundance.weights
    A vector of positive numeric values. These are the abundance weights that will be used if option "sequential" is selected. The names stored at the vector must match the names of the tips in the tree. This argument is redundant if the "uniform" model is selected.
    reps
    An integer that defines the number of Monte-Carlo random repetitions that will be performed when using the "sequential" model. This argument is redundant if the other null model is selected.
    seed
    A positive integer that defines the random seed used in the Monte-Carlo randomizations of the "sequential" model. This argument is optional, and becomes redundant if the other null model is selected.

    Value

    References

    Tsirogiannis, C. and B. Sandel. 2015. PhyloMeasures: A package for computing phylogenetic biodiversity measures and their statistical moments. Ecography, doi: 10.1111/ecog.01814, 2015.

    Tsirogiannis, C., B. Sandel and D. Cheliotis. 2012. Efficient computation of popular phylogenetic tree measures. Algorithms in Bioinformatics, LNCS 7534: 30-43.

    See Also

    mpd.moments

    Examples

    Run this code
    #Load phylogenetic tree of bird families from package "ape"
    data(bird.families, package = "ape")
    
    #Create 100 random communities with 50 families each
    comm = matrix(0,nrow = 100,ncol = length(bird.families$tip.label))
    for(i in 1:nrow(comm)) {comm[i,sample(1:ncol(comm),50)] = 1}
    colnames(comm) = bird.families$tip.label
    
    #Calculate p-values under the uniform model
    mpd.pvalues(bird.families,comm, reps=1000)
    
    # Create random abundance weights
    weights = runif(length(bird.families$tip.label))
    names(weights) = bird.families$tip.label
    
    # Calculate p-values under the sequential model
    mpd.pvalues(bird.families,comm,null.model="sequential",
                abundance.weights=weights, reps=1000)
    

    Run the code above in your browser using DataLab