Learn R Programming

VLF (version 1.1)

aminoAcidFun: Amino Acid VLF Analysis Function

Description

Runs the full amino acid VLF analysis for the user and outputs results.

Usage

aminoAcidFun(x, p = 0.001, seqlength = 216, own = NULL)

Value

modal

A vector containing the amino acid sequence that occurs most often in the dataset.

con100

The number of amino acid positions that are 100% conserved in the sequence

conp

The number of amino acid positions that are (1-p)% conserved in the sequence

combine

The number of amino acid positions that are (1-p)% conserved when combining the first and second modal sequences.

specimen

A vector containing the number of VLFs for each specimen in the dataset.

position

A vector containing the number of VLFs for each position in the sequences.

sas

A matrix containing vectors of single and shared amino acid VLF counts for each position of the sequence.

VLFmatrix

A matrix containing only those specimen that have VLFs as well as the amino acid at the positions that contain VLFs and NAs in all other positions.

ownSpecCount

A vector containing the number of VLFs for each specimen in the users own specified dataset. Only appears if own is not NULL.

ownPosCount

A vector containing the number of VLFs for each position in the sequences of the users own specified dataset. Only appears if own is not NULL.

ownVLFMatrix

A matrix containing only those amino acids at the positions that contain VLFs and NAs in all other positions of the sequence. Only appears if own is not NULL.

ownVLFreduced

ownVLFMatrix

A matrix containing only those specimen that have VLFs as well as the amino acids at the positions that contain VLFs and NAs in all other positions of the sequence. Only appears if own is not NULL.

Arguments

x

A matrix of amino acid sequences with the first column containing the unique specimen identifier, the second column containing the species name and the remaining columns containing the amino acid sequence.

p

A VLF designation frequency cut-off to be used within the analysis. By default p = 0.001.

seqlength

The length of the amino acid sequence. By default seqlength = 216.

own

If the user wants to compare their own sequences separate from reference sequences, then argument can be used. Similar to x, this argument is a matrix of amino acid sequences with the first column containing the unique specimen identifier, the second column contains the species name and the remaining columns containing the nucleotide sequence. By default own = NULL.)

Author

Taryn B. T. Athey and Paul D. McNicholas

Examples

Run this code
if (FALSE) data(birds_aminoAcids)
bird_aaAnalysis <- aminoAcidFun(birds_aminoAcids)

Run the code above in your browser using DataLab