This function is basically a wrapper for function computePI.
This function translate DNA sequence into protein, and compute the theoretical isoelectric point
(pI) of this protein.
A FASTA file loaded by function read.fasta of
seqinr-package.
label
Optional. String. Indicate the label of the sequences such as
"NonCoding", "Coding".
on.ORF
Logical. If TRUE, pI will be calculated on
the longest ORF region. NOTE: If TRUE, the input has to be DNA sequences.
(Default: FALSE)
auto.full
Logical. When on.ORF = TRUE but no ORF can be found,
if auto.full = TRUE, pI will be calculated on full sequences automatically;
if auto.full is FALSE, the sequences that have no ORF will be discarded.
Ignored when on.ORF = FALSE. (Default: FALSE)
ambiguous.base
If TRUE, ambiguous bases are taken into account when
translating DNA sequences into proteins.
parallel.cores
Integer. The number of cores for parallel computation.
By default the number of cores is 2. Users can set as -1 to run
this function with all cores.
Author
HAN Siyu
Details
This function can compute the pI of DNA sequences. Method CPC2 (Kang et al. 2017) uses this
feature to identify lncRNAs, and this feature is evaluated in the article LncFinder (Han et al. 2018).
Using this function, the theoretical pI can be computed on full sequence or the longest ORF region.
In CPC2, pI is calculated on ORF region.