melfcc: MFCC Calculation

Description

Calculate Mel-frequency cepstral coefficients.

Usage

melfcc(samples, sr = samples@samp.rate, wintime = 0.025, 
    hoptime = 0.01, numcep = 12, lifterexp = 0.6, htklifter=FALSE,
    sumpower = TRUE, preemph = 0.97, dither = FALSE,
    minfreq = 0, maxfreq = sr/2, nbands = 40, bwidth = 1, 
    dcttype = c("t2", "t1", "t3", "t4"), 
    fbtype = c("mel", "htkmel", "fcmel", "bark"), usecmp = FALSE, 
    modelorder = NULL, spec_out=FALSE, frames_in_rows=TRUE)

Arguments

samples

Object of class Wave.

Sampling rate of the signal.

wintime

Window length in sec.

hoptime

Step between successive windows in sec.

numcep

Number of cepstra to return.

lifterexp

Exponent for liftering; 0 = none;

htklifter

Use HTK sin lifter.

sumpower

If sumpower=TRUE the frequency scale transformation is based on the powerspectrum, if sumpower=FALSE it is based on its squareroot (absolute value of the spectrum) and squared afterwards.

preemph

Apply pre-emphasis filter [1 -preemph] (0 = none).

dither

Add offset to spectrum as if dither noise.

minfreq

Lowest band edge of mel filters (Hz).

maxfreq

Highest band edge of mel filters (Hz).

nbands

Number of warped spectral bands to use.

bwidth

Width of spectral bands in Bark/Mel.

dcttype

Type of DCT used - 1 or 2 (or 3 for HTK or 4 for feacalc).

fbtype

Auditory frequency scale to use: mel, bark, htkmel, fcmel

usecmp

Apply equal-loudness weighting and cube-root compression (PLP instead of LPC).

modelorder

If > 0, fit a linear prediction (autoregressive-) model of this order and calculation of cepstra out of lpcas

spec_out

Should matrices of the power- and the auditory-spectrum be returned

frames_in_rows

Return time frames in rows instead of columns (original Matlab code)

Value

cepstraCepstral coefficients of the input signal (one time frame per row/column)
aspectrumAuditory spectrum (spectrum after transformation to Mel/Bark scale) of the signal
pspectrumPower spectrum of the input signal.
lpcasIf modelorder > 0 the linear prediction coefficients (LPC/PLP).

concept

mfcc
lpc
plp
mel
cepstrum

Details

Calculation of the MFCCs imlcudes the following steps:

Preemphasis filtering

Take the absolute value of the STFT (usage of Hamming window) Warp to auditory frequency scale (Mel/Bark) Take the DCT of the log-auditory-spectrum Return the first 'ncep' components

References

Daniel P. W. Ellis: http://www.ee.columbia.edu/~dpwe/resources/matlab/rastamat/

Examples

Run this code

testsound <- normalize(sine(400) + sine(1000) + square(250), "16")
  m1 <- melfcc(testsound)

  #Use PLP features to calculate cepstra and output the matrices like the
  #original Matlab code (note: modelorder limits the number of cepstra)
  m2 <- melfcc(testsound, numcep=9, usecmp=TRUE, modelorder=8, 
    spec_out=TRUE, frames_in_rows=FALSE)

Run the code above in your browser using DataLab