3
@page sig2fv_manual sig2fv
4
@brief *Generate signal processing coefficients from waveforms*
7
@section synopsis Synopsis
11
`sig2fv` is used to create signal processing feature vector analysis on speech
13
The following types of analysis are provided:
15
- Linear prediction (LPC)
16
- Cepstrum coding from lpc coefficients
17
- Mel scale cepstrum coding via fbank
18
- Mel scale log filterbank analysis
19
- Line spectral frequencies
20
- Linear prediction reflection coefficients
21
- Root mean square energy
23
- fundamental frequency (pitch)
24
- calculation of delta and acceleration coefficients of all of the above
26
The -coefs option is used to specify a list of the names of what sort
27
of basic processing is required, and -delta and -acc are used for
28
delta and acceleration coefficients respectively.
31
@section options Options
35
@section sig2fv-examples Examples
37
Fixed frame basic linear prediction:
39
To produce a set of linear prediction coefficients at every 10ms, using
40
pre-emphasis and saving in EST format:
42
$ sig2fv kdt_010.wav -o kdt_010.lpc -coefs "lpc" -otype est -shift 0.01 -preemph 0.5
44
**Pitch Synchronous linear prediction**:
45
The following used the set of pitchmarks in kdt_010.pm as the centres
46
of the analysis windows.
48
$ sig2fv kdt_010.wav -pm kdt_010.pm -o kdt_010.lpc -coefs "lpc" -otype est -shift 0.01 -preemph 0.5
50
F0, Linear prediction and cepstral coefficients:
52
$ sig2fv kdt_010.wav -o kdt_010.lpc -coefs "f0 lpc cep" -otype est -shift 0.01
54
Note that pitchtracking can also be done with the
55
`pda` program. Both use the same underlying
56
technique, but the pda program offers much finer control over the
57
pitch track specific processing parameters.
59
Energy, Linear Prediction and Cepstral coefficients, with a 10ms frame shift
60
during analysis but a 5ms frame shift in the output file:
62
$ sig2fv kdt_010.wav -o kdt_010.lpc -coefs "f0 lpc cep" -otype est -S 0.005
65
Delta and acc coefficients can be calculated even if their base form is not
66
required. This produces normal energy coefficients and cepstral delta coefficients:
68
$ sig2fv ../kdt_010.wav -o kdt_010.lpc -coefs "energy" -delta "cep" -otype est
70
Mel-scaled cepstra, Delta and acc coefficients, as is common in speech
73
$ sig2fv ../kdt_010.wav -o kdt_010.lpc -coefs "melcep" -delta "melcep" -acc "melcep" -otype est -preemph 0.96