Normfeat¶
Copyright 2014-2020 Anthony Larcher and Sylvain Meignier
frontend
provides methods to process an audio signal in order to extract
useful parameters for speaker verification.
-
frontend.normfeat.
cep_sliding_norm
(features, win=301, label=None, center=True, reduce=False)[source]¶ Performs a cepstal mean substitution and standard deviation normalization in a sliding windows. MFCC is modified.
- Parameters
features – the MFCC, a numpy array
win – the size of the sliding windows
label – vad label if available
center – performs mean subtraction
reduce – performs standard deviation division
-
frontend.normfeat.
cms
(features, label=None, global_mean=None)[source]¶ Performs cepstral mean subtraction
- Parameters
features – a feature stream of dimension dim x nframes where dim is the dimension of the acoustic features and nframes the number of frames in the stream
label – a logical vector
global_mean – pre-computed mean to use for feature normalization if given
- Returns
a feature stream
-
frontend.normfeat.
cmvn
(features, label=None, global_mean=None, global_std=None)[source]¶ Performs mean and variance normalization
- Parameters
features – a feature stream of dimension dim x nframes where dim is the dimension of the acoustic features and nframes the number of frames in the stream
global_mean – pre-computed mean to use for feature normalization if given
global_std – pre-computed standard deviation to use for feature normalization if given
label – a logical verctor
- Returns
a sequence of features
-
frontend.normfeat.
rasta_filt
(x)[source]¶ Apply RASTA filtering to the input signal.
- Parameters
x – the input audio signal to filter. cols of x = critical bands, rows of x = frame same for y but after filtering default filter is single pole at 0.94
-
frontend.normfeat.
stg
(features, label=None, win=301)[source]¶ Performs feature warping on a sliding window
- Parameters
features – a feature stream of dimension dim x nframes where dim is the dimension of the acoustic features and nframes the number of frames in the stream
label – label of selected frames to compute the Short Term Gaussianization, by default, al frames are used
win – size of the frame window to consider, must be an odd number to get a symetric context on left and right
- Returns
a sequence of features