GlottDNN Vocoder
The GlottDNN package contains two main parts:
1) The glottal vocoder, written in C++. Dependencies: libsndfile, libgsl, libconfig
2) Python scripts for vocoder analysis, synthesis, and training a DNN excitation model. Dependencies: python, numpy, theano
Installing
The vocoder C++ code has the following library dependencies:
- libgsl (GNU Scientific Library), for basic linear algebra, FFT, etc.
- libsndfile, for reading and writing audio files
- libconfig++, for reading structured configuration files
Usually the best way to install the dependencies is with the system package manager. For example, in Ubuntu 14.04, use apt-get to install the packages libgsl0-dev, libsndfile1-dev, and libconfig++-dev.
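Before running the build, it can help to confirm that the development packages are actually visible on the system. The following is a minimal sketch, not part of the GlottDNN package: it assumes pkg-config is installed and that the libraries register under the module names gsl, sndfile, and libconfig++ (the usual names, but your distribution may differ). The check_deps function name is hypothetical.

```shell
#!/bin/sh
# Report whether each vocoder dependency is visible to pkg-config.
# Assumption: the module names below match the .pc files shipped by
# the development packages; adjust them if your distribution differs.
check_deps() {
    for pkg in gsl sndfile libconfig++; do
        # --exists prints nothing and only sets the exit status
        if pkg-config --exists "$pkg" 2>/dev/null; then
            echo "found:   $pkg"
        else
            echo "missing: $pkg"
        fi
    done
}

check_deps
```

A "missing" line can also mean that pkg-config itself is not installed, so treat the output as a hint rather than a definitive answer.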
The C++ part uses a standard GNU autotools build system. To compile the vocoder, run the following commands in this directory:
./configure
make
Since the build targets are rather generically named Analysis and Synthesis, you might not want them in your default system PATH. To choose a different install path, pass the --prefix option to configure:
./configure --prefix=/your/install/path/bin
make install
Some typical use cases:
Train an excitation model at a 16 kHz sample rate:
- modify config/config_default_16k.cfg
- modify python/config_default_16k.py
- run the feature extraction and training script:
python python/GlottDnnScript.py \
python/config_default_16k.py
If you have any questions, please contact the authors or open an issue on GitHub:
- Lauri Juvela: lauri.juvela@aalto.fi
- Manu Airaksinen: manu.airaksinen@aalto.fi
This code is licensed under the MIT licence; see LICENCE for more information.