Ahocoder download website

Ahocoder parameterizes speech waveforms into three different streams: log-f0, cepstral representation of the spectral envelope, and maximum voiced frequency. It provides high accuracy during analysis and high quality during reconstruction. It is adequate for statistical parametric speech synthesis and voice conversion. Furthermore, it can be used just for basic speech manipulation and transformation (pitch level and variance, speaking rate, vocal tract length...).

Ahocoder is reported to be a very good complement for HTS. The output files generated by Ahocoder contain float numbers without header, so they are fully compatible with the HTS demo scripts in the HTS website. You can use the same configuration as in the STRAIGHT-based demo, using the "bap" stream to handle maximum voiced frequency (set its dimension to 1 both in data/Makefile and in scripts/Config.pm).

For more technical details, please have a look to these related publications:

Through this website you can get an executable version of the tool. For any doubt, suggestion, comment, feedback, etc, please contact the main author D. Erro: derro(a)aholab.ehu.es

Download now!



AHOLAB Signal Processing Laboratory, UPV/EHU.

Alda. Urquijo s/n, 48013 Bilbao, Spain. Phone: +34 946017245. Fax: +34 946014259.
Contact email: derro(a)aholab.ehu.es