Anne Manero: Implementation and evaluation of a Spanish TTS based on FastPitch

Estudiante: Maite Fontecha

Directora: Eva Navas, Inma Hernáez

Fecha de defensa: Septiembre de 2022

Descripción:

Text-to-speech (TTS) generates speech from text. This tool helps improve people’s quality
of life. However, when extending these models to support languages like Spanish, we find
scarce databases, data processing tools, and model training resources.
In this thesis, I implemented and evaluated a Spanish TTS model on FastPitch with a 10
hour database. FastPitch is a neural network-based end-to-end TTS system that allows for
prosody transformations. I first researched state-of-art TTS and preprocessed the dataset,
then implemented and evaluated the model. As a result, several resources are provided:
tools for raw database processing, methods for linguistic module adaptation, a clean dataset
and a quality TTS system in Spanish.
This model’s quality is compared with two vocoders (WaveGlow/HiFiGan) and two other
state-of-art acoustic models (FastSpeech2/Tacotron2). The FastPitch model synthesized
with HiFiGan vocoder obtained the highest quality results. To conclude, prosody transformation
experiments at inference resulted successful with this FastPitch Spanish TTS.
Keywords: Text-To-Speech, Spanish, acoustic models, data preprocessing, Deep Neural
Networks.

Previous post Itxasne Díez: Machine listening para la detección y clasificación sonora en entornos reales Next post In the Media: Capaces de Comunicar

at://did:plc:d63onl5bo63jnl6tzbd3wqk4/app.bsky.feed.post/3lu6aa23lvc2z
Source: @aholab.bsky.social - Aholab Taldea Published on 2025-07-17
at://did:plc:d63onl5bo63jnl6tzbd3wqk4/app.bsky.feed.post/3ltj4k3gcs222
Source: @aholab.bsky.social - Aholab Taldea Published on 2025-07-09
at://did:plc:d63onl5bo63jnl6tzbd3wqk4/app.bsky.feed.post/3lt2hwpwyq226
Source: @aholab.bsky.social - Aholab Taldea Published on 2025-07-03
Aholab Taldea (@aholab.bsky.social)
Source: @aholab.bsky.social on Bluesky Published on 2025-07-03
at://did:plc:d63onl5bo63jnl6tzbd3wqk4/app.bsky.feed.post/3lsvhcoujok2s
Source: @aholab.bsky.social - Aholab Taldea Published on 2025-07-01
Aholab Taldea (@aholab.bsky.social)
Source: @aholab.bsky.social on Bluesky Published on 2025-07-01
at://did:plc:d63onl5bo63jnl6tzbd3wqk4/app.bsky.feed.post/3lrairh3l3c27
Source: @aholab.bsky.social - Aholab Taldea Published on 2025-06-10
Aholab Taldea (@aholab.bsky.social)
Source: @aholab.bsky.social on Bluesky Published on 2025-06-10
at://did:plc:d63onl5bo63jnl6tzbd3wqk4/app.bsky.feed.post/3lqf2fs5qq22j
Source: @aholab.bsky.social - Aholab Taldea Published on 2025-05-30
Aholab Taldea (@aholab.bsky.social)
Source: @aholab.bsky.social on Bluesky Published on 2025-05-30

(no title)
17 July, 2025
Gaur Itxasne Diezek bere tesia defendatu du: Machine Listening para la detección y clasificación sonora en entornos urbanos. Zorionak, txapeldun!
(no title)
9 July, 2025
We are hiring! More details at https://aholab.ehu.eus/aholab/hiring-3-researchers/ 3 ikerlari bilatzen dugu. Incorporamos 3 puestos de investigación.
(no title)
3 July, 2025
Gure lana erakusteko aukera izan dugu. [contains quote post or other embedded content]
(no title)
1 July, 2025
Gu ere @hitz-zentroa.bsky.social bileran gure lanaren berri kontatzen.
(no title)
10 June, 2025
Ostegunean, hitzaldia. El jueves, charla. https://aholab.ehu.eus/aholab/eu/talk-voces-sinteticas-personalizadas-para-que-sirven-el-proyecto-ahomytts/
(no title)
30 May, 2025
Primeran aritu dire gure kideak #IKERGAZTE2025 kongresuan!