{"id":3035,"date":"2021-07-08T17:08:17","date_gmt":"2021-07-08T15:08:17","guid":{"rendered":"https:\/\/aholab.ehu.eus\/aholab\/?p=3035"},"modified":"2023-04-25T11:39:15","modified_gmt":"2023-04-25T09:39:15","slug":"anne-manero-elaboracion-de-un-corpus-en-castellano-para-grabacion-de-senales-emg-para-interfaces-de-habla-silenciosa","status":"publish","type":"post","link":"https:\/\/aholab.ehu.eus\/aholab\/anne-manero-elaboracion-de-un-corpus-en-castellano-para-grabacion-de-senales-emg-para-interfaces-de-habla-silenciosa\/","title":{"rendered":"Anne Manero: Implementation and evaluation of a Spanish TTS based on FastPitch"},"content":{"rendered":"\n<p><strong>Estudiante<\/strong>: Maite Fontecha<\/p>\n\n\n\n<p><strong>Directora<\/strong>: Eva Navas, Inma Hern\u00e1ez<\/p>\n\n\n\n<p><strong>Fecha de defensa<\/strong>: Septiembre de 2022<\/p>\n\n\n\n<p><strong>Descripci\u00f3n<\/strong>: <\/p>\n\n\n\n<p>Text-to-speech (TTS) generates speech from text. This tool helps improve people\u2019s quality<br>of life. However, when extending these models to support languages like Spanish, we find<br>scarce databases, data processing tools, and model training resources.<br>In this thesis, I implemented and evaluated a Spanish TTS model on FastPitch with a 10<br>hour database. FastPitch is a neural network-based end-to-end TTS system that allows for<br>prosody transformations. I first researched state-of-art TTS and preprocessed the dataset,<br>then implemented and evaluated the model. As a result, several resources are provided:<br>tools for raw database processing, methods for linguistic module adaptation, a clean dataset<br>and a quality TTS system in Spanish.<br>This model\u2019s quality is compared with two vocoders (WaveGlow\/HiFiGan) and two other<br>state-of-art acoustic models (FastSpeech2\/Tacotron2). The FastPitch model synthesized<br>with HiFiGan vocoder obtained the highest quality results. To conclude, prosody transformation<br>experiments at inference resulted successful with this FastPitch Spanish TTS.<br>Keywords: Text-To-Speech, Spanish, acoustic models, data preprocessing, Deep Neural<br>Networks.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Estudiante: Maite Fontecha Directora: Eva Navas, Inma Hern\u00e1ez Fecha de defensa: Septiembre de 2022 Descripci\u00f3n: Text-to-speech (TTS) generates speech from text. This tool helps improve people\u2019s qualityof life. However, when extending these models to support languages like Spanish, we findscarce databases, data processing tools, and model training resources.In this thesis, I implemented and evaluated a&#8230;<\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_es_post_content":"","_es_post_name":"","_es_post_excerpt":"","_es_post_title":"","_eu_post_content":"","_eu_post_name":"","_eu_post_excerpt":"","_eu_post_title":"","_en_post_content":"<!-- wp:paragraph -->\n<p><strong>Estudiante<\/strong>: Maite Fontecha<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p><strong>Directora<\/strong>: Eva Navas, Inma Hern\u00e1ez<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p><strong>Fecha de defensa<\/strong>: Septiembre de 2022<\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p><strong>Descripci\u00f3n<\/strong>: <\/p>\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n<p>Text-to-speech (TTS) generates speech from text. This tool helps improve people\u2019s quality<br>of life. However, when extending these models to support languages like Spanish, we find<br>scarce databases, data processing tools, and model training resources.<br>In this thesis, I implemented and evaluated a Spanish TTS model on FastPitch with a 10<br>hour database. FastPitch is a neural network-based end-to-end TTS system that allows for<br>prosody transformations. I first researched state-of-art TTS and preprocessed the dataset,<br>then implemented and evaluated the model. As a result, several resources are provided:<br>tools for raw database processing, methods for linguistic module adaptation, a clean dataset<br>and a quality TTS system in Spanish.<br>This model\u2019s quality is compared with two vocoders (WaveGlow\/HiFiGan) and two other<br>state-of-art acoustic models (FastSpeech2\/Tacotron2). The FastPitch model synthesized<br>with HiFiGan vocoder obtained the highest quality results. To conclude, prosody transformation<br>experiments at inference resulted successful with this FastPitch Spanish TTS.<br>Keywords: Text-To-Speech, Spanish, acoustic models, data preprocessing, Deep Neural<br>Networks.<\/p>\n<!-- \/wp:paragraph -->","_en_post_name":"anne-manero-elaboracion-de-un-corpus-en-castellano-para-grabacion-de-senales-emg-para-interfaces-de-habla-silenciosa","_en_post_excerpt":"","_en_post_title":"Anne Manero: Implementation and evaluation of a Spanish TTS based on FastPitch","edit_language":"en","footnotes":""},"categories":[62],"tags":[],"class_list":["post-3035","post","type-post","status-publish","format-standard","hentry","category-master-thesis-finished"],"_links":{"self":[{"href":"https:\/\/aholab.ehu.eus\/aholab\/wp-json\/wp\/v2\/posts\/3035","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/aholab.ehu.eus\/aholab\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/aholab.ehu.eus\/aholab\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/aholab.ehu.eus\/aholab\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/aholab.ehu.eus\/aholab\/wp-json\/wp\/v2\/comments?post=3035"}],"version-history":[{"count":3,"href":"https:\/\/aholab.ehu.eus\/aholab\/wp-json\/wp\/v2\/posts\/3035\/revisions"}],"predecessor-version":[{"id":3481,"href":"https:\/\/aholab.ehu.eus\/aholab\/wp-json\/wp\/v2\/posts\/3035\/revisions\/3481"}],"wp:attachment":[{"href":"https:\/\/aholab.ehu.eus\/aholab\/wp-json\/wp\/v2\/media?parent=3035"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/aholab.ehu.eus\/aholab\/wp-json\/wp\/v2\/categories?post=3035"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/aholab.ehu.eus\/aholab\/wp-json\/wp\/v2\/tags?post=3035"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}