Javier Saldaña: Evaluation of Deep Speech model’s performance on Common Voice corpora in Spanish and Basque

Estudiante: Javier Saldaña

Directora: Eva Navas, Inma Hernáez

Fecha de defensa: Septiembre de 2022

Descripción:

Speech recognition is one of the main fields within Natural Language Processing, and its usage is widespread in different professional domains. Great advancements have taken place in the last decades regarding the development of automatic speech recognizers, for both computing power and algorithms have been greatly enhanced. Notwithstanding, the data availability continues to be an issue when implementing Artificial Intelligent models, for most corpora pertain to the private domain and obtaining data is only possible for a few companies with enough economical resources to afford it.
We support that science should be free and that anyone could develop their own speech recognizers whether they have the required knowledge to do so. Hence, in our project, we aim at evaluating the performance of a well-known open-source recognizer, DeepSpeech, on a large publicly available corpus, Common Voice, for both Spanish and Basque tongues at a general level. In our experiment, we test the model by altering three important parameters, namely the version of the corpus, the integration or disuse of a scorer and the presence or absence of repetitions within the training set. We also carry out a statistical evaluation of the content of our corpora, and we give our opinion regarding the current validation policy of Common Voice corpora.
Keywords: Speech

Previous post Open position: Programa Investigo Next post Secretos de las Telecomunicaciones

(no title)
24 June, 2026
Presenting our work at Odyssey 2026 in Lisbon
(no title)
8 June, 2026
https://aholab.ehu.eus/aholab/summer-course-deep-learning-for-speech-processing/
(no title)
11 May, 2026
https://aholab.ehu.eus/aholab/shape-the-future-of-speech-ai/
(no title)
31 March, 2026
HiTZ zentroak ahotsa euskaraz ezagutu eta sintetizatzeko eredu ireki berriak argitaratu ditu https://www.ehu.eus/eu/web/campusa/-/hitz-zentroak-ahotsa-euskaraz-ezagutu-eta-sintetizatzeko-eredu-ireki-berriak-argitaratu-ditu El centro HiTZ publica nuevos modelos abiertos de reconocimiento y síntesis de voz en euskera https://www.ehu.eus/es/web/campusa/-/hitz-zentroak-ahotsa-euskaraz-ezagutu-eta-sintetizatzeko-eredu-ireki-berriak-argitaratu-ditu @hitz-zentroa.bsky.social
(no title)
23 March, 2026
The BrAIn2Lang website is now online. This project explores how speech and language can be decoded from brain activity, bringing together neuroimaging and speech technologies. aholab.ehu.eus/brain2lang/
(no title)
12 February, 2026
We’re organizing a Special Session on Speech & Language Technologies in Healthcare at #Odyssey2026 (Lisbon) From voice-based diagnosis to assistive and inclusive communication technologies — research meeting real clinical impact. Submit by March 15 https://odyssey2026.inesc-id.pt/speech-and-language-technologies-in-healthcare/ Join us!
(no title)
11 February, 2026
Gorabehera baten ondorioz, web zerbitzu batzuk ez dabiltza ondo. Konpontzen ari gara. Barkatu. Due to an incident, some web services are not working properly. We’re fixing it. Sorry. Por una incidencia, algunos servicios web no funcionan correctamente. Estamos trabajando en ello. Disculpad.
(no title)
4 February, 2026
Santa Ageda bezpera dugu! Goazen kantari! Entzun nahi duzue bizkaieraren fonotekan daukagun herri literatura? l.eus/5n7kqica Hona hemen adibide bat! l.eus/hmwkluwl
(no title)
30 January, 2026
Publiko egin da EMG-Voc ReSSint Database datu-basea ELRAren bidez The EMG-Voc ReSSint Database has been made publicly available through ELRA. Se ha hecho pública a través de ELRA la base de datos EMG-Voc ReSSint Database https://islrn.org/resources/057-914-072-202-4/ https://catalog.elra.info/en-us/repository/browse/ELRA-S0498/
(no title)
14 January, 2026
Presentando en el Congreso Internacional de Fonética Experimental, CIFE X, en la Universidad de Córdoba. uco.congressus.es/cife2026/