FenixEdu™

Dissertação

{en_GB=Text-to-Speech Synthesis in European Portuguese using Deep Learning} {} EVALUATED

Detalhes: {pt=Esta dissertação de mestrado tem como principal objetivo a síntese de fala a partir de texto para o Português Europeu utilizando técnicas de "deep learning". A motivação para este trabalho é em primeiro lugar a de construir uma voz de criança que é bastante necessária e por outro lado para o desenvolvimento de uma ferramenta de síntese de fala expressiva.Nos sintetizadores há duas limitações sentidas em muitas áreas de aplicação, em particular na interação com robôs e jogos sérios. Este tema inclui diversas áreas de estudo tais como, aprendizagem automática, síntese de fala, linguística e aquisição de fala.Este relatório descreve o trabalho realizado, a ferramenta (Merlin) utilizada para síntese de fala e as conclusões finais deste projeto. A escolha do Merlin, foi guiada pela aplicação desejada para o projeto, pela experiência prévia da equipa, e pela informação disponível sobre cada um dos métodos,dentro do grupo que envolve "deep learning".O trabalho realizado com o Merlin irá permitir a síntese de fala com voz de criança, estando maioritariamente dependente de se gravar uma criança., en=This thesis has one main goal: to synthesise speech from text for European Portuguese, using recent deep learning techniques. The motivation was to use this work on one hand as a first step for building a much needed child’s voice, and on the other hand as a framework to synthesise expressive speech. There are two limitations that are faced by synthesizers in many areas of application, in particular, in the interaction with robots and serious games. This topic involves many areas of study such as machine learning, speech synthesis, linguistics and speech acquisition. This article reports the work done, the framework (Merlin) used to synthesise speech and the final conclusions of this project. The choice of Merlin was guided by the target application, the previous experience of the team, and the available information about each method, within the range of speech synthesis meth-ods involving deep learning. The work done with Merlin will enable the synthesis of a child’s voice depending mainly on the recordings of a child.}
Keywords: {pt=Síntese de Fala, Sistemas de conversão texto-para-fala, Língua Portuguesa, "Deep Learning", Voz de criança, Síntese de fala Expressiva, en=Speech Synthesis, Text-to-speech Systems, Portuguese Language, Deep Learning, Child’s Voice, Expressive Speech Synthesis}

Discussão: novembro 20, 2018, 11:10