Vlastimil Gular’s life took an unwelcome turn a year ago: minor surgery on his vocal cords revealed throat cancer, which led to the loss of his larynx and with it, his voice.
But the 51-year-old father of four is still chatting away using his own voice rather than the tinny timbre of a robot, thanks to an innovative app developed by two Czech universities.
It was developed for patients set to lose their voice due to a laryngectomy, or removal of the larynx, a typical procedure for advanced stages of throat cancer. The joint project of the University of West Bohemia in Pilsen, Prague’s Charles University and two private companies – CertiCon and SpeechTech – kicked off nearly two years ago.
The technology uses recordings of a patient’s voice to create synthetic speech that can be played on their mobile phones, tablets or laptops via the app.
Ideally, patients need to record more than 10,000 sentences to provide scientists with enough material to produce their synthetic voice.
“We edit together individual sounds of speech so we need a lot of sentences,” said Dr Jindrich Matousek, an expert on text-to-speech synthesis, speech modelling and acoustics who heads the project at the Pilsen university.
But there are drawbacks: Patients facing laryngectomies usually have little time or energy to do the recordings in the wake of a diagnosis that requires swift treatment.
To address these difficulties, scientists came up with a more streamlined method for the app, which is supported by the Technology Agency of the Czech Republic. Working with fewer sentences – ideally 3,500 but as few as 300 – this method uses advanced statistical models such as artificial neural networks.
“You use speech models with certain parameters to generate synthesised speech,” said Dr Matousek. “Having more data is still better, but you can achieve decent quality with less data of a given voice.”
Besides Czech, the Pilsen scientists have also created synthesised speech samples in English, Russian and Slovak.