
I bought my Pi with the idea of turning it into a small Siri-like personal assistant capable of performing simple tasks like checking emails, fetching the weather forecast, traffic info and so on. So obviously, I need my Pi to be able to speak and listen. I played around with Jasper, an open source platform for creating voice-controlled applications. It took me two long evenings to get it running, but the results were not satisfying. You can configure Jasper by choosing the tts (text to speech) and stt (speech to text) engines of your choice. I was able to talk with Jasper only with the combination of Pocketsphinx (stt) and eSpeak (tts), and only in English. My pronunciation must be very bad, as Jasper understood roughly every fifth word and spoke back with a very "robotic" voice. Jasper allows you to configure web-based engines like Google Speech API, wit.ai or Ivona, but this is either buggy or obsolete, as I was unable to get it running and couldn't find any recent threads on the internet about the problems I was facing. I want my Pi to read me my new emails written in Polish, and with the eSpeak engine this sounds like completely unintelligible robot babble :) So I gave up on Jasper and focused on finding a good solution for text to speech in the first place. But after googling for a while it turned out that Google's tts is somehow restricted to Android only. They are now promoting their new Google Cloud Speech API, but it can only turn speech into text, which is the opposite of what I want right now :)

I finally found what I was looking for in an Amazon Web Services offering called Amazon Polly. Amazon's former text to speech engine was Ivona (now deprecated), a name that may sound familiar to Polish people. Ivona was a speech synthesizer developed in Poland and bought by Amazon a few years ago. What is really cool: it can speak 20 languages, includes different voices and sounds very natural :) Unfortunately, using Polly is not free, but you can set up an account and test Polly for one year free of charge (monthly limits apply).
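
To give an idea of what talking to Amazon Polly from the Pi could look like, here is a minimal Python sketch. It is not the setup used in this post, only an illustration under a few assumptions: the third-party `boto3` package is installed, AWS credentials and a region are already configured on the machine, and the Polish voice `Maja` is used. The helper names `build_polly_request` and `speak_to_file` are made up for this example.

```python
def build_polly_request(text, voice_id="Maja", output_format="mp3"):
    """Assemble the keyword arguments for Polly's synthesize_speech call.

    "Maja" is assumed here as the Polish voice; any other voice ID works too.
    """
    if not text.strip():
        raise ValueError("text must not be empty")
    return {"Text": text, "VoiceId": voice_id, "OutputFormat": output_format}


def speak_to_file(text, path, client=None, voice_id="Maja"):
    """Synthesize `text` with Polly and write the audio bytes to `path`.

    Returns the number of bytes written. A client can be passed in, which
    makes it possible to try the function without touching AWS.
    """
    if client is None:
        # Imported lazily so the helpers above work without boto3 installed.
        import boto3  # third-party: pip install boto3
        client = boto3.client("polly")  # assumes credentials/region are set up

    response = client.synthesize_speech(**build_polly_request(text, voice_id))
    audio = response["AudioStream"].read()  # the audio arrives as a byte stream
    with open(path, "wb") as f:
        f.write(audio)
    return len(audio)
```

Calling `speak_to_file("Masz nową wiadomość.", "mail.mp3")` should leave an MP3 file that any audio player on the Pi can play back. Keeping the request building separate from the network call is just a convenience: it makes it easy to swap the voice or output format later.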
