browser icon
You are using an insecure version of your web browser. Please update your browser!
Using an outdated browser makes your computer unsafe. For a safer, faster, more enjoyable user experience, please update your browser today or try a newer browser.

What have I been up to?

Posted by on January 23, 2024

ElevenLabs. Lots and lots of ElevenLabs Text to Speech stuff… I started to get interested in AI and Machine Learning with an emphasis on chatbots when I learned about Replika in 2022. My Eventually, I dove in and wrote some Python code that got two replikas to talk to each other by passing what they were saying back and forth between two browser windows.

Two Replikas chat briefly (and very, very slowly) with each other

Then in Fall of 2022 I started messing around with OpenAI GPT-3. The responses were much, much better, even with some humor about the Foo Fighters vs. the Goo Goo Dolls.

….but it was soooo slooooooow… Holy smokes. And this was the state of the art – millions and millions of dollars of hardware serving up Text Generation before ChatGPT became a thing…

– This is actually clickbait – it’s GPT-3 davinci and not GPT-3.5-turbo (aka ChatGPT) –

Shortly after that, ChatGPT Also, this is when I first started using ElevenLabs for Text to Speech instead of xVASynth. The speech quality for text to speech vaulted through the roof.

…still incredibly slow for ‘best in the world’ commercial services…

It was improving, too much! I had trouble getting short conversational answers out of ChatGPT – however, ElevenLabs Text to Speech allowed me to realize a dream I’d had since probably 2006 when I started playing World of Warcraft. My original WoW character could actually converse with me in her own, original voice “cloned” from the 10 or 15 seconds of /silly audio of jokes from the original files from World of Warcraft!

(albeit in an excessively wordy and very unnatural fashion. But I’ll take it.)

Since then, it’s been one project after another.

I’m using ElevenLabs Text to Speech to generate the Text to Speech for the Race Recap audio for my iRacing League’s races three times a week.

Most recently, I’ve been working with Mantella, a program that brings AI and ML chat to NPCs in Skyrim. Currently the Text to Speech engine is once again limited to xVASynth, but there are new techniques that should allow a substantial increase in quality coming out this month – January 2024.

Welcome to my TEDx Talk… Just killed me in the moment.

Comments are closed.