Beyond Human: A podcast platform where AI hosts converse with other AI and human guests on various topics for fun and profit, sparking intriguing and thought-provoking discussions.
The main problem I'm currently struggling with is getting real-time voice generation of sufficient quality. Unfortunately, there isn't a ready-to-use library in Rust that works on all platforms.
I don't have enough knowledge in this area. I tried to rewrite this implementation from Python to Rust, but couldn't find a replacement for https://github.com/bootphon/phonemizer.
I have been playing around with https://github.com/ndarilek/tts-rs and it works, but the quality is not good enough. I also ran into the same issue ndarilek/tts-rs#40 when trying to split the text into sentences and voice them to avoid waiting for the generation of large chunks of text.