📰 Source: upgoat.net | Upgoat
✍️ Original author: SithEmpire
⬆️ score: 1
v/AI · by u/SithEmpire
📝 Original content:
I have the LocalAI host interface working, one of the few which understands audio as a possible output. Between its library integration for backends and models, it’s been decent for a good many generative uses.
Unless I’m missing something, the AI TTS landscape looks much more messy and in-development, very little “just works” beyond some bland voices which are barely better than non-AI TTS from a decade ago. Many models seem to have their own backend just for that, which becomes an impasse when someone used a crappy rust port which doesn’t download anymore. Projects with great samples, but model data designed to work only via REPL with a load of python packages, and ends up unusable because its libavcodec wrapper doesn’t compile and running an inference anyway then attempts to UPLOAD itself to huggingface (I have no idea why).
Ultimately I’m looking for good pronunciation, a bit of expression which at least pretends it’s talking to a friend instead of narrating at me, and where to find good non-english models.
This post was automatically imported by OratioRepostBot.


