XaiJu
Multiic
Multiic

patreon


announcement: local speech synthesis. bw LoRAs

bringing u style LoRAs and another kick-ass announcement:

upd: the update will be released on sunday

the game will have local speech synthesis from any voice reference file (.wav at least 6 seconds), using the coqui xtts-2 model.
quality’s above average, a bit below openai, runs fast and free, and voice cloning is super easy to use. compiling the binaries for this is a real pain, but I hope to have something for u by friday sunday •ᴗ•

now for the LoRAs:

there’ll also be a bugfix, generation speed via sd will be improved, and support for flux models for images will be added - on my laptop I couldn’t comfortably test flux, u need at least 16gb of vram, and it’s much slower than sdxl models

announcement: local speech synthesis. bw LoRAs

Comments

You can run flux with nunchaku on 8GB VRAM or less, it may be worth it for the better control over scene composition.

Anarchitect

nice LoRa , i like this b&w style too !

Youz


More Creators