XaiJu
RayenAI
RayenAI

patreon


Riko Project 2026-02-13

Riko Project 2026-02-13

Comments

having problems with audio_playback

Christian Matthews

Update from a Mac user. Just keep throwing the errors and files into claude and eventually it worked. Have to start all 3 servers one by one but she’s alive.

Sky Lee

Easy fix, just go to the "project riko" folder and edit the "start servers" file. Look for the Line with: echo Starting GPT-SoVITS Server (using run_api_v2.bat)... start "GPT-SoVITS" cmd /k "cd /d %SOVITS_PATH% && call go-api.bat" Change it to: echo Starting GPT-SoVITS Server (using run_api_v2.bat)... start "GPT-SoVITS" cmd /k "cd /d %SOVITS_PATH% && call runtime\python.exe api_v2.py" That will do the tick :)

KurioForBlue

Can someone help? I can’t run it cuz go-api.bat dosnt exist so i can’t run voice , is there a solution for it and /or can someone send it please or help?

Viktor Konyushenko

mac user here, how does one connect the SOvits? the mac version seems to have many different files, can't seem to connect the dots on a mac. Was wondering if any mac users gave it a shot? Thanks!

Sky Lee

Finaly fixed it! now, for everyone that has a rtx 50 series card and want to use it with GPT-SOVITS, you need a specific GPT-SOVITS version, not the one that can be download at the github repo., bc. of missmatching pytorch version for rtx 50 series. So that beening said, delete your currently GPT-SOVITS folder and just download this version from huggingface and extracted it. Then continue with it like in the video and boom -> It works :D https://huggingface.co/lj1995/GPT-SoVITS-windows-package/blob/main/GPT-SoVITS-v4-20250529-nvidia50.7z Dont forget to like his repo. on huggingface for this solution :) Also for the ones that has plenty VRAM on GPU, change the values in this file "tts_infer" to this, for faster responses (will load model completely in VRAM) \GPT_SoVITS\configs -> Find this file "tts_infer" and open it with your editor. device: cuda is_half: false

KurioForBlue

Ok found the error! :) It is again the RTX 50 Series that is causing a headache xD The problem is, that with the default GPT-SOVITS with api_v2 the "cuda" libs are not loading, bc. of missmatching versions. I need to find a way to integrated it natively with the one click version of GPT-SOVITS :D For the time being, i also found a workaround but it will definitely cook your CPU, and mine is a i7 13700k^^ Workaround: Navigate to this folder in your GPT-SOVITS Directory: 1. \GPT_SoVITS\configs -> Find this file "tts_infer" and open it with your editor. 2. Change the first lines to this: custom: bert_base_path: GPT_SoVITS/pretrained_models/chinese-roberta-wwm-ext-large cnhuhbert_base_path: GPT_SoVITS/pretrained_models/chinese-hubert-base device: cpu is_half: false What it does: In default it will take the cuda device (your GPU) to compile the txt2speech, but that will not work, like i said, so instead it will use your CPU for that, but keep in mind, it will load your cpu to 100% xD Will Update later with a "Cuda" solution :)

KurioForBlue

Got the same error :( Error in main loop: [Errno 2] No such file or directory: 'client\\audio\\output_32c3a876d4484519bba6bf00d8e4cd29.wav' It does also happen, when i try to execute the check.py. The strange thing is, that GPT-SOVITS will crash at decoding procedure, so maybe we need to set a output folder or something for temp decoding idk.

KurioForBlue

Error in sovits_gen: HTTPConnectionPool(host='127.0.0.1', port=9880): Max retries exceeded with url: /tts (Caused by NewConnectionError("HTTPConnection(host='127.0.0.1', port=9880): Failed to establish a new connection: [WinError 10061] No connection could be made because the target machine actively refused it")) Error in main loop: [Errno 2] No such file or directory: 'client\\audio\\output_32c3a876d4484519bba6bf00d8e4cd29.wav' I'm getting this whenever I try to initiate a conversation, any clue of what could be happening?

psychosykes

nvm i switched it to run api_v2.py

noobte

SoVITS doesnt seem to have the go-api.bat, does anyone have a solution?

noobte

he prob lacks the hardware. For fast streaming you need some serious vram

Doktor Sebo

Question: if you are using almost everything locally, why don't you instead of using gpt API just make it use a soemthing local like ollama, why spend money on gpt when you can do it for free and it doesnt take anytime at all to generate reponse compared to generating voice, and not hard to switch your code!

lava


More Creators