Can we get a bit more info? Does it run locally? What specs does it need? Which technology does it use, something open-ended like Whisper? Or something faster with a prefefined set of sentences like VOSK? Which TTS engine does it use? Does it do other languages than just English?
services.tabby.enable = true; services.tabby.acceleration = "cuda";
? Could be another way.