Can run online (large scale requires local latent decoding)

Can run locally with low VRAM (download gguf)