Self-hosted inference: cost, latency, and compliance tradeoffs | saasbrowser.ai