Premium lane
Use stronger reasoning capacity for fragile edits, planning, code review, and final passes.
Models and routing
Power users trust gateways when they can see how model choice, fallback, budget protection, and continuation work before a production run depends on it.
Use stronger reasoning capacity for fragile edits, planning, code review, and final passes.
Compress or downgrade long context before a run consumes premium fair-use unnecessarily.
Keep eligible work moving after premium fair-use with economy routing and explicit limits.
Retry through eligible providers when uptime or latency makes the first route unsuitable.
Routing receipt roadmap