b217f18118
Persona-pinned legal model (dbn-legal-agent-v3, served from the home GPU pod) hard-failed ask/legal-analysis whenever the GPU was powered off. Add a cached health-ping gate plus reactive try/catch fallback: if the fine-tune is unreachable, transparently route to gpt-4o and surface a localized notice in what_remains_uncertain that the specialized model is temporarily offline while corpus, retrieval, and sources remain live. Cloud models are excluded from the gate so gpt-4o personas never degrade. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>