Strixa AI
TopicsSearchPricing
Sign inStart tracking
Strixa AI
TopicsSearchPricing
Sign inStart tracking
S
Intelligence HubEnterprise Workspace
New Tracking
Topics DirectoryTrend AnalysisEvidence PanelSignal FeedTechnical Events
DocumentationAccount
Topics Directory/Model Routing and Gateways
Stage: Expansion

Model Routing and Gateways

Track important changes in Model Routing and Gateways, including capabilities, product updates, adoption signals, risks, and evidence worth continued monitoring.

MODEL ROUTINGTRACKING
Live from /v1/topics/model_routing_and_gateways
Timeline
6 events
Signals
6 signal records
Evidence
6 evidence items
Sources
2 sources

HighTrend velocity

6 hours agoLatest tracked change

Subscribe to Topic

Signal Feed

Changes worth continued tracking

6 unique signals
  1. pull requestMay 19, 2026, 10:34 AM

    Make repeated run cancel requests idempotent for interrupted runs

    A TOCTOU race in the gateway cancel flow caused two concurrent `cancel_run` calls to both pass the initial existence check, so when the first request interrupted the run the second still returned `409` even though cancellation had effectively happened. The fix is a post-cancel re-read under the authoritative state; if that status is `interrupted` or the record is gone, the router now returns `202`, while preserving `409` for truly non-cancellable states.

    What ChangedA TOCTOU race in the gateway cancel flow caused two concurrent `cancel_run` calls to both pass the initial existence check, so when the first request interrupted the run the second still returned `409` even though cancellation had effectively happened. The fix is a post-cancel re-read under the authoritative state; if that status is `interrupted` or the record is gone, the router now returns `202`, while preserving `409` for truly non-cancellable states.
    Why It MattersOperators and API clients that retry or issue concurrent cancel requests now avoid unpredictable 409 failures, so cancel workflows become reliable instead of intermittently failing halfway through automation scripts or user-driven retries. Concretely, the router now treats already-interrupted or cleaned-up runs as a successful terminal outcome (202) after a cancellation attempt, while still surfacing 409 when the run is genuinely non-cancellable; continue watching for regressions in status edge cases and any latency overhead from the extra status read under high cancel-throughput workloads.
    Final score 81Confidence 971 evidence itemcancel_rungateway routerrun statusTOCTOU raceinterrupted202 accepted409 conflict_cancel_conflict_detail
    Analyze Evidence
  2. pull requestMay 19, 2026, 4:28 PM

    rUFlo: stop false dependency cycles from TypeScript type-only imports

    This PR fixes kg-extract so TypeScript `import type` edges are no longer counted as runtime dependencies, preventing false graph cycles, and updates kg-traverse to use an enabled AgentDB path while adding CI smoke checks for the new behavior.

    What ChangedThis PR fixes kg-extract so TypeScript `import type` edges are no longer counted as runtime dependencies, preventing false graph cycles, and updates kg-traverse to use an enabled AgentDB path while adding CI smoke checks for the new behavior.
    Why It MattersDevelopers and operators running ruflo’s knowledge-graph extraction on large TypeScript codebases will get cleaner outputs with fewer fake dependency-cycle alerts, which reduces wasted debugging and makes refactoring decisions more trustworthy; watch for any parser/template changes or SKILL.md drift that could reintroduce false-positive cycle detection. The pipeline now also locks in the expected tool and relation rules through CI checks, so a future change that re-enables the wrong controller or regresses `import type` handling is more likely to be caught before merging.
    Final score 80Confidence 931 evidence itemkg-extractkg-traversetype-depends-onimport typeagentdb_pattern-searchagentdb_semantic-routeSKILL.mdscripts/smoke-kg-extract-type-imports.mjs
    Analyze Evidence
  3. pull requestMay 19, 2026, 3:42 AM

    Aider adds OrcaRouter as a first-class provider path

    Aider introduces native OrcaRouter support (`orcarouter/` model prefixes) so users can invoke OrcaRouter-hosted models through existing aider flows with one provider-style key, instead of treating it as an external add-on path.

    What ChangedAider introduces native OrcaRouter support (`orcarouter/` model prefixes) so users can invoke OrcaRouter-hosted models through existing aider flows with one provider-style key, instead of treating it as an external add-on path.
    Why It MattersAider users can now run a broad set of third-party models via one OrcaRouter entry (`orcarouter/<vendor>/<model>`) with consistent metadata and routing, which makes model experiments and provider switching in day-to-day workflows faster and less error-prone. The router implementation reuses aider’s existing OpenAI-compatible request path by rewriting to `openai/<rest>` for litellm and injecting OrcaRouter credentials/headers, so teams should watch pricing-feed availability and format changes, 24-hour cache drift, and any provider-side API/base-url incompatibilities that could cause wrong model selection or unexpected token-cost estimates.
    Final score 80Confidence 931 evidence itemOrcaRouteraiderOrcaRouterModelManagerModelInfoManagerModel.send_completionlitellmORCAROUTER_API_KEYorcarouter-auto
    Analyze Evidence
  4. pull requestMay 16, 2026, 3:03 AM

    Mulch CLI upgrade makes `ml prune` non-destructive by default

    Overstory’s dependency bump to `@os-eco/mulch-cli` 0.8.0 is mainly about safer record lifecycle handling: `ml prune` now defaults to soft-archiving stale records instead of deleting them, so cleanup becomes reversible when operators hit a bad classification. This is the key behavioral shift that changes how teams can recover from maintenance mistakes.

    What ChangedOverstory’s dependency bump to `@os-eco/mulch-cli` 0.8.0 is mainly about safer record lifecycle handling: `ml prune` now defaults to soft-archiving stale records instead of deleting them, so cleanup becomes reversible when operators hit a bad classification. This is the key behavioral shift that changes how teams can recover from maintenance mistakes.
    Why It MattersRepo operators and maintainers using `ml prune` can now recover from overly aggressive cleanup runs, because stale records are preserved in archive files instead of being deleted, reducing operational risk when maintaining shared knowledge data. The release also adds explicit archived-state tracking (`status: "archived"`, `archived_at`) and restoration support, improving rollback visibility and recovery flow. Watch for scripted cleanup workflows that assumed deletion, monitor archive growth, and validate cross-domain restore behavior so recovery does not become ambiguous.
    Final score 77Confidence 931 evidence item@os-eco/mulch-climl prunesoft-archive.mulch/archiveml restore
    Analyze Evidence
  5. pull requestMay 19, 2026, 11:45 PM

    Refresh Goose canonical model registry to include new Gemini entries

    The PR regenerates Goose’s canonical model registry from models.dev and updates the provider metadata snapshot so the catalog reflects newly available model/provider data, including google/gemini-3.5-flash. This is a catalog data refresh of routing metadata rather than a runtime or training change.

    What ChangedThe PR regenerates Goose’s canonical model registry from models.dev and updates the provider metadata snapshot so the catalog reflects newly available model/provider data, including google/gemini-3.5-flash. This is a catalog data refresh of routing metadata rather than a runtime or training change.
    Why It MattersDevelopers and operators using Goose for model routing can now discover and use newly listed models like google/gemini-3.5-flash faster, without manual intervention to patch registry files, which can shorten time to enable new model capabilities in deployments. The refresh updates `canonical_models.json` and `provider_metadata.json` from the source catalog, so the next step to watch is whether any regenerated entries have incorrect metadata or unavailable endpoints in real traffic before broader production routing.
    Final score 74Confidence 961 evidence itemcanonical model registryprovider metadata snapshotmodels.devgoogle/gemini-3.5-flash
    Analyze Evidence
  6. releaseMay 16, 2026, 12:34 AM

    Versioned storage downloads for cache-safe CDN updates

    InsForge v2.1.6 adds `?v=<etag>` to storage download URLs to force CDN cache-busting when artifact content changes.

    What ChangedInsForge v2.1.6 adds `?v=<etag>` to storage download URLs to force CDN cache-busting when artifact content changes.
    Why It MattersUsers and operators pulling release artifacts are less likely to deploy stale binaries from CDN caches after an update, which reduces rollout inconsistencies and unexpected behavior caused by outdated files. Because the URL now changes with content, cache invalidation becomes content-driven; monitor cache hit/miss signals and any edge/client URL normalization behavior to ensure query parameters are preserved and that ETag values always change when bytes do.
    Final score 66Confidence 931 evidence itemstorageCDNETagdownload URLcache busting
    Analyze Evidence

Topic Timeline

How the topic has changed over time

6 events
  1. May 19, 2026, 11:45 PM

    pull request

    Refresh Goose canonical model registry to include new Gemini entries

    The PR regenerates Goose’s canonical model registry from models.dev and updates the provider metadata snapshot so the catalog reflects newly available model/provider data, including google/gemini-3.5-flash. This is a catalog data refresh of routing metadata rather than a runtime or training change.
    ContributionRebuilt and synchronized the canonical model registry and provider metadata files from models.dev, adding new provider/model entries and reducing stale catalog data for model routing.
    ImpactDevelopers and operators using Goose for model routing can now discover and use newly listed models like google/gemini-3.5-flash faster, without manual intervention to patch registry files, which can shorten time to enable new model capabilities in deployments. The refresh updates `canonical_models.json` and `provider_metadata.json` from the source catalog, so the next step to watch is whether any regenerated entries have incorrect metadata or unavailable endpoints in real traffic before broader production routing.
  2. May 19, 2026, 4:28 PM

    pull request

    rUFlo: stop false dependency cycles from TypeScript type-only imports

    This PR fixes kg-extract so TypeScript `import type` edges are no longer counted as runtime dependencies, preventing false graph cycles, and updates kg-traverse to use an enabled AgentDB path while adding CI smoke checks for the new behavior.
    ContributionImplemented a concrete dependency-classification fix that separates type-only imports from value imports in kg-extract, eliminating phantom cycle links, then re-routed traversal to a supported AgentDB search controller and added explicit smoke tests (contract + behavioral fixture) to prevent the regression.
    ImpactDevelopers and operators running ruflo’s knowledge-graph extraction on large TypeScript codebases will get cleaner outputs with fewer fake dependency-cycle alerts, which reduces wasted debugging and makes refactoring decisions more trustworthy; watch for any parser/template changes or SKILL.md drift that could reintroduce false-positive cycle detection. The pipeline now also locks in the expected tool and relation rules through CI checks, so a future change that re-enables the wrong controller or regresses `import type` handling is more likely to be caught before merging.
  3. May 19, 2026, 10:34 AM

    bugfix

    Make repeated run cancel requests idempotent for interrupted runs

    A TOCTOU race in the gateway cancel flow caused two concurrent `cancel_run` calls to both pass the initial existence check, so when the first request interrupted the run the second still returned `409` even though cancellation had effectively happened. The fix is a post-cancel re-read under the authoritative state; if that status is `interrupted` or the record is gone, the router now returns `202`, while preserving `409` for truly non-cancellable states.
    ContributionChanged the cancel path to be idempotent in race windows by re-reading run state after the lock-protected `cancel()` step and routing responses by authoritative post-lock status (`interrupted`/missing => 202, otherwise 409 via conflict logic).
    ImpactOperators and API clients that retry or issue concurrent cancel requests now avoid unpredictable 409 failures, so cancel workflows become reliable instead of intermittently failing halfway through automation scripts or user-driven retries. Concretely, the router now treats already-interrupted or cleaned-up runs as a successful terminal outcome (202) after a cancellation attempt, while still surfacing 409 when the run is genuinely non-cancellable; continue watching for regressions in status edge cases and any latency overhead from the extra status read under high cancel-throughput workloads.
  4. May 19, 2026, 3:42 AM

    pull request

    Aider adds OrcaRouter as a first-class provider path

    Aider introduces native OrcaRouter support (`orcarouter/` model prefixes) so users can invoke OrcaRouter-hosted models through existing aider flows with one provider-style key, instead of treating it as an external add-on path.
    ContributionAdd a native OrcaRouter provider integration path in aider: new `OrcaRouterModelManager` loads and caches OrcaRouter model catalog data (max tokens, input/output costs), `ModelInfoManager` resolves `orcarouter/` model metadata when litellm lacks built-in details, and `Model.send_completion` routes calls through `api.orcarouter.ai` using `ORCAROUTER_API_KEY` while preserving aider’s existing alias and per-provider behavior.
    ImpactAider users can now run a broad set of third-party models via one OrcaRouter entry (`orcarouter/<vendor>/<model>`) with consistent metadata and routing, which makes model experiments and provider switching in day-to-day workflows faster and less error-prone. The router implementation reuses aider’s existing OpenAI-compatible request path by rewriting to `openai/<rest>` for litellm and injecting OrcaRouter credentials/headers, so teams should watch pricing-feed availability and format changes, 24-hour cache drift, and any provider-side API/base-url incompatibilities that could cause wrong model selection or unexpected token-cost estimates.
  5. May 16, 2026, 3:03 AM

    dependency update

    Mulch CLI upgrade makes `ml prune` non-destructive by default

    Overstory’s dependency bump to `@os-eco/mulch-cli` 0.8.0 is mainly about safer record lifecycle handling: `ml prune` now defaults to soft-archiving stale records instead of deleting them, so cleanup becomes reversible when operators hit a bad classification. This is the key behavioral shift that changes how teams can recover from maintenance mistakes.
    ContributionUpgrades overstory’s mulch-cli dependency to 0.8.0, where the prune lifecycle behavior changes from hard deletion to soft archive by default; stale records are retained in archive files with lifecycle metadata, enabling controlled recovery instead of immediate data loss.
    ImpactRepo operators and maintainers using `ml prune` can now recover from overly aggressive cleanup runs, because stale records are preserved in archive files instead of being deleted, reducing operational risk when maintaining shared knowledge data. The release also adds explicit archived-state tracking (`status: "archived"`, `archived_at`) and restoration support, improving rollback visibility and recovery flow. Watch for scripted cleanup workflows that assumed deletion, monitor archive growth, and validate cross-domain restore behavior so recovery does not become ambiguous.
  6. May 16, 2026, 12:34 AM

    release

    Versioned storage downloads for cache-safe CDN updates

    InsForge v2.1.6 adds `?v=<etag>` to storage download URLs to force CDN cache-busting when artifact content changes.
    ContributionThe release introduces ETag-based version parameters on generated download links so updated artifacts get a new URL identity and avoid being silently served as stale content by caches.
    ImpactUsers and operators pulling release artifacts are less likely to deploy stale binaries from CDN caches after an update, which reduces rollout inconsistencies and unexpected behavior caused by outdated files. Because the URL now changes with content, cache invalidation becomes content-driven; monitor cache hit/miss signals and any edge/client URL normalization behavior to ensure query parameters are preserved and that ETag values always change when bytes do.

Evidence Trail

  1. github_pull_request

    aaif-goose/goose PR #9331: chore: update canonical model registry

    Regenerate canonical model registry from models.dev, update provider metadata snapshot, and include newly available Gemini entries such as google/gemini-3.5-flash.

    Open Source
  2. github_pull_request

    ruvnet/ruflo PR #2055: fix: #2049 kg-extract type-import classification + #2047/#2048 follow-up notes

    Adds a `type-depends-on` relation with `import type` carve-out logic in kg-extract, switches kg-traverse from disabled `agentdb_semantic-route` to enabled `agentdb_pattern-search`, and introduces `smoke-kg-extract-type-imports.mjs` checks to enforce the classifier contract and behavior.

    Open Source
  3. github_pull_request

    bytedance/deer-flow PR #3058: fix(gateway): make cancel idempotent for already-interrupted runs

    Fixes #3055: repeated cancel calls on an active run could return 409 about 50% of the time; the router now re-fetches status after `cancel()` and returns 202 when the run is already interrupted or removed.

    Open Source
  4. github_pull_request

    aider-ai/aider PR #5147: feat: add OrcaRouter (orcarouter/) as a first-class provider

    PR #5147 adds OrcaRouter provider integration by wiring pricing/catalog metadata and completion routing so `orcarouter/<vendor>/<model>` calls are handled as an aider-native model source.

    Open Source

Source Coverage

github pull request
5 events · 5 evidence items
6 hours ago
github release
1 event · 1 evidence item
4 days ago

Subscribe to this topic

Keep tracking Model Routing and Gateways with weekly digests and high-signal alerts once your account subscription is active.

Sign in to subscribeReview Pro tracking

Watching Next

Model Routing and Gateways tracks source-backed changes, trend stages, evidence volume, and the signals worth watching over time.

Turn on alerts