fix(rlm): pin child calls to flash (#832)

This commit is contained in:
Hunter Bown
2026-05-06 03:41:47 -05:00
committed by GitHub
parent 69714819f8
commit 03e59c60ce
7 changed files with 50 additions and 53 deletions
+1 -1
View File
@@ -16,7 +16,7 @@ Press `Shift+Tab` to cycle reasoning effort.
- **Agent**: multi-step tool use. Approvals for shell and paid tools (file writes are allowed without a prompt).
- **YOLO**: enables shell + trust mode and auto-approves all tools. Use only in trusted repos.
All three modes have access to the `rlm_query` tool — a structured tool call that fans out 116 cheap parallel children on `deepseek-v4-flash`. The model reaches for it when work is decomposable.
All three modes have access to the `rlm` tool. Inside its Python REPL, `llm_query_batched` fans out 116 cheap parallel child calls pinned to `deepseek-v4-flash`. The model reaches for it when work is decomposable.
## Compatibility Notes