codewhale

dgf1988/codewhale

Author	SHA1	Message	Date
Hunter Bown	49890d1244	Merge PR #3036 from Hmbown: hide internal IDs from normal UI — stable labels for turns and agents fix(tui): hide internal IDs from normal UI — stable labels for turns and agents	2026-06-10 22:20:06 -07:00
Claude	418ad5b744	Merge origin/main into v0.8.58-3030-hide-internal-ids — combine #3030 stable agent labels with #3033 AgentProgress redraw throttle (both kept in App state and the AgentProgress arm)	2026-06-11 05:19:57 +00:00
Hunter Bown	eb610c83ee	Merge PR #3035 from Hmbown: throttle AgentProgress redraws to prevent freeze under subagent load fix(tui): throttle AgentProgress redraws to prevent freeze under subagent load	2026-06-10 22:11:17 -07:00
Hunter Bown	8fadd764d2	Merge PR #3042 from Hmbown: exec --allowed-tools, --disallowed-tools, --max-turns, --append-system-prompt feat(exec): add --allowed-tools, --disallowed-tools, --max-turns, --append-system-prompt	2026-06-10 22:11:07 -07:00
Hunter Bown	20fa626fb8	Merge PR #3041 from Hmbown: harvest error-message fixes — better tool denial and provider errors fix: harvest error-message fixes from PR #2933 — better tool denial + subagent conflict messages	2026-06-10 22:11:00 -07:00
Hunter Bown	b11d8d55c5	Merge PR #3044 from Hmbown: remote-smoke droplet loop — gh CLI, swapfile, agent-session bumps feat(remote-smoke): bump to v0.8.57, add gh CLI, swapfile, agent-session.sh, autonomous loop docs	2026-06-10 22:10:51 -07:00
Hunter Bown	f68059b9b3	Merge PR #3043 from Hmbown: agent-task issue template, labels, and runner protocol feat(docs): agent-task issue template, labels, and runner protocol	2026-06-10 22:10:42 -07:00
Hunter Bown	81b060928b	Merge PR #2579 from encyc: Phase 4 — replace Session.messages Vec with AppendLog refs(#2264): Phase 4 — replace Session.messages: Vec<Message> with AppendLog	2026-06-10 20:10:20 -07:00
Hunter Bown	544b44bd98	Merge PR #2892 from gordonlu: localize sandbox elevation dialog across 7 locales feat(i18n): localize sandbox elevation dialog across 7 locales	2026-06-10 20:10:05 -07:00
Claude	9de6c9d125	docs(remote-smoke): add gh auth setup-git + git identity to the autonomous-loop setup; qualify the AGENT_RUNNER.md cross-reference (file lands in #3043 ) with an on-branch fallback (#3022 ) Co-Authored-By: Claude <noreply@anthropic.com> https://claude.ai/code/session_018zaP8vUfTAsrE38L6h6fw5	2026-06-11 02:31:34 +00:00
Claude	e4ea208d53	docs(runner): fix resume example — exec has no 'latest' session alias; use --continue (#3021 ) Co-Authored-By: Claude <noreply@anthropic.com> https://claude.ai/code/session_018zaP8vUfTAsrE38L6h6fw5	2026-06-11 02:31:00 +00:00
Claude	5fb41cc209	test(errors): add #3020 test extensions — Plan-mode denial passes through verbatim, bare/model denials get the suffix; Model-Not-Exist + OpenAI-style rejections annotated; conflict error includes elapsed time; tighten mode-word predicate so 'model' no longer matches Co-Authored-By: Claude <noreply@anthropic.com> https://claude.ai/code/session_018zaP8vUfTAsrE38L6h6fw5	2026-06-11 02:30:30 +00:00
Claude	5a71d644f5	fix(tui): #3030 audit fixes — nickname beats generated Agent-N label; status bar uses stable labels (with raw-id fallback) for spawn/progress/complete; drop truncated raw id from compact detail line; add label/turn/step-counter tests Co-Authored-By: Claude <noreply@anthropic.com> https://claude.ai/code/session_018zaP8vUfTAsrE38L6h6fw5	2026-06-11 02:13:30 +00:00
Claude	df1b35ba0f	fix(tui): #3033 audit fix — throttled AgentProgress no longer cancels redraws owed to other events in the same drain batch; restore pre-event accumulator value; extract agent_progress_redraw_permitted + unit tests Co-Authored-By: Claude <noreply@anthropic.com> https://claude.ai/code/session_018zaP8vUfTAsrE38L6h6fw5	2026-06-11 02:03:17 +00:00
Claude	c15e937096	fix(exec): wire --disallowed-tools into the gate chain (deny wins over allow), filter the advertised tool catalog, honor --append-system-prompt in needs_engine, surface max-steps notice in text mode; add clap/gate/catalog tests (#3027 ) Co-Authored-By: Claude <noreply@anthropic.com> https://claude.ai/code/session_018zaP8vUfTAsrE38L6h6fw5	2026-06-11 00:31:05 +00:00
Hunter Bown	c98b7ea42c	fix: harvest error-message fixes from PR #2933 — better tool denial + subagent conflict messages (#3020 ) Three targeted error-message improvements extracted from community PR #2933 (author cy2311), with additional model-not-found annotation: 1. dispatch.rs format_tool_error: pass through self-explanatory messages that already name the cause (mode switch, allow_shell, feature flag, denied by user) instead of appending a conflicting generic suffix. Fixes the Plan-mode double-message (#2657). 2. subagent/mod.rs session-name conflict: include elapsed time (started Ns ago / NmNs ago) so the parent can distinguish a live worker from a stale/failed earlier spawn (#2656). 3. subagent/mod.rs annotate_child_model_error: catch model-not-found patterns (Model Not Exist, does not exist, no such model, etc.) in the raw error text even when the taxonomy classifies them as Internal rather than Authorization/State (#2653). Closes #2653, #2656, #2657. Credit: cy2311 for the dispatch.rs and subagent conflict hunks from #2933. Co-authored-by: cy2311 <29836092+cy2311@users.noreply.github.com>	2026-06-10 16:41:55 -07:00
Hunter Bown	b433989cc3	style: cargo fmt Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>	2026-06-10 16:40:27 -07:00
Hunter Bown	06d680240c	style: cargo fmt Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>	2026-06-10 16:40:17 -07:00
Hunter Bown	5483e1553d	feat(remote-smoke): bump to v0.8.57, add gh CLI, swapfile, agent-session.sh, autonomous loop docs (#3022 ) - setup-vm.sh: bump RELEASE_TAG default to v0.8.57, add gh CLI install step (official APT repo) and 4G swapfile creation (idempotent) - agent-session.sh: new sourceable helper that exports the provider key from /etc/codewhale/runtime.env for interactive agent sessions - README.md: update version refs, add agent-session.sh to layout, add Autonomous agent loop section with full pick->PR commands The droplet ops (binary upgrade, PAT setup, first end-to-end issue run) are documented as the next steps for the operator.	2026-06-10 16:20:57 -07:00
Hunter Bown	cef3b92964	feat(docs): agent-task issue template, labels, and runner protocol (#3021 ) Adds the distributed intelligence infrastructure so remote agents can autonomously execute v0.8.58 milestone issues: - .github/ISSUE_TEMPLATE/agent-task.yml — GitHub issue form with six required sections (Goal, Scope, Key files, Acceptance criteria, Verification, Out of scope). Auto-labels as agent-ready. - docs/AGENT_RUNNER.md — pick → claim → worktree → exec → verify → PR loop with safety rules, label semantics, and the issue body format. Labels agent-ready, agent-in-progress, needs-human already exist (created during milestone setup).	2026-06-10 16:19:07 -07:00
Hunter Bown	dbd9b9670d	feat(exec): add --allowed-tools, --disallowed-tools, --max-turns, --append-system-prompt (#3027 ) Headless exec hardening for benchmark/CI/droplet use: - New CLI flags: --allowed-tools, --disallowed-tools, --max-turns, --append-system-prompt - Add disallowed_tools to EngineConfig + command_denies_tool() helper - run_exec_agent threads all four flags into EngineConfig and Op::SendMessage - needs_engine now includes flag presence for standalone exec use	2026-06-10 16:17:33 -07:00
Hunter Bown	ec0789daf4	fix(tui): hide internal IDs from normal UI — stable labels for turns and agents (#3030 ) Three changes to replace raw UUIDs/hex-ids with stable user-facing labels: 1. Turn label: Add turn_counter to App, display "Turn N" instead of the raw runtime_turn_id UUID prefix. Full UUID preserved in hover text. 2. Agent labels: Add agent_counter + agent_label_map to App. Populated on AgentSpawned; sidebar rows use "Agent 1", "Agent 2" etc. instead of agent_<hex>. Nicknames and user-assigned names still take priority. 3. Step counter: Add format_step_counter() helper. When max_steps is u32::MAX (the unbounded sentinel), renders "step 16" instead of the meaningless "step 16/4294967295". Concrete step budgets still show the denominator.	2026-06-10 15:52:34 -07:00
Hunter Bown	7b1446f7b0	fix(tui): throttle AgentProgress redraws to prevent freeze under subagent load (#3033 ) When 4+ sub-agents run concurrently, each AgentProgress event triggers a full terminal redraw via received_engine_event → needs_redraw. The render loop saturates, sidebar recomputation dominates the frame budget, and terminal input events (including Ctrl+C) are starved. Limit progress-driven redraws to at most one per 100ms per agent. The status-animation timer (80ms cadence) still guarantees sidebar updates. Agent state is recorded immediately; the sidebar picks it up on the next permitted redraw. Adds last_agent_progress_redraw field to App to track throttle state.	2026-06-10 15:47:35 -07:00
Justin Gao	ebe828af27	fix: remove useless .into() on SavedSession.messages clone (#2579 ) SavedSession.messages is Vec<Message>, not AppendLog — .clone() already returns Vec<Message>, so .into() was a no-op conversion that triggered clippy::useless_conversion in CI lint.	2026-06-10 17:19:43 +08:00
Justin Gao	08904fde47	refs(#2264 ): Phase 4 — replace Session.messages: Vec<Message> with AppendLog (#2579 ) - Wire AppendLog as the backing store for Session.messages - Add Deref, From impls, and explicit mutation methods to AppendLog - Narrow API: remove DerefMut, add push_batch/truncate_to/trim_front/clear/last_mut - Update all direct message assignments to use .into() conversions - Update tests to deref through AppendLog for comparisons Rebased onto upstream/main (v0.8.57) to resolve merge conflicts.	2026-06-10 16:55:11 +08:00
Hunter B	b23067bacd	release: v0.8.57 — sleep-resume turns, docker fix, one-command release prep, changelog diet	2026-06-10 00:02:51 -07:00
Hunter B	f9c9764265	fix(tests): deflake prompt cache and MCP SSE mock-server tests - prompt_persist tests wrote through the developer's real ~/.codewhale/prompt_cache and raced each other (the eviction test could delete another test's entry). cache_dir() now honors CODEWHALE_PROMPT_CACHE_DIR and the tests run serialized against private tempdirs. - The raw TCP mock servers in mcp tests answered one request per socket but never advertised 'Connection: close', so reqwest pooled dead keep-alive connections and retried POSTs failed under parallel-suite load with 'connection closed before message completed' (~50% failure rate on the full suite).	2026-06-10 00:00:54 -07:00
Hunter B	ddd5df4b9b	chore: drop stale allow(dead_code) on AgentOpenTool (registered since v0.8.33)	2026-06-09 23:45:58 -07:00
Hunter B	c58ef8ddff	feat(release): generate the GitHub Release body from the CHANGELOG entry The workflow hardcoded install boilerplate plus a contributor list that had already drifted (v0.8.56's release thanked people 'for shaping v0.9.0'). The body now comes from scripts/release/generate-release-body.sh: static install/verify sections plus the tagged version's changelog section, which already carries the per-release credits.	2026-06-09 23:44:57 -07:00
Hunter B	4465459b69	feat(release): one-command version bump via prepare-release.sh; close version-drift gaps - scripts/release/prepare-release.sh bumps workspace + crate pins + npm wrapper + README install tags, refreshes Cargo.lock, regenerates the TUI changelog slice and web facts, then runs check-versions.sh - check-versions.sh now also gates web/lib/facts.generated.ts and the README install-tag examples (both drifted silently before) - .cnb.yml validates the pushed tag against Cargo.toml before generating mirror release notes - RELEASE_CHECKLIST/RUNBOOK updated accordingly (v0.8.56 needed 9 fix commits for exactly these sync points)	2026-06-09 23:43:15 -07:00
Hunter B	717d728163	feat: survive system sleep mid-turn — detect the suspend gap and retry the request (#2990 ) When the host sleeps while a model response is streaming, the connection dies on wake with 'Stream read error: error decoding response body' and the turn was lost. The engine now stamps every stream chunk with both monotonic and wall-clock time; Instant pauses across a suspend while SystemTime does not, so a >10s divergence on a stream error identifies a sleep/wake cycle. In that case the partial output is discarded and the identical request is re-issued (sharing the existing MAX_STREAM_RETRIES=3 budget) instead of failing the turn. Ordinary network flakes keep the deliberate no-retry-after-content policy from #103.	2026-06-09 23:40:42 -07:00
Hunter B	fad04a016f	polish: finish the rebrand in agent-facing surfaces - system prompt environment key deepseek_version -> codewhale_version - drop legacy .deepseek/instructions.md from the Local Law prompt tier (the engine still reads it for back-compat) - instructions-file truncation marker now states how many bytes were omitted so the model knows what it is missing - CODEWHALE_CHANGELOG const + user-facing /change strings - codewhale metrics doc headers	2026-06-09 23:36:32 -07:00
Hunter B	6dcdc19077	chore: drop unused deps (tracing-appender, zeroize, rustls in release), orphaned vendor lockfile and one-off verify_task.sh	2026-06-09 23:33:47 -07:00
Hunter B	44c13eb63f	fix(release): check-versions validates the generated TUI changelog slice, not byte equality The packaged changelog is now a recent-releases slice produced by scripts/sync-changelog.sh (which gains a --check mode); also restore the SECURITY.md contact line the version gate guards, and finish the stale binary-name sweep (--bin codewhale examples, qa harness doc).	2026-06-09 23:32:40 -07:00
Hunter B	626032ad6b	fix(devcontainer): use codewhale user/name/mount instead of pre-rebrand deepseek	2026-06-09 23:30:53 -07:00
Hunter B	26947bd407	fix(docker): stop copying legacy deepseek binaries that no longer exist The release docker job failed on 'cp: cannot stat .../release/deepseek'. Legacy entrypoints survive as symlinks to the codewhale binaries.	2026-06-09 23:30:45 -07:00
Hunter B	68aee8409f	chore: move HarmonyOS clang wrappers to scripts/ohos/	2026-06-09 23:24:13 -07:00
Hunter B	4a4ea63820	chore: add CODEOWNERS and dependabot config	2026-06-09 23:23:50 -07:00
Hunter B	a2bf9f806a	chore: gitignore benchmark results/ and __pycache__ under scripts/	2026-06-09 23:23:37 -07:00
Hunter B	6551106e79	docs: move internal design docs into docs/rfcs/	2026-06-09 23:23:25 -07:00
Hunter B	854274de1d	docs: remove internal US VM setup notes	2026-06-09 23:23:03 -07:00
Hunter B	bf2e5504a2	docs: remove stale internal docs (handoffs, old audits, orphaned roadmaps)	2026-06-09 23:22:32 -07:00
Hunter B	b4edb4e1ef	build: generate crates/tui/CHANGELOG.md as a 15-release slice instead of a full manual copy The /change command embeds this file into every binary via include_str!; it now carries only recent releases (regenerated by scripts/sync-changelog.sh, wired into the release checklist). The explicit-version test derives its fixture versions from the embedded slice instead of hardcoding old ones.	2026-06-09 23:22:00 -07:00
Hunter B	258d75376c	docs: archive changelog entries v0.8.39 and older to docs/CHANGELOG_ARCHIVE.md	2026-06-09 23:19:11 -07:00
Hunter B	bcaf655316	fix: bump npm wrapper version, add v0.8.56 compare link, resync TUI CHANGELOG	2026-06-09 22:35:50 -07:00
Hunter B	6a2ec32c00	fix: bump all internal crate dependency versions to 0.8.56	2026-06-09 22:30:27 -07:00
Hunter B	710b094c8a	fix: sync crates/tui/CHANGELOG.md with workspace root for v0.8.56 test gate	2026-06-09 22:26:24 -07:00
Hunter B	5783b05b0a	fix: update Cargo.lock after version bump to 0.8.56	2026-06-09 22:14:45 -07:00
Hunter B	ac56bd85b9	polish: drop deferred npm language from INSTALL.md, bump to v0.8.56	2026-06-09 22:12:41 -07:00
Hunter B	f1c73bda9b	polish: bump CNB install tags, remove deferred npm language, regen web facts to 0.8.56	2026-06-09 22:12:00 -07:00

1 2 3 4 5 ...

2551 Commits