codewhale

dgf1988/codewhale

Author	SHA1	Message	Date
Hunter Bown	a47b28e5d5	Complete v0.7.6 TUI polish and localization lane (#222 ) Squash-merge PR #222 after green CI and review cleanup.\n\nCloses #198, #199, #206, #207, #208, #209, #210, #212, #213, #214, #215, #216.	2026-04-29 13:06:51 -05:00
Hunter Bown	c2b2c284f6	release: v0.7.5 — token-basis fixes, shell timeout recovery, context/cache policy Issues #202, #203, #204, #205: - Cycle/seam triggers use active request input size + response headroom reserve, not lifetime cumulative API usage. - V4 hard-cycle headroom calibrated around fixed TURN_MAX_OUTPUT_TOKENS plus CONTEXT_HEADROOM_TOKENS safety buffer. - /tokens, /cost, footer/header labels, and docs now separate active context, turn telemetry, cumulative usage, cache hit/miss, context percent, and cost. - Foreground exec_shell timeout output tells the model the process was killed and suggests task_shell_start or background exec_shell plus poll/wait. - Added regression tests for active-token basis, V4 headroom, seam trigger basis, footer label behavior, and shell timeout recovery metadata. - Preserved #200/#201 policy: V4 default is append-only, prefix-cache preserving; replacement compaction, Flash seams, and capacity intervention remain opt-in.	2026-04-29 10:13:27 -05:00
Hunter Bown	0578eb701e	Add shell jobs and MCP manager to the TUI	2026-04-29 09:38:04 -05:00
Hunter Bown	41e8f2b5b2	Disable default compaction and opt in context seams	2026-04-29 09:12:20 -05:00
Hunter Bown	00c92e1c2a	Implement v0.7.4 long-running agent tools	2026-04-29 00:50:43 -05:00
Hunter Bown	6d8ab4c2b8	fix: close v0.7.2 issue cleanup	2026-04-28 23:09:19 -05:00
Hunter Bown	97846cd63a	release: include secrets crate in publish order	2026-04-28 16:39:22 -05:00
Hunter Bown	49d2be9e5c	refactor(tools): share tool result primitives from crate	2026-04-28 01:23:21 -05:00
Hunter Bown	4fb8372c1c	refactor(engine): split turn loop and capacity flow	2026-04-28 01:12:25 -05:00
Hunter Bown	3bc54b0bc0	fix(snapshot): harden side-git restore wiring	2026-04-28 00:46:24 -05:00
Hunter Bown	1f00ac6311	chore(release): drop auto npm publish, document manual flow, trim research PDF - Remove the `publish-npm` job from `release.yml`. It has been failing on every release with `npm error code EOTP` because the configured `NPM_TOKEN` doesn't bypass 2FA. Manual publish from a developer machine is the actual ship path; codify that. - Update `docs/RELEASE_RUNBOOK.md` "npm Wrapper Release" to describe the manual flow (`npm publish --access public` + OTP) and explain why the auto path is gone, with a recovery note for future Trusted-Publishing migration. - Refresh stale cross-reference comment in `publish-npm.yml` (the workflow remains as inert plumbing for an eventual Trusted Publishing setup). - Stop tracking `docs/DeepSeek_V4.pdf` (4.4 MB). It was never referenced outside test fixture filenames; the tests synthesize their own fake PDF. Add to `.gitignore` so a local copy can sit there without nagging. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-26 16:26:10 -05:00
Hunter Bown	feb3cf1e0c	feat: explain parallel fan-out caps in tool descriptions and error messages (fixes #81 )	2026-04-26 13:16:12 -05:00
Hunter Bown	c58d10ded1	feat(tools): mark alias tools with deprecation metadata Add `wrap_with_deprecation_notice` helper in the subagent module that merges a `_deprecation` block into a ToolResult's metadata. Applied exclusively on alias invocations: - `spawn_agent` → use `agent_spawn` (removed in v0.8.0) - `delegate_to_agent` → use `agent_spawn` (removed in v0.8.0) - `close_agent` → use `agent_cancel` (removed in v0.8.0) - `send_input` → use `agent_send_input` (removed in v0.8.0) Canonical names are unaffected. Each alias invocation also emits a `tracing::warn` so the deprecation appears in audit logs. Documents the deprecation schedule in `docs/TOOL_SURFACE.md`. Four unit tests verify the notice shape and that canonical tools stay clean. Refs #72 Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-26 12:32:26 -05:00
Hunter Bown	ca7ca9f75f	docs: drop stale handoff/migration/parity docs Removed: - `.claude/next-agent-prompt.md` (111 lines) — v0.4.6-era session prompt describing slices A/B/C that have all shipped. Successive sessions use fresh prompts (e.g. .deepseek/v0.6.0-overnight-review.md); this one is pure history. - `docs/archive/workspace_migration_status.md` (92 lines) — explicitly archived (April 11), describes a one-time migration that's complete. Removed enclosing `docs/archive/` directory too (was the only file). CHANGELOG entry from v0.4.x still narrates the archival as history. - `docs/parity_release_and_ci.md` (38 lines) — duplicates what `.github/workflows/parity.yml` and CONTRIBUTING.md already say authoritatively. Single source of truth wins. - `AI_HANDOFF.md` + `todo.md` (untracked, no commit needed) — `todo.md` was a 7-line pointer to AI_HANDOFF.md, which itself was an April 11 snapshot listing "remaining work" that's mostly delivered. CLAUDE.md is the live developer guide now. 1004/1004 tests still green; no doc/code references broken. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-26 00:27:58 -05:00
Hunter Bown	5f223adea6	v0.6.0: native rlm_query tool + scroll fix + cleanup Adds a structured rlm_query tool for parallel/batched LLM fan-out. The model calls it with one prompt or up to 16 concurrent prompts; children dispatch via tokio::join_all against the existing DeepSeek client. Default child model is deepseek-v4-flash; override per-call via the model field. Available in Plan / Agent / YOLO. Cost folds into the session's running total automatically. Fixes scroll-stuck regression (#56): TranscriptScroll::resolve_top and scrolled_by now use a three-level fallback chain (same line → same cell line 0 → nearest cell at-or-before) instead of teleporting to ToBottom when an anchor cell vanishes. Loosens command-safety chains (#57): cargo build && cargo test and similar chains of known-safe commands now escalate to RequiresApproval instead of being hard-blocked as Dangerous. Chains containing unknown commands still block. Suppresses the GettingCrowded footer chip — context-percent header already covers conversation pressure. Refactors: - Extracts file_mention parsing/completion/expansion (~450 LOC) from the 5,500-line ui.rs into crates/tui/src/tui/file_mention.rs. - Deletes truly unused helpers (write_bytes, timestamped_filename, extension_from_url, output_path, has_project_doc, primary_doc_path). Tests: 853 pass. cargo clippy --workspace -D warnings clean. cargo fmt --all -- --check clean. Closes #46 #47 #48 #49 #50 #53 #54 #55 #56 #57 #58.	2026-04-25 21:48:17 -05:00
Hunter Bown	027d6d19b6	docs(rlm): land Hetun design + helper layer + Sakana research methodology Captures the full RLM-fundamental story across the design doc, MODES.md, and the Hetun prompt. Tracking issues are now #46–#55 (helper layer filed as #53, Hetun as #54, vendoring as #55). What this nails down: - Hetun mode is added at the END of the Tab cycle (Plan → Agent → YOLO → Hetun → Plan), not as a Plan replacement. Default landing mode is unchanged so people don't accidentally start there. Plan stays as it is. - Mission-level approval, not block-level. Hetun runs a research phase, presents one mission card, and only executes after explicit user approval. Inside the execution turn the repl block runs straight through with no per-block prompts — that's the whole point of the mode. - The user's configured model is left alone on enter/exit. Pro/max users stay on Pro/max. The flash-as-coordinator behaviour is internal to the runtime (ZIGRLM_RLM_CMD always points to flash regardless of mode). No global model swap. - No /hetun slash command. Tab cycles into the mode; /plan keeps switching to Plan as today. - The helper layer (#53) is fundamental, not aleph-derived. A curated ~20-function ctx-helper module + AST-validated Python sandbox baked into the repl runtime so a single block can load → slice → fan out flash queries → aggregate without crossing tool boundaries. Inspired by aleph's pattern but our own native primitive — not a port. - Hetun research methodology adopts Sakana's Fugu patterns. The research phase is recursive novelty sampling + hierarchical narrative tree synthesis + multi-detector cross-verification (flash for breadth, Pro for depth) + hypothesis-verification loop. Not "fan out 8 fixed queries". This is what makes "Plan + Recursive Agents" meaningful versus a flash-coordinator wrapper. - No version-number framing anywhere. The plan ships as one cohesive RLM landing across #46/#48/#49/#50/#53/#54/#55 — order is dependency, not release schedule. We keep shipping. - Auto-compaction stays automatic. Removed a manual /compact nag from the Hetun prompt; the existing coherence + capacity system already handles this. Files: docs/rlm-design.md new — full design doc with Hetun details docs/research-react-vs-rlm.md new — supporting research treatment docs/MODES.md 4-mode cycle, Hetun added at end, Plan kept crates/tui/src/prompts/hetun.txt prompt teaching the recursive-novelty + hierarchical-synthesis + verification-loop rhythm, mission-card structure, two-step gate .gitignore ignore .claude/scheduled_tasks.lock runtime Closes nothing yet — implementation lands across the tracking issues. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-25 15:37:25 -05:00
Hunter Bown	82e4a564aa	refactor(#35 ): tighten agent prompt tool descriptions, drop alias dupes Tool-surface audit pass: - FILE OPERATIONS rewritten so each line states the niche, not just the verb. read_file mentions PDF auto-extraction + `pages` slicing. - New SEARCH section consolidates grep_files / file_search / web_search / fetch_url so the model sees them next to each other and picks the right one. fetch_url (#33) added; previously absent from the prompt. - request_user_input pulled out of FILE OPERATIONS into its own USER section — it never belonged there. - SUB-AGENTS list shrinks by 3: drops `spawn_agent` (use `agent_spawn`), `close_agent` (use `agent_cancel`), and the `agent_assign / assign_agent` dual-name. The underlying dispatchers still resolve those names, so existing sessions don't break — they just no longer pollute the model's tool list. Adds `docs/TOOL_SURFACE.md` with the rationale, the v0.5.1 final surface, and the dropped aliases. Calls out that grep_files is pure-Rust (no rg/grep shell-out, so the "fall back to grep" AC from #35 is vacuously satisfied — the tool has no shell dependency to fall back from). Closes #35.	2026-04-25 13:44:43 -05:00
Hunter Bown	0a394e1587	fix(#31 ): catch version drift in CI, not at release time Adds scripts/release/check-versions.sh and a `versions` CI job that runs on every push/PR. Verifies: - no per-crate Cargo.toml carries a literal version (must inherit the workspace version) - npm/deepseek-tui/package.json matches the workspace version - Cargo.lock is in sync with the manifests Closes #31.	2026-04-25 13:25:55 -05:00
Hunter Bown	29141bc89b	Add NIM env support and .env.example template	2026-04-25 07:21:43 -05:00
Hunter Bown	298f5c6c51	Merge branch 'claude/improve-deepseek-v4-harness-NxBpS' into main	2026-04-25 01:55:44 -05:00
Hunter Bown	853a39138c	feat: setup status/clean/dirs and protocol-recovery hardening Adds a compact `setup --status` view, a `setup --clean` for regenerable session checkpoints, and `--tools`/`--plugins` scaffolding for ~/.deepseek/{tools,plugins} so the extension model has a documented home that doctor can count. `doctor --json` lands as a CI-safe alternative to the human-readable doctor (skips the live API probe). Also locks down the engine's hostility to fake tool-call wrappers: filter_tool_call_delta and the marker constants are now testable, the streaming loop emits one compact status notice per turn when it strips a wrapper, and a new protocol_recovery integration test asserts that the legacy text parser never turns <function_calls> into a real tool call. Adds 23 unit tests + 14 integration tests covering both slices.	2026-04-25 06:26:07 +00:00
Hunter Bown	6ad3727fa0	Add provider switch and file mention attachments	2026-04-24 23:09:48 -05:00
Hunter Bown	16f62f7abf	Fix reasoning replay and context accounting for NIM	2026-04-24 18:42:18 -05:00
Hunter Bown	d0dc26ce25	Add NVIDIA NIM provider support for DeepSeek	2026-04-24 18:29:19 -05:00
Hunter Bown	f377b0f9d2	ci: add npm publish retry path	2026-04-24 16:15:48 -05:00
Hunter Bown	f3df8f5f26	ci: publish npm with trusted publishing	2026-04-24 16:13:59 -05:00
Hunter Bown	8323bedfb7	fix: restore default tui mouse scrolling	2026-04-24 11:41:31 -05:00
Hunter Bown	c4f9078712	release: deepseek tui 0.4.3	2026-04-24 10:48:35 -05:00
Hunter Bown	35595f8edc	fix: normalize legacy DeepSeek aliases to V4 flash	2026-04-23 23:08:44 -05:00
Hunter Bown	b7bd02d814	feat: DeepSeek V4 support with reasoning-effort control (0.4.0) Adds first-class DeepSeek V4 Pro and Flash support, updates the default model to deepseek-v4-pro, aligns legacy aliases with the current V4 1M context behavior, and fixes thinking-mode request handling. Key fixes: - Send DeepSeek's raw Chat Completions `thinking` parameter at the top level instead of SDK-only `extra_body`. - Preserve assistant `reasoning_content` for all prior thinking-mode tool-call turns so subsequent requests satisfy DeepSeek V4's replay requirement. - Fix npm wrapper concurrent first-run downloads by using per-process temporary download paths. - Add `.mailmap` so historical bot-attributed commits aggregate under Hunter Bown where mailmap is honored. Verified with the full local Rust gate, live DeepSeek V4 smoke, npm wrapper temp-install smoke, and green PR CI across Linux, macOS, and Windows.	2026-04-23 22:53:20 -05:00
Hunter Bown	dc8e94d705	docs: update documentation and cleanup for v0.3.33	2026-04-22 22:36:45 -05:00
Hunter Bown	cbcf35c1bd	release: prepare v0.3.32 — finance tool, header redesign, expanded tests Add Yahoo Finance quote tool with chart fallback, redesign header widget with proportional truncation and context bar, refactor footer status strip, expand test suite to 680+ tests, and fix blocking issues (usize underflow in header, tempdir leak in finance tests, per-call HTTP client creation).	2026-04-11 18:58:56 -05:00
Hunter Bown	b172b8d306	feat: remove Normal mode and consolidate to Agent (#4 ) Keep legacy /normal and settings fallback behavior mapped to Agent, align docs around the three visible modes, and include the current TUI and onboarding refinements in this worktree.	2026-03-12 11:32:25 -05:00
Hunter Bown	7b91169017	refactor: move source files into workspace crates - Move src/* into crates/tui/src/ to create a proper workspace structure - Add .claude/ and .trimtab/ directories for Trimtab closed-loop workflow - Add DEPENDENCY_GRAPH.md and update documentation - Update Cargo.toml files to reflect new crate dependencies - Update CI workflows and npm package scripts - All tests pass, release build works	2026-03-11 20:00:38 -05:00
Hunter Bown	37186c3d95	Workspace migration: split into modular crates, parity CI, release updates - Convert root to Cargo workspace with crates/ layout - Add deepseek-* crates mirroring Codex architecture - Add parity CI workflow with snapshot/protocol/state tests - Update release workflow to build both deepseek and deepseek-tui binaries - Bump version to 0.3.28	2026-03-02 17:52:46 -06:00
Hunter Bown	8d904129b5	Release 0.3.23	2026-02-24 10:31:54 -06:00
Hunter Bown	b88ce88a42	Improve TUI coherence, small-screen layout, and contrast guardrails	2026-02-19 10:09:41 -06:00
Hunter Bown	8b5f1bc83f	fix: harden runtime app pre-release issues	2026-02-18 11:12:40 -06:00
Hunter Bown	cfcdce3d03	feat: runtime and UX polish P1 features: - System prompt injection on session resume (ThreadRecord gains system_prompt field, ensure_engine_loaded passes it to Op::SyncSession) - Session resume bridge: GET/POST /v1/sessions/{id}, seed_thread_from_messages - Task detail panel with deep links (?task=<id>), timeline, tool calls - Tauri desktop build fix (CI=true in tauri:build script) P2 features: - Task detail auto-refresh polling for running/queued tasks (3s interval) - Session delete: DELETE /v1/sessions/{id} endpoint + palette delete button - Transcript full-text search input with combined role filter + search - Per-item copy-to-clipboard, collapse/expand for long outputs, filter chips Polish: - Typography scale CSS variables, skeleton loading utilities - Panel slide-in animation for task detail, fade-up for list items - Toast auto-dismiss (4s success, 6s error) with dismiss buttons - Focus trap in command palette, backdrop click to close - Escape key closes task detail panel - ARIA improvements: role=alert on error toasts, role=listbox on palette - Responsive breakpoints for tablet (task detail stacking, palette width) - prefers-reduced-motion respected throughout All validation checks pass: - cargo test (517 tests), cargo check - pnpm typecheck, lint (0 errors), test (36 tests) - pnpm web:build, desktop:build	2026-02-18 10:58:13 -06:00
Hunter Bown	1a04659a95	Add capacity memory controller and smoother TUI streaming	2026-02-17 16:09:07 -06:00
Hunter Bown	87884a1e84	Improve model handling, context recovery, and stabilize config tests	2026-02-16 17:57:22 -06:00
Hunter Bown	ab2c708ca7	feat: runtime API, task manager, and extensive improvements (v0.3.16) Major Features: - Runtime API for external integrations and turn management - Task manager with persistence and recovery - Shell output streaming and improved tool execution - Error taxonomy and audit logging - Command palette and UI enhancements Documentation: - Runtime API documentation - Operations runbook - Architecture updates Fixes: - Auto-compaction threshold and triggering logic - Doctor command API key validation - Clippy and formatting compliance	2026-02-16 10:51:39 -06:00
Hunter Bown	e0bccecd5c	Remove RLM/Duo modes and restore footer scroll	2026-02-03 18:29:36 -06:00
Hunter Bown	325aaefc00	Update tool parity and skills docs	2026-02-03 17:34:55 -06:00
Hunter Bown	a5c02c0eb4	release: v0.3.1	2026-01-27 01:04:48 -06:00
Hunter Bown	3204f556af	release: v0.3.0	2026-01-27 00:46:48 -06:00
Hunter Bown	2a5f40450a	Clean up repo for public release - Remove unnecessary files (tool_test_report.md, python/, pyproject.toml) - Remove internal docs (rlm_gap_analysis, VOICE_AND_TONE, PALETTE) - Remove pypi publish workflow - Fix clippy and rustdoc warnings for CI - Add note that Duo mode is experimental 🤖 Generated with [Claude Code](https://claude.ai/code)	2026-01-20 09:03:13 -06:00
Hunter Bown	6f1158a2d7	Initial release v0.1.0 DeepSeek TUI - Unofficial terminal UI + CLI for DeepSeek models. Features: - Interactive TUI with multiple modes (Normal, Plan, Agent, YOLO, RLM, Duo) - Comprehensive tool access with approval gating - File operations, shell execution, task management - Sub-agent system for parallel work - MCP integration for external tool servers - Session management and skills system - Cross-platform support (macOS, Linux, Windows) 🤖 Generated with [Claude Code](https://claude.ai/code)	2026-01-20 08:57:35 -06:00

48 Commits