codewhale

dgf1988/codewhale

Author	SHA1	Message	Date
Hunter Bown	bef1895bed	Merge pull request #518 from Hmbown/feat/489-memory-mvp feat(memory): user-memory MVP — persistent notes, `# ` quick-add, /memory, remember tool (#489–#493)	2026-05-03 08:18:47 -05:00
Hunter Bown	7547d168a4	feat(memory): user-memory MVP — persistent notes, # quick-add, /memory, remember tool (#489–#493) Adds a small, opt-in user-memory layer so the model has access to durable preferences and conventions across sessions, and the user can dump quick notes without leaving the TUI. ### What ships - Hierarchy loader (#490): on every prompt assembly the engine reads `Config::memory_path()` (defaults to `~/.deepseek/memory.md`, override via `memory_path` in config or `DEEPSEEK_MEMORY_PATH`) and injects the file as a `<user_memory>` block alongside the existing `<project_instructions>` block. Goes above the volatile-content boundary so prefix-cache stays warm. Oversize files (>100 KiB) are truncated with a marker. - `# foo` composer quick-add (#492): typing a single line that starts with `#` (but not `##` / `#!`) appends a timestamped bullet to the memory file and consumes the input — no turn fires. The composer status line surfaces the path that was written. Multi-`#` prefixes deliberately fall through so users can paste Markdown headings. - `/memory` slash command (#491): `/memory` (or `/memory show`) prints the resolved path and contents inline; `/memory path`, `/memory clear`, and `/memory edit` (prints `${VISUAL:-${EDITOR:-vi}} <path>`) cover the rest of the manual-curation surface. - `remember` tool (auto-update): model-callable tool that takes a `note` string and appends it as a bullet — the same persistence path as `# foo`. Auto-approved (writes only the user's own memory file). Only registered when memory is enabled, so it doesn't pollute the catalog when the feature is off. - Opt-in toggle (#493): default behaviour is off. Enable with `[memory] enabled = true` in `config.toml` or `DEEPSEEK_MEMORY=on` in the environment. ### What's wired - New `crates/tui/src/memory.rs` module (`load`, `as_system_block`, `compose_block`, `append_entry`). - New `crates/tui/src/tools/remember.rs` (`RememberTool` + 3 tests). - New `crates/tui/src/commands/memory.rs` (`memory(app, arg)` handler). - `EngineConfig` gains `memory_enabled: bool` + `memory_path: PathBuf`. - `ToolContext` gains `memory_path: Option<PathBuf>`. - `App` exposes `memory_path` + `use_memory` from `AppOptions` (previously destructured-and-dropped); `main.rs` populates `use_memory` from `config.memory_enabled()`. - `system_prompt_for_mode_with_context_and_skills` accepts an optional `user_memory_block` parameter; the engine computes it via `memory::compose_block(...)` and threads it through. - Composer Enter handler intercepts `# foo` only when `config.memory_enabled()` is true; otherwise falls through to existing turn-submission path. - `MemoryConfig` table (`[memory] enabled`) added to `Config`, surfaced in `config.example.toml`, plumbed through `merge_config`. ### Tests - 8 unit tests in `memory::tests` covering `load` (missing / whitespace / real), `as_system_block` (xml shape, empty input, oversize truncation), and `append_entry` (creation, repeated append, empty-after-strip rejection). - 3 unit tests in `tools::remember::tests` covering disabled-state error, successful append, and missing-`note`-field validation. ### Verification cargo fmt --all -- --check ✓ cargo clippy --workspace --all-targets --all-features --locked -- -D warnings ✓ cargo test --workspace --all-features --locked ✓ (1821 + supporting; was 1809 on main) Closes #490 #491 #492 #493 Refines #489 (EPIC parent — phase-1 MVP delivered; phase-2 items 494–497 stay on the v0.9.0 board) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-03 02:51:17 -05:00
Hunter Bown	3013a54c78	feat(tui): emit OSC 8 hyperlinks so URLs are Cmd+click-openable (#498 ) Modern terminals (iTerm2, Terminal.app 13+, Ghostty, Kitty, WezTerm, Alacritty, recent gnome-terminal/konsole) make a URL clickable when it's wrapped in: \x1b]8;;TARGET\x1b\\LABEL\x1b]8;;\x1b\\ Terminals that don't understand the sequence simply render the visible LABEL and ignore the escape, so emitting OSC 8 is a strict UX upgrade for supporting terminals and a no-op for the rest. ### What's wired - New `crates/tui/src/tui/osc8.rs` module with `wrap_link(target, label)`, `strip_into(s, &mut out)`, and a process-wide `ENABLED` AtomicBool that defaults to `true`. - `markdown_render::render_line_with_links` now wraps recognized URLs (`http(s)://…`) in OSC 8 when the runtime flag is on. Display width is computed from the bare URL — the escapes are zero-width on supporting terminals. - `ui_text::line_to_string` and `line_to_plain` strip OSC 8 wrappers when the span content contains an escape, so selection / clipboard output carries clean URLs and not the raw escape codes. - `[tui] osc8_links: bool` config (default `true`) added to `TuiConfig`, documented in `docs/CONFIGURATION.md`, and surfaced in `config.example.toml`. `run_tui` applies it at startup. ### Tests - 7 unit tests in `osc8::tests` covering wrap, strip-with-ESC-terminator, strip-with-BEL-terminator, plain passthrough, mixed escapes, default state, and round-trip set/unset. - 2 markdown_render tests proving URLs in paragraph blocks emit the OSC 8 wrapper when enabled and emit plain text when disabled. - 2 ui_text tests proving `line_to_plain` strips OSC 8 wrappers from spans and passes plain spans through unchanged. Tests that touch the global ENABLED flag serialize through a static Mutex inside the test module so cargo's parallel runner can't observe a torn read. ### Verification cargo fmt --all -- --check ✓ cargo clippy --workspace --all-targets --all-features --locked -- -D warnings ✓ cargo test --workspace --all-features --locked ✓ (1820 + supporting; was 1809) Closes #498 Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-03 02:13:14 -05:00
Hunter Bown	ad8064b143	chore(v0.8.8): stabilization batch — sub-agent caps, mutex contention, RLM polish, CI cleanup Bundles the v0.8.8 stabilization fixes that were already implemented in the working tree, plus the workflow/doc reconciliation called out in #507. ### Sub-agent runtime fixes - #509 Default sub-agent cap raised to 10 (configurable via `[subagents].max_concurrent` in `config.toml`, hard ceiling 20). The running-count calculation now ignores non-running, no-handle, and finished handles so completed agents stop counting against the cap. - #510 `SharedSubAgentManager` is now `Arc<RwLock<...>>`; the read paths that previously held a `Mutex` for inspection now take a read lock, eliminating the multi-agent fan-out UI freeze. - #511 `compact_tool_result_for_context` summarizes `agent_result` / `agent_wait` payloads before they are folded into the parent context. - #512 RLM tool cards map to `ToolFamily::Rlm` and render `rlm`, not `swarm`. Stale "swarm" wording cleaned in docs/comments/tests. - #513 (foreground stopgap only) Foreground RLM work is visible in the Agents sidebar projection. Full async RLM lifecycle remains v0.8.9 — the issue stays open with a refined scope. ### TUI / UX fixes - #487 Offline composer queue is now session-scoped; legacy unscoped queues fail closed. - #488 Composer Option+Backspace deletes by word; cross-platform key routing helpers added. - #443/#444 Keyboard enhancement flags pop on normal AND panic exit; the raw-mode startup probe is now bounded by a configurable timeout. - #449 Production footer reads statusline colors from `app.ui_theme` rather than the bespoke palette. - #506 `display_path_with_home` no longer mutates `HOME` in tests; the flake on shared-env CI is gone. ### Self-update / packaging - #503 `update.rs` arch mapping uses release-asset naming (`arm64`/`x64`) instead of the raw Rust constants. The platform-asset selector also rejects `.sha256` siblings as primary binaries. Tests now live alongside the source in `mod tests` (the `#[path]`-based integration test was removed because it duplicated test runs and forced a `pub(crate)` helper that no real caller used). - `Max 5 in flight` wording updated in `agent_spawn` description, `prompts/base.md`, and `docs/TOOL_SURFACE.md` so the model sees the real default cap (10) and the configuration knob name. ### CI / release docs (#507) - Pruned three duplicated/dead workflows: `crates-publish.yml`, `parity.yml`, `publish-npm.yml`. Their gates already run in `ci.yml` for every push/PR. - `release.yml` build job now allows `parity` to be skipped (it only runs on tag push), unblocking `workflow_dispatch` reruns. The job still fails closed on a real parity failure. - `RELEASE_RUNBOOK.md` reconciled: crate publishing is documented as the manual `scripts/release/publish-crates.sh` flow (no automated workflow); references to the deleted workflows removed. - `CLAUDE.md` notes the `RELEASE_TAG_PAT` requirement for the auto-tag → release.yml chain (without it, the tag is created but `release.yml` does not fire) and documents the `workflow_dispatch` parity-skip behavior. ### Docs - `docs/COMPETITIVE_ANALYSIS.md` added — capability matrix vs OpenCode and Codex CLI, gap analysis, and recommended implementation order. ### Verification (this branch) - `cargo fmt --all -- --check` ✓ - `cargo check --workspace --all-targets --locked` ✓ - `cargo clippy --workspace --all-targets --all-features --locked -- -D warnings` ✓ - `cargo test --workspace --all-features --locked` ✓ (1809 + supporting) - Parity gates ✓ (snapshot, parity_protocol, parity_state) - `cargo build --release --locked -p deepseek-tui-cli -p deepseek-tui` ✓ - Lockfile drift guard ✓ - `deepseek doctor --json` clean - `deepseek eval` (offline harness) success=true, 0 tool errors Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-03 01:57:37 -05:00
Hunter Bown	7125172f67	fix(tui): tighten selection and live task panels	2026-05-02 21:05:15 -05:00
Hunter Bown	c2b2c284f6	release: v0.7.5 — token-basis fixes, shell timeout recovery, context/cache policy Issues #202, #203, #204, #205: - Cycle/seam triggers use active request input size + response headroom reserve, not lifetime cumulative API usage. - V4 hard-cycle headroom calibrated around fixed TURN_MAX_OUTPUT_TOKENS plus CONTEXT_HEADROOM_TOKENS safety buffer. - /tokens, /cost, footer/header labels, and docs now separate active context, turn telemetry, cumulative usage, cache hit/miss, context percent, and cost. - Foreground exec_shell timeout output tells the model the process was killed and suggests task_shell_start or background exec_shell plus poll/wait. - Added regression tests for active-token basis, V4 headroom, seam trigger basis, footer label behavior, and shell timeout recovery metadata. - Preserved #200/#201 policy: V4 default is append-only, prefix-cache preserving; replacement compaction, Flash seams, and capacity intervention remain opt-in.	2026-04-29 10:13:27 -05:00
Hunter Bown	41e8f2b5b2	Disable default compaction and opt in context seams	2026-04-29 09:12:20 -05:00
Hunter Bown	6d8ab4c2b8	fix: close v0.7.2 issue cleanup	2026-04-28 23:09:19 -05:00
Hunter Bown	27527699db	Merge branch 'feat/v070-lsp' (#136 LSP diagnostics) # Conflicts: # config.example.toml # crates/config/src/lib.rs # crates/tui/src/config.rs # crates/tui/src/core/engine.rs # crates/tui/src/main.rs # crates/tui/src/runtime_threads.rs # crates/tui/src/tui/ui.rs	2026-04-28 01:03:36 -05:00
Hunter Bown	41daab3ca0	Merge branch 'feat/v070-snapshots' (#137 side-git snapshots) # Conflicts: # crates/config/src/lib.rs # crates/tui/src/config.rs	2026-04-28 00:58:16 -05:00
Hunter Bown	0781b7c203	feat(session): #137 prune stale workspace snapshots at session boot `run_interactive` now calls `session_manager::prune_workspace_snapshots_at_boot` right after the system-skills installer, dropping any snapshot in the side-git repo older than 7 days (default; configurable via the new `[snapshots]` section in `config.example.toml`). The helper is non-fatal: a missing `git` binary, read-only home, or absent snapshot dir all log a single WARN (or DEBUG for the count of pruned commits) and return, so the TUI keeps starting even when retention can't run. Also document the snapshot subsystem in `config.example.toml` — disk-footprint expectations, where the side repo lives, and how `/restore` / `revert_turn` consume it. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-28 00:31:57 -05:00
Hunter Bown	03dcc74833	docs(skills): #140 document /skill install + [skills] config section - README: add a "Publishing your own skill" section explaining the `github:owner/repo` install path, the multi-skill `skills/<name>/` layout, and how to submit to the curated registry. - config.example.toml: document `[skills] registry_url` / `max_install_size_bytes` next to the existing `[network]` section so users see the network-gate dependency in context. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-28 00:30:10 -05:00
Hunter Bown	05a1032e00	feat(lsp): #136 post-edit diagnostics injection Inject LSP diagnostics as a synthetic user message after every successful file edit (`edit_file`, `apply_patch`, `write_file`) so the agent sees compile breaks before its next reasoning step. Largest agent-quality lever in v0.7.0. Pieces: - `crates/tui/src/lsp/`: thin JSON-RPC stdio client (no `tower-lsp`), per-language registry, diagnostics renderer producing the `<diagnostics file="…">` block format. `LspManager` owns lazily spawned per-language transports keyed by `Language`. - `core/engine.rs`: hook on the success branch of the tool-result loop derives the edited file path(s) per tool, queries the LspManager with a 5 s timeout, and collects rendered blocks into `pending_lsp_blocks`. The queue is flushed as a `text` content block on the next request iteration so the model sees the diagnostics before it streams its next turn. - `[lsp]` config schema (`enabled`, `poll_after_edit_ms`, `max_diagnostics_per_file`, `include_warnings`, optional `servers` override) with built-in defaults for rust-analyzer, gopls, pyright, typescript-language-server, and clangd. - Failure modes are non-blocking by design: a missing LSP binary logs a one-time warning and skips the hook; a crashed server or poll timeout simply drops that turn's diagnostics. The agent's work is never blocked. Tests: 24 unit tests cover language detection, registry overrides, filter/sort/truncate behavior, and the rendered block format. Three engine-level tokio tests exercise the full path through a fake transport (no real LSP server is ever spawned in CI). Acceptance criteria (per #136): - Edit introducing a type error -> next request body contains `<diagnostics file="…">` block at the right line/col. - `[lsp] enabled = false` -> no diagnostics injected. - Snapshot test exercises full path with mock transport. - LSP binary not on PATH -> one-time warning, agent proceeds. - 5 s timeout, errors-only by default. - Transports spawn lazily on first edit per language. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-28 00:29:28 -05:00
Hunter Bown	87ac2e9bdc	Merge branch 'feat/v070-network' (#135 egress network policy + audit) - crates/tui/src/network_policy.rs (NEW, ~300 LOC) — NetworkPolicy/NetworkPolicyDecider/NetworkAuditor - crates/tui/src/tools/{fetch_url,web_search,spec}.rs + mcp.rs — gate before egress - crates/tui/src/core/engine.rs + runtime_threads.rs + tui/ui.rs — surface NetworkDenied - crates/config/src/lib.rs + tui/src/config.rs + config.example.toml — [network] schema - Subdomain-prefix matching with deny-wins precedence - Audit format: <RFC3339> network <host> <tool> <decision> # Conflicts: # crates/tui/src/config.rs	2026-04-28 00:07:42 -05:00
Hunter Bown	a054789f79	Merge branch 'feat/v070-osc9' (#132 OSC 9 desktop notification on long-turn completion) - crates/tui/src/tui/notifications.rs (NEW) — Method enum {Auto/Osc9/Bel/Off}, notify_done() - crates/tui/src/tui/mod.rs + ui.rs — register module + hook EngineEvent::TurnComplete - crates/tui/src/config.rs — NotificationsConfig (method/threshold_secs/include_summary) - config.example.toml — [notifications] section - 9 unit tests including tmux DCS passthrough wrapping	2026-04-28 00:06:26 -05:00
Hunter Bown	abbb86cdd2	feat(network): #135 add [network] config schema for policy Adds the `[network]` table to both the workspace config crate (`ConfigToml`) and the live tui config (`Config`), plus a documented example block in `config.example.toml`. Schema: ```toml [network] default = "prompt" # allow \| deny \| prompt allow = ["api.deepseek.com", "github.com"] deny = [] audit = true ``` `NetworkPolicyToml::into_runtime()` builds a runtime `NetworkPolicy` so the engine can construct a `NetworkPolicyDecider` without reaching across crate boundaries. Defaults preserve pre-v0.7.0 behavior: when the section is absent, no policy is enforced.	2026-04-28 00:02:34 -05:00
Hunter Bown	6432d47c53	feat: #132 emit OSC 9 / BEL desktop notification on long turn completion Adds crates/tui/src/tui/notifications.rs with Method enum (Auto/Osc9/Bel/Off), notify_done / notify_done_to helpers, tmux DCS passthrough, and 9 unit tests. Wires the hook at the TurnComplete event in tui/ui.rs so turns >= threshold_secs (default 30 s) emit an escape to stdout; method auto-detects iTerm.app/Ghostty/ WezTerm for OSC 9 and falls back to BEL. Config exposed under [notifications] in config.toml and documented in config.example.toml. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-27 23:52:58 -05:00
Hunter Bown	ce3d1a73b9	feat(approval): #131 port opencode arity dict for prefix-based auto_allow matching Add COMMAND_ARITY dictionary (160+ entries: git, npm, yarn, pnpm, cargo, docker, kubectl, go, pip, gh, rustup, deno, bun) and classify_command() so auto_allow = ["git status"] matches git status -s / --porcelain but not git push. Wire classify_command() into approval_cache command_prefix key construction. 50 colocated unit tests including an explicit auto_allow semantics test. Document prefix semantics in config.example.toml. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-27 23:49:56 -05:00
Hunter Bown	f7fe5e09a5	fix(#37 ): make NIM a peer provider in config.example.toml + setup status A user couldn't find an `NVIDIA_API_KEY` block in `~/.deepseek/config.toml` because the example file only mentioned NIM as commented-out alternates to the top-level keys. Two fixes: - `config.example.toml` now has explicit `[providers.deepseek]` and `[providers.nvidia_nim]` sections (placed after all top-level keys so the TOML still parses cleanly), each documenting `api_key` / `base_url` / `model` plus the env vars that override them. Both providers can be stored at once and toggled via `/provider` or `--provider` without re-entering keys. - `setup --status` "missing api_key" message is now provider-aware: on `nvidia-nim` it points at `NVIDIA_API_KEY` + `[providers.nvidia_nim]` + `deepseek auth set --provider nvidia-nim`, instead of the DeepSeek-only hint. Audit verified: the v0.5.0 multi-turn replay fix path (`should_replay_reasoning_content` → `requires_reasoning_content` in `crates/tui/src/client.rs:1796`) keys off the model name (matches `deepseek-v4`), not the provider, so NIM-hosted V4 models get the replay automatically. No NIM-specific 400-class regression there. Closes #37 (docs/UX); the live multi-turn-against-NIM verification remains a manual smoke step listed in the issue (no NIM creds in CI).	2026-04-25 13:52:44 -05:00
Hunter Bown	6ad3727fa0	Add provider switch and file mention attachments	2026-04-24 23:09:48 -05:00
Hunter Bown	d0dc26ce25	Add NVIDIA NIM provider support for DeepSeek	2026-04-24 18:29:19 -05:00
Hunter Bown	8323bedfb7	fix: restore default tui mouse scrolling	2026-04-24 11:41:31 -05:00
Hunter Bown	c4f9078712	release: deepseek tui 0.4.3	2026-04-24 10:48:35 -05:00
Hunter Bown	35595f8edc	fix: normalize legacy DeepSeek aliases to V4 flash	2026-04-23 23:08:44 -05:00
Hunter Bown	b7bd02d814	feat: DeepSeek V4 support with reasoning-effort control (0.4.0) Adds first-class DeepSeek V4 Pro and Flash support, updates the default model to deepseek-v4-pro, aligns legacy aliases with the current V4 1M context behavior, and fixes thinking-mode request handling. Key fixes: - Send DeepSeek's raw Chat Completions `thinking` parameter at the top level instead of SDK-only `extra_body`. - Preserve assistant `reasoning_content` for all prior thinking-mode tool-call turns so subsequent requests satisfy DeepSeek V4's replay requirement. - Fix npm wrapper concurrent first-run downloads by using per-process temporary download paths. - Add `.mailmap` so historical bot-attributed commits aggregate under Hunter Bown where mailmap is honored. Verified with the full local Rust gate, live DeepSeek V4 smoke, npm wrapper temp-install smoke, and green PR CI across Linux, macOS, and Windows.	2026-04-23 22:53:20 -05:00
Hunter Bown	7b91169017	refactor: move source files into workspace crates - Move src/* into crates/tui/src/ to create a proper workspace structure - Add .claude/ and .trimtab/ directories for Trimtab closed-loop workflow - Add DEPENDENCY_GRAPH.md and update documentation - Update Cargo.toml files to reflect new crate dependencies - Update CI workflows and npm package scripts - All tests pass, release build works	2026-03-11 20:00:38 -05:00
Hunter Bown	8d904129b5	Release 0.3.23	2026-02-24 10:31:54 -06:00
Hunter Bown	1a04659a95	Add capacity memory controller and smoother TUI streaming	2026-02-17 16:09:07 -06:00
Hunter Bown	87884a1e84	Improve model handling, context recovery, and stabilize config tests	2026-02-16 17:57:22 -06:00
Hunter Bown	ab2c708ca7	feat: runtime API, task manager, and extensive improvements (v0.3.16) Major Features: - Runtime API for external integrations and turn management - Task manager with persistence and recovery - Shell output streaming and improved tool execution - Error taxonomy and audit logging - Command palette and UI enhancements Documentation: - Runtime API documentation - Operations runbook - Architecture updates Fixes: - Auto-compaction threshold and triggering logic - Doctor command API key validation - Clippy and formatting compliance	2026-02-16 10:51:39 -06:00
Hunter Bown	6c55d80dea	feat: world-class TUI improvements + deepseek-v3.2 default (v0.3.15) - Real SSE streaming (token-by-token display via Chat Completions API) - Token cost tracking with real DeepSeek pricing in header - Context window usage percentage display - Atomic session writes (crash safety via temp+rename) - Responses API recovery (periodic probe instead of permanent fallback) - Tool parse failure user-visible warnings - Default model changed to deepseek-v3.2 everywhere - V3.2 reasoning/thinking support - V3.2 pricing and context window entries	2026-02-08 12:46:51 -06:00
Hunter Bown	e0bccecd5c	Remove RLM/Duo modes and restore footer scroll	2026-02-03 18:29:36 -06:00
Hunter Bown	325aaefc00	Update tool parity and skills docs	2026-02-03 17:34:55 -06:00
Hunter Bown	3204f556af	release: v0.3.0	2026-01-27 00:46:48 -06:00
Hunter Bown	6f1158a2d7	Initial release v0.1.0 DeepSeek TUI - Unofficial terminal UI + CLI for DeepSeek models. Features: - Interactive TUI with multiple modes (Normal, Plan, Agent, YOLO, RLM, Duo) - Comprehensive tool access with approval gating - File operations, shell execution, task management - Sub-agent system for parallel work - MCP integration for external tool servers - Session management and skills system - Cross-platform support (macOS, Linux, Windows) 🤖 Generated with [Claude Code](https://claude.ai/code)	2026-01-20 08:57:35 -06:00

35 Commits