codewhale

dgf1988/codewhale

Author	SHA1	Message	Date
Hunter Bown	f7fe5e09a5	fix(#37 ): make NIM a peer provider in config.example.toml + setup status A user couldn't find an `NVIDIA_API_KEY` block in `~/.deepseek/config.toml` because the example file only mentioned NIM as commented-out alternates to the top-level keys. Two fixes: - `config.example.toml` now has explicit `[providers.deepseek]` and `[providers.nvidia_nim]` sections (placed after all top-level keys so the TOML still parses cleanly), each documenting `api_key` / `base_url` / `model` plus the env vars that override them. Both providers can be stored at once and toggled via `/provider` or `--provider` without re-entering keys. - `setup --status` "missing api_key" message is now provider-aware: on `nvidia-nim` it points at `NVIDIA_API_KEY` + `[providers.nvidia_nim]` + `deepseek auth set --provider nvidia-nim`, instead of the DeepSeek-only hint. Audit verified: the v0.5.0 multi-turn replay fix path (`should_replay_reasoning_content` → `requires_reasoning_content` in `crates/tui/src/client.rs:1796`) keys off the model name (matches `deepseek-v4`), not the provider, so NIM-hosted V4 models get the replay automatically. No NIM-specific 400-class regression there. Closes #37 (docs/UX); the live multi-turn-against-NIM verification remains a manual smoke step listed in the issue (no NIM creds in CI).	2026-04-25 13:52:44 -05:00
Hunter Bown	24b8945010	feat(#32 ): basic session-handoff convention via .deepseek/handoff.md Minimum-viable version of the handoff artifact described in #32: - New `HANDOFF_RELATIVE_PATH = ".deepseek/handoff.md"` convention. - `system_prompt_for_mode_with_context` now reads that path on every prompt rebuild and prepends a `## Previous Session Handoff` block to the system prompt when the file is non-empty. A fresh agent gets the prior session's blockers/decisions/files-touched in turn-1 context with zero discovery cost. - Agent prompt updated to make the convention explicit: "if the block appears, read it first; before exit/`/compact`, write or update it via `write_file`." - `.deepseek/` is already gitignored, so the handoff travels with the workspace but doesn't pollute commits unless the user opts in. Tests cover: present-and-non-empty (block injected with file content), missing file (no block), empty/whitespace-only file (no block). A unique marker in the injected block (`"left a handoff at .deepseek/..."`) discriminates the actual block from the agent prompt's own description of the convention. Out of scope for v0.5.1: a `/handoff` slash command, a startup banner toast, automatic write on exit, and the diff-against-HEAD-on-resume mechanism. The agent can already write the file via `write_file` when the user types `write a session handoff`. Closes #32.	2026-04-25 13:48:22 -05:00
Hunter Bown	82e4a564aa	refactor(#35 ): tighten agent prompt tool descriptions, drop alias dupes Tool-surface audit pass: - FILE OPERATIONS rewritten so each line states the niche, not just the verb. read_file mentions PDF auto-extraction + `pages` slicing. - New SEARCH section consolidates grep_files / file_search / web_search / fetch_url so the model sees them next to each other and picks the right one. fetch_url (#33) added; previously absent from the prompt. - request_user_input pulled out of FILE OPERATIONS into its own USER section — it never belonged there. - SUB-AGENTS list shrinks by 3: drops `spawn_agent` (use `agent_spawn`), `close_agent` (use `agent_cancel`), and the `agent_assign / assign_agent` dual-name. The underlying dispatchers still resolve those names, so existing sessions don't break — they just no longer pollute the model's tool list. Adds `docs/TOOL_SURFACE.md` with the rationale, the v0.5.1 final surface, and the dropped aliases. Calls out that grep_files is pure-Rust (no rg/grep shell-out, so the "fall back to grep" AC from #35 is vacuously satisfied — the tool has no shell dependency to fall back from). Closes #35.	2026-04-25 13:44:43 -05:00
Hunter Bown	07ae792068	fix(#38 ): show provider chip in header when not on default DeepSeek The reasoning-effort tier (`max` chip + whale icon) and the live/context indicators were the only signals on the right of the header. Switching to nvidia-nim left the right-hand side identical to a DeepSeek session, so it wasn't obvious at a glance that requests were going to a different backend. Now: when `app.api_provider != Deepseek`, the header surfaces a bold `NIM` chip on the right, leftmost in the chip cluster (so it survives the narrow-width fallback variants in `right_spans`). Default-DeepSeek sessions are unchanged — `provider_label = None` short-circuits the chip. Closes #38.	2026-04-25 13:41:52 -05:00
Hunter Bown	ba40ae4aac	feat(#34 ): auto-extract text from PDFs in read_file `read_file` now detects PDFs by extension or `%PDF-` magic bytes and shells out to `pdftotext -layout` (poppler) to return plain text directly to the model. New optional `pages` arg accepts `N` or `N-M` slices so big papers can be read in pieces without burning context. When `pdftotext` isn't on `$PATH`, the tool returns a structured `{type: "binary_unavailable", kind: "pdf", reason, hint}` payload with install hints (`brew install poppler` / `apt install poppler-utils`) instead of crashing or returning UTF-8 garbage from a binary file. Tests cover extension detection (case-insensitive), magic-byte sniffing on extension-less files, the negative case for plain text, the pages arg parser (single, range, whitespace, invalid forms), and the binary_unavailable branch when `pdftotext` is absent. .docx / .epub / .html stripping deferred — same dispatch can take more extractors later. Closes #34.	2026-04-25 13:36:30 -05:00
Hunter Bown	7f2c382343	feat(#33 ): add fetch_url tool for direct HTTP GET Complements `web_search` for cases where the URL is already known — GitHub repos, blog posts, spec pages — and a search-engine round trip is overkill or actively unhelpful (which #25 had been making worse). Surface: - `fetch_url(url, format?, max_bytes?, timeout_ms?)` - `format`: `markdown` (default), `text`, `raw` - HTTPS preferred, http:// allowed; non-http schemes rejected up front - Follows up to 5 redirects; 1 MB default cap (10 MB hard ceiling); 15 s default timeout (60 s ceiling) - HTML responses are stripped to readable text via the same regex pattern used by `web_search` (script/style strip → tag strip → entity decode → whitespace collapse) - 4xx / 5xx responses still return the body (with `success: false`) so the caller can read JSON error envelopes Capabilities: `ReadOnly + Network`. Approval: `Auto` (matches `web_search`). Registered in `with_web_tools` so it's available wherever `web_search` is. Tests cover: format parsing aliases, scheme rejection, missing/empty url validation, html-to-text stripping. The over-the-wire cases (redirect chains, oversized truncation) are exercised by integration tests once the test suite is wired to a local mock HTTP server — deferring that since the unit tests already lock in the input validation and HTML processing. Closes #33.	2026-04-25 13:33:22 -05:00
Hunter Bown	017ac97d0d	feat(#30 ): debug-log reasoning_content replay size per request The thinking-mode sanitizer now sums the byte size of every replayed `reasoning_content` field in the outgoing chat-completions body and emits an `info`-level log line: Reasoning-content replay: 7 assistant message(s), ~3.2K input tokens (12,884 chars) being re-sent in this request This is visible under `RUST_LOG=deepseek_tui=info` (or higher). It's the first step toward the footer/status-line indicator described in #30 — the model's input-side reasoning replay is now observable per turn, even before it gets a dedicated UI surface. Tests cover both branches: bodies that already have reasoning_content (count is summed across all assistant turns) and bodies where the sanitizer had to inject the `(reasoning omitted)` placeholder (the placeholder bytes are included in the count since they ship over the wire). Footer integration deferred — that needs a new event from client → engine → TUI to surface the count alongside `cache N%` / `$X.XX`. Part of #30 remains open.	2026-04-25 13:28:44 -05:00
Hunter Bown	0a394e1587	fix(#31 ): catch version drift in CI, not at release time Adds scripts/release/check-versions.sh and a `versions` CI job that runs on every push/PR. Verifies: - no per-crate Cargo.toml carries a literal version (must inherit the workspace version) - npm/deepseek-tui/package.json matches the workspace version - Cargo.lock is in sync with the manifests Closes #31.	2026-04-25 13:25:55 -05:00
Hunter Bown	fafb76063d	ci: fix unused import + cargo fmt drift from #27 The #27 per-mode context budget commit (`1be18e69`) replaced calls to compaction_threshold_for_model with compaction_threshold_for_model_and_effort but left the old name in the import list, which fails under -Dwarnings on Build, Test, and the npm wrapper smoke job. Also re-runs cargo fmt over the four files the lint job flagged.	2026-04-25 13:21:16 -05:00
Hunter Bown	9e7cfc951a	docs: README v0.5.0 callout + CHANGELOG entry, drop cargo install - README: drop the `cargo install deepseek-tui` / `deepseek-tui-cli` block and the crates.io badge — those packages have been lagging the workspace version for several releases. Source install (`cargo install --path crates/tui`) remains documented for hackers. - README: add a "What's new in v0.5.0" section pointing at the thinking-mode tool-call 400 fix and the #25 web.run cleanup. - CHANGELOG: add the [0.5.0] - 2026-04-25 entry covering the per-message reasoning_content rule, the wire-payload sanitizer, and #25.	2026-04-25 13:15:35 -05:00
Hunter Bown	1be18e691b	feat(#27 ): per-mode soft context budget for V4 compaction trigger Add compaction_threshold_for_model_and_effort() with mode-aware soft caps based on DeepSeek V4 paper Figure 9 recall-quality data: Plan / off -> 64K (paper eval: 8K-128K) Agent / high -> 192K (paper eval: 128K) YOLO / max -> 384K (paper eval: 384K-512K) Previously, the 80%-of-window rule gave 800K for V4's 1M window, which is well past the point where MRCR MMR collapses (0.49 at 1M). Non-V4 models keep the legacy 80% rule. None/unknown effort defaults to agent-tier (192K).	2026-04-25 12:58:35 -05:00
Hunter Bown	ccc9554ef4	fix(#25 ): strip phantom web.run references from prompts and web_search tool Replace all web.run mentions with web_search in prompt files (base, agent, yolo, plan, normal) and update web_search.rs description. Model was trying web.run which doesn't exist, wasting turns on validation errors. Also remove [cite:ref_id] citation format which required web.run's ref_id system. Partial fix for #25 — web_search reliability improvements (real search provider) still needed.	2026-04-25 12:51:12 -05:00
Hunter Bown	97f245d5ac	docs: add TUI screenshot to README	2026-04-25 12:48:42 -05:00
Hunter Bown	19f8d83d3b	release: v0.5.0 — fix multi-turn tool call 400 error (missing reasoning_content on assistant messages with tool_calls)	2026-04-25 12:27:53 -05:00
Hunter Bown	67b232b063	Release v0.4.9: thinking-mode reasoning_content fix + README refresh ### Fixed - DeepSeek thinking-mode tool-call rounds now always replay reasoning_content in all subsequent requests (including across new user turns), matching the documented API contract that assistant tool-call messages must retain their reasoning content forever. Previously, reasoning_content was cleared after the current user turn completed, which could cause HTTP 400 errors. - Missing reasoning_content on a tool-call assistant message now substitutes a safe placeholder ("(reasoning omitted)") instead of dropping the tool calls and their matching tool results, preventing orphaned conversation chains and API 400 rejections. - Session checkpoint now persists a Thinking-block placeholder for tool-call turns that produced no streamed reasoning text, keeping on-disk sessions structurally correct for subsequent requests. - Token estimation for compaction now counts thinking tokens across ALL tool-call rounds (not just the current user turn), aligning with the updated reasoning_content replay rule. ### Changed - Internal crate dependency pins bumped 0.4.5 → 0.4.9 to match workspace. - npm wrapper version and deepseekBinaryVersion bumped to 0.4.9. - README fully rewritten: clearer feature highlights, V4 model focus, keyboard shortcut table, improved docs index, and more engaging layout. - CHANGELOG entry for 0.4.9 with comparison URLs.	2026-04-25 12:00:08 -05:00
Hunter Bown	41c54f08aa	release: deepseek-tui 0.4.8	2026-04-25 10:48:00 -05:00
Hunter Bown	56ea63e2c6	Merge pull request #20 from Hmbown/codex/v4-pro-discount-pricing [codex] Track DeepSeek V4 Pro discount pricing	2026-04-25 10:44:22 -05:00
Hunter Bown	94834b2eb4	fix: track DeepSeek V4 Pro discount pricing	2026-04-25 10:34:02 -05:00
Hunter Bown	dc8f8496d6	Merge pull request #19 from Hmbown/codex/release-0.4.7 release: deepseek-tui 0.4.7 + Devin .env empty-key fix	2026-04-25 08:04:10 -05:00
Hunter Bown	5c1086fe9e	release: deepseek-tui 0.4.7 Bump workspace version 0.4.6 -> 0.4.7 and ship the bug fix flagged by Devin on PR #18: an uncommented `DEEPSEEK_API_KEY=` line in .env.example caused `cp .env.example .env` to load an empty key, which `apply_env_overrides` then propagated into the config and `Config::validate()` rejected on startup with "api_key cannot be empty string". - .env.example: comment out the empty `DEEPSEEK_API_KEY=` placeholder. - crates/tui/src/config.rs: skip empty `DEEPSEEK_API_KEY` env values in `apply_env_overrides`, matching the facade's empty-string filter. - Add `apply_env_overrides_ignores_empty_api_key` regression test. - Bump Cargo.toml workspace version, npm wrapper version + binary version, and Cargo.lock.	2026-04-25 07:59:01 -05:00
Hunter Bown	dff592bc6d	Merge pull request #18 from Hmbown/codex/deepseek-tui-app-working Verify app smoke and add NIM env support	2026-04-25 07:46:41 -05:00
Hunter Bown	29141bc89b	Add NIM env support and .env.example template	2026-04-25 07:21:43 -05:00
Hunter Bown	fe4f261a5d	release: deepseek tui 0.4.6 Bundles new work since v0.4.4: NIM provider support, file mention attachments, provider switch UX, setup --status/--clean/--tools/--plugins, doctor --json, and protocol-recovery hardening that strips fake tool-call wrappers from streaming model output.	2026-04-25 02:01:04 -05:00
Hunter Bown	7f0444d26b	fix: re-add DEFAULT_TEXT_MODEL import dropped by merge Local main's unpushed commits had removed DEFAULT_TEXT_MODEL from the crate::config import in main.rs, but the merged branch's new code at two call sites still uses it. Textual three-way merge took the local import line and the branch's call sites, producing a build break. Re-add the symbol to the import.	2026-04-25 01:57:17 -05:00
Hunter Bown	298f5c6c51	Merge branch 'claude/improve-deepseek-v4-harness-NxBpS' into main	2026-04-25 01:55:44 -05:00
Hunter Bown	853a39138c	feat: setup status/clean/dirs and protocol-recovery hardening Adds a compact `setup --status` view, a `setup --clean` for regenerable session checkpoints, and `--tools`/`--plugins` scaffolding for ~/.deepseek/{tools,plugins} so the extension model has a documented home that doctor can count. `doctor --json` lands as a CI-safe alternative to the human-readable doctor (skips the live API probe). Also locks down the engine's hostility to fake tool-call wrappers: filter_tool_call_delta and the marker constants are now testable, the streaming loop emits one compact status notice per turn when it strips a wrapper, and a new protocol_recovery integration test asserts that the legacy text parser never turns <function_calls> into a real tool call. Adds 23 unit tests + 14 integration tests covering both slices.	2026-04-25 06:26:07 +00:00
Hunter Bown	6ad3727fa0	Add provider switch and file mention attachments	2026-04-24 23:09:48 -05:00
Hunter Bown	16f62f7abf	Fix reasoning replay and context accounting for NIM	2026-04-24 18:42:18 -05:00
Hunter Bown	d0dc26ce25	Add NVIDIA NIM provider support for DeepSeek	2026-04-24 18:29:19 -05:00
Hunter Bown	7d8c40893f	chore: map github web committer	2026-04-24 16:29:01 -05:00
Hunter Bown	411153c549	chore: normalize contributor identities	2026-04-24 16:27:54 -05:00
Hunter Bown	e06071c177	docs: remove stale readme screenshot	2026-04-24 16:23:50 -05:00
Hunter Bown	f377b0f9d2	ci: add npm publish retry path	2026-04-24 16:15:48 -05:00
Hunter Bown	f3df8f5f26	ci: publish npm with trusted publishing	2026-04-24 16:13:59 -05:00
Hunter Bown	8323bedfb7	fix: restore default tui mouse scrolling	2026-04-24 11:41:31 -05:00
Hunter Bown	6c14e2e1f4	release: deepseek tui 0.4.4	2026-04-24 11:02:07 -05:00
Hunter Bown	c4f9078712	release: deepseek tui 0.4.3	2026-04-24 10:48:35 -05:00
Hunter Bown	d713032b9f	ci: allow token-backed npm publish	2026-04-24 00:21:35 -05:00
Hunter Bown	d89b33330f	fix: preserve DeepSeek V4 reasoning tool turns	2026-04-24 00:01:10 -05:00
Hunter Bown	ffa75f07e5	fix: relax v4 compaction and transcript scroll	2026-04-23 23:31:18 -05:00
Hunter Bown	35595f8edc	fix: normalize legacy DeepSeek aliases to V4 flash	2026-04-23 23:08:44 -05:00
Hunter Bown	b7bd02d814	feat: DeepSeek V4 support with reasoning-effort control (0.4.0) Adds first-class DeepSeek V4 Pro and Flash support, updates the default model to deepseek-v4-pro, aligns legacy aliases with the current V4 1M context behavior, and fixes thinking-mode request handling. Key fixes: - Send DeepSeek's raw Chat Completions `thinking` parameter at the top level instead of SDK-only `extra_body`. - Preserve assistant `reasoning_content` for all prior thinking-mode tool-call turns so subsequent requests satisfy DeepSeek V4's replay requirement. - Fix npm wrapper concurrent first-run downloads by using per-process temporary download paths. - Add `.mailmap` so historical bot-attributed commits aggregate under Hunter Bown where mailmap is honored. Verified with the full local Rust gate, live DeepSeek V4 smoke, npm wrapper temp-install smoke, and green PR CI across Linux, macOS, and Windows.	2026-04-23 22:53:20 -05:00
Hunter Bown	dc8e94d705	docs: update documentation and cleanup for v0.3.33	2026-04-22 22:36:45 -05:00
Hunter Bown	f4dbf828c9	Footer polish: remove FOOTER_HINT, simplify footer rendering - Remove FOOTER_HINT color constant from palette - Drop footer clock label and related synchronization logic - Simplify footer status line layout and narrow-terminal handling - Update tests to align with simplified footer logic - Remove empty state placeholder text for cleaner UI - Bump version to 0.3.33	2026-04-11 20:20:18 -05:00
Hunter Bown	cbcf35c1bd	release: prepare v0.3.32 — finance tool, header redesign, expanded tests Add Yahoo Finance quote tool with chart fallback, redesign header widget with proportional truncation and context bar, refactor footer status strip, expand test suite to 680+ tests, and fix blocking issues (usize underflow in header, tempdir leak in finance tests, per-call HTTP client creation).	2026-04-11 18:58:56 -05:00
Hunter Bown	3db7f05c60	ci: fix npm smoke runner and bump wrapper to 0.3.32	2026-03-12 21:48:58 -05:00
Hunter Bown	5c82f08c58	docs: tighten README onboarding and usage copy	2026-03-12 21:41:08 -05:00
Hunter Bown	ea3cc3d0d2	fix: address PR #4 follow-ups (#5 ) * fix: address PR #4 follow-ups Honor low_motion in the default tool transcript path and align composer cursor padding with the rendered placeholder. Add focused regression tests for both behaviors. * lint: remove redundant imports in empty_state test, reuse create_test_app The test had inner `use` statements for Config, App, TuiOptions, and PathBuf that duplicated the module-level test imports. It also manually constructed App instead of calling the existing create_test_app() helper. * fix: replace useless format!("{text}") with text.to_string() in details_affordance_line * test: pin composer_density in cursor test to avoid sensitivity to loaded settings Settings::load() may return a non-default composer_density on some CI environments. Explicitly set ComposerDensity::Comfortable so the expected cursor position is deterministic across all platforms. * fix: make tool low_motion test robust against coarse Windows timers Use a 2× cycle offset so the animated frame index is 2 (maximally distant from 0), giving 1800 ms of headroom before the animation could wrap back to index 0. The previous 1× offset left only ~15 ms of margin, causing flaky failures on Windows where Instant resolution is approximately 15.6 ms. * fix: correct headroom comment in tool animation test (3600ms, not 1800ms) * fix: resolve lint, parity, and Windows test failures - Fix rustfmt line-length issue in history.rs tool animation test - Settings::path() now respects DEEPSEEK_CONFIG_PATH for Windows test compat - doctor_check_mcp_server recognizes Unix-style absolute paths on Windows - Use checked_sub for Instant arithmetic in web_run tests to prevent underflow on freshly-booted Windows CI runners * fix: expand ~ in DEEPSEEK_CONFIG_PATH when resolving settings path ---------	2026-03-12 21:29:14 -05:00
Hunter Bown	b172b8d306	feat: remove Normal mode and consolidate to Agent (#4 ) Keep legacy /normal and settings fallback behavior mapped to Agent, align docs around the three visible modes, and include the current TUI and onboarding refinements in this worktree.	2026-03-12 11:32:25 -05:00
Hunter Bown	14bb2bfd98	release: prepare for v0.3.31 - Update CHANGELOG with workspace refactor and clippy fix - Fix collapsible_if lint in web browsing session manager	2026-03-11 20:11:32 -05:00

1 2 3

142 Commits