dgf1988/codewhale

T

Hunter Bown adb60ba6b0 perf+fix(file_mention): address Gemini and Devin code-review findings

Five issues from the PR #105 review pass:

1. Lazy file-tree index for fuzzy_resolve (HIGH, Gemini)
   The previous `fuzzy_resolve` walked the workspace up to depth 6 on every
   miss. A user typing several non-existent paths in one message could
   trigger multiple disk-intensive walks. Replace with a `OnceLock`-backed
   basename → paths index, built once on first miss and reused thereafter.

2. Cache cwd at construction (MEDIUM, Gemini)
   `std::env::current_dir()` was a syscall on every `resolve` call (up to
   8 mentions per message). Capture once in `Workspace::new()` and store
   in the struct.

3. Include directories in fuzzy match (MEDIUM, Gemini)
   The mention system supports directory listings, but fuzzy_resolve was
   restricted to `is_file()`. Allow directories too.

4. Drop `Path::canonicalize` from the mention loop (MEDIUM, Gemini)
   `Workspace::resolve` already returns absolute paths when the workspace
   root is absolute (always true in TUI use). Removed the per-mention
   `canonicalize` syscall on the message-send hot path. The rare
   symlink-aliasing dedup miss is an acceptable cost.

5. Gate <missing-file> blocks through dedup set (Devin)
   The `Err` arm in `local_context_from_file_mentions` pushed a
   `<missing-file>` block before reaching the `seen.insert` check, so the
   same non-existent mention typed twice produced duplicate blocks and
   wasted prompt tokens. Restructured the loop so all blocks (existing
   AND missing) flow through the dedup gate.

Bonus: replaced the test that was mutating the global cwd (race-prone
across tests) with one that constructs `Workspace::with_cwd` explicitly.
Added a second test exercising the lazy-index + directory-match paths.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

2026-04-26 17:08:33 -05:00

.github

chore(release): drop auto npm publish, document manual flow, trim research PDF

2026-04-26 16:26:10 -05:00

assets

chore: drop unreferenced assets/hero.png

2026-04-26 00:23:08 -05:00

crates

perf+fix(file_mention): address Gemini and Devin code-review findings

2026-04-26 17:08:33 -05:00

docs

chore(release): drop auto npm publish, document manual flow, trim research PDF

2026-04-26 16:26:10 -05:00

npm/deepseek-tui

release: v0.6.2

2026-04-26 13:56:40 -05:00

scripts/release

fix(#31 ): catch version drift in CI, not at release time

2026-04-25 13:25:55 -05:00

.env.example

release: deepseek-tui 0.4.7

2026-04-25 07:59:01 -05:00

.gitignore

chore(release): drop auto npm publish, document manual flow, trim research PDF

2026-04-26 16:26:10 -05:00

.mailmap

chore: map github web committer

2026-04-24 16:29:01 -05:00

AGENTS.md

docs(agents): document deepseek as the canonical CLI binary

2026-04-26 14:01:30 -05:00

Cargo.lock

merge: 'deepseek metrics' CLI (closes #70 )

2026-04-26 14:22:27 -05:00

Cargo.toml

release: v0.6.2

2026-04-26 13:56:40 -05:00

CHANGELOG.md

release: v0.6.1 — pricing update, remove light theme + theme setting

2026-04-26 11:56:41 -05:00

config.example.toml

fix(#37 ): make NIM a peer provider in config.example.toml + setup status

2026-04-25 13:52:44 -05:00

CONTRIBUTING.md

docs: update documentation and cleanup for v0.3.33

2026-04-22 22:36:45 -05:00

DEPENDENCY_GRAPH.md

docs: update documentation and cleanup for v0.3.33

2026-04-22 22:36:45 -05:00

LICENSE

Initial release v0.1.0

2026-01-20 08:57:35 -06:00

README.md

release: v0.6.1 — pricing update, remove light theme + theme setting

2026-04-26 11:56:41 -05:00

README.md

DeepSeek TUI

A terminal-native coding agent for DeepSeek V4 models — with 1M-token context, thinking-mode reasoning, and full tool-use.

npm i -g deepseek-tui

What is it?

DeepSeek TUI is a coding agent that runs entirely in your terminal. It gives DeepSeek's frontier models direct access to your workspace — reading and editing files, running shell commands, searching the web, managing git, and orchestrating sub-agents — all through a fast, keyboard-driven TUI.

Built for DeepSeek V4 (deepseek-v4-pro / deepseek-v4-flash) with 1M-token context windows and native thinking-mode (chain-of-thought) streaming. See the model's reasoning unfold in real time as it works through your tasks.

Key Features

Native RLM (rlm_query tool) — fans out 1–16 cheap deepseek-v4-flash children in parallel against the existing DeepSeek client for batched analysis, decomposition, or parallel reasoning
Thinking-mode streaming — shows DeepSeek's chain-of-thought as it reasons about your code
Full tool suite — file ops, shell execution, git, web search/browse, apply-patch, sub-agents, MCP servers
1M-token context — automatic intelligent compaction when context fills up
Three interaction modes — Plan (read-only explore), Agent (interactive with approval), YOLO (auto-approved). Decomposition-first system prompts teach the model to todo_write, update_plan, and spawn sub-agents before acting
Reasoning-effort tiers — cycle through off → high → max with Shift+Tab
Session save/resume — checkpoint and resume long sessions
HTTP/SSE runtime API — deepseek serve --http for headless agent workflows
MCP protocol — connect to Model Context Protocol servers for extended tooling
Live cost tracking — per-turn and session-level token usage and cost estimates
Dark theme — DeepSeek-blue palette

Quickstart

npm install -g deepseek-tui
deepseek

On first launch you'll be prompted for your DeepSeek API key. You can also set it ahead of time:

# via CLI
deepseek login --api-key "YOUR_DEEPSEEK_API_KEY"

# via env var
export DEEPSEEK_API_KEY="YOUR_DEEPSEEK_API_KEY"
deepseek

Using NVIDIA NIM

deepseek auth set --provider nvidia-nim --api-key "YOUR_NVIDIA_API_KEY"
deepseek --provider nvidia-nim

# or per-process:
DEEPSEEK_PROVIDER=nvidia-nim NVIDIA_API_KEY="..." deepseek

Install from source

git clone https://github.com/Hmbown/DeepSeek-TUI.git
cd DeepSeek-TUI
cargo install --path crates/tui --bin deepseek-tui --locked   # requires Rust 1.85+
cargo install --path crates/cli --bin deepseek --locked

What's new in v0.6.0

🌊 `rlm_query` — recursive language models as a first-class tool

The model now has direct access to a native recursive-LLM primitive. Inspired by Alex Zhang's RLM work and Sakana AI's published research on novelty search, but trimmed to what an agent loop actually needs: one tool, structured args, no DSL.

// Single child:
rlm_query({ "prompt": "Summarise this 4k-line log: ..." })

// 8 parallel children, indexed result:
rlm_query({
  "prompts": [
    "Review src/foo.rs for race conditions: ...",
    "Review src/foo.rs for input validation: ...",
    "Review src/foo.rs for error-handling gaps: ...",
    "..."
  ]
})

// Promote one call to Pro:
rlm_query({ "prompt": "Hard reasoning here", "model": "deepseek-v4-pro" })

Children run concurrently against the existing DeepSeek client via tokio — no external binary, no Python sandbox, no fenced-block DSL. Returns a single string for one prompt or [i] ... indexed blocks for many. Available in Plan / Agent / YOLO. The cost is folded into the session's running total automatically.

Other changes

Scroll position survives content rewrites — anchor fallback now clamps to the nearest surviving cell instead of teleporting to the bottom (#56)
Looser command-safety chains — cargo build && cargo test is no longer blocked outright; chains of known-safe commands escalate to RequiresApproval instead of Dangerous (#57)
Multi-turn tool calls no longer 400 on thinking-mode models — reasoning_content is replayed across user-message boundaries with a safe placeholder when the round produced none

Full history: CHANGELOG.md.

Models & Pricing

DeepSeek TUI targets DeepSeek V4 models with 1M-token context windows by default.

Model	Context	Input (cache hit)	Input (cache miss)	Output
`deepseek-v4-pro`	1M	$0.003625 / 1M*	$0.435 / 1M*	$0.87 / 1M*
`deepseek-v4-flash`	1M	$0.0028 / 1M	$0.14 / 1M	$0.28 / 1M

Legacy aliases deepseek-chat and deepseek-reasoner silently map to deepseek-v4-flash.

NVIDIA NIM hosted variants (deepseek-ai/deepseek-v4-pro, deepseek-ai/deepseek-v4-flash) use your NVIDIA account terms — no DeepSeek platform billing.

*DeepSeek lists the Pro rates above as a limited-time 75% discount valid until 2026-05-05 15:59 UTC; the TUI cost estimator falls back to base Pro rates after that timestamp.

Usage

deepseek                                      # interactive TUI
deepseek "explain this function"              # one-shot prompt
deepseek --model deepseek-v4-flash "summarize" # model override
deepseek --yolo                               # YOLO mode (auto-approve tools)
deepseek login --api-key "..."                # save API key
deepseek doctor                               # check setup & connectivity
deepseek models                               # list live API models
deepseek sessions                             # list saved sessions
deepseek resume --last                        # resume latest session
deepseek serve --http                         # HTTP/SSE API server

Keyboard shortcuts

Key	Action
`Tab`	Cycle mode: Plan → Agent → YOLO
`Shift+Tab`	Cycle reasoning-effort: off → high → max
`F1`	Help
`Esc`	Back / dismiss
`Ctrl+K`	Command palette
`@path`	Attach file/directory context in composer
`/attach <path>`	Attach image/video media references

Modes

Mode	Behavior
Plan 🔍	Read-only investigation — model explores and proposes a decomposition plan (`update_plan` + `todo_write`) before making changes
Agent 🤖	Default interactive mode — multi-step tool use with approval gates; model outlines work via `todo_write` before requesting writes
YOLO ⚡	Auto-approve all tools in a trusted workspace; model still creates `todo_write`/`update_plan` to keep work visible and trackable

Configuration

~/.deepseek/config.toml — see config.example.toml for every option.

Key environment overrides:

Variable	Purpose
`DEEPSEEK_API_KEY`	API key
`DEEPSEEK_BASE_URL`	API base URL
`DEEPSEEK_MODEL`	Default model
`DEEPSEEK_PROVIDER`	Provider: `deepseek` (default) or `nvidia-nim`
`DEEPSEEK_PROFILE`	Config profile name
`NVIDIA_API_KEY`	NVIDIA NIM API key

Quick diagnostics:

deepseek-tui setup --status    # read-only status check (API key, MCP, sandbox, .env)
deepseek-tui doctor --json     # machine-readable doctor output for CI
deepseek-tui setup --tools --plugins  # scaffold tools/ and plugins/ directories

DeepSeek context caching is automatic — when the API returns cache hit/miss token fields, the TUI includes them in usage and cost tracking.

Full reference: docs/CONFIGURATION.md

Documentation

Doc	Topic
ARCHITECTURE.md	Codebase internals
CONFIGURATION.md	Full config reference
MODES.md	Plan / Agent / YOLO modes
MCP.md	Model Context Protocol integration
RUNTIME_API.md	HTTP/SSE API server
RELEASE_RUNBOOK.md	Release process
OPERATIONS_RUNBOOK.md	Ops & recovery

Contributing

See CONTRIBUTING.md. Pull requests welcome!

Not affiliated with DeepSeek Inc.

License

MIT

Languages

Rust 94%

TypeScript 2.6%

JavaScript 1.6%

Shell 0.8%

Python 0.6%

Other 0.1%

README.md Unescape Escape

DeepSeek TUI

What is it?

Key Features

Quickstart

Using NVIDIA NIM

What's new in v0.6.0

🌊 rlm_query — recursive language models as a first-class tool

Other changes

Models & Pricing

Usage

Keyboard shortcuts

Modes

Configuration

Documentation

Contributing

License

README.md

🌊 `rlm_query` — recursive language models as a first-class tool