Live repro: in a session producing content rapidly (sub-agent running, multiple tool calls), the user scrolls up to read earlier output. Their scroll position briefly takes effect, then snaps back to the live tail when the next stream chunk arrives. Symptom is "scrolling is broken / takes over instead of the transcript". Root cause in `crates/tui/src/tui/widgets/mod.rs:188-210`: * The user's mouse-scroll-up sets `transcript_scroll = at_line(N)` and `user_scrolled_during_stream = true`. * During render, `resolve_top` clamps the state against `max_start = total_lines.saturating_sub(visible_lines)`. If `max_start < N` (transcript shrunk between scrolls and render — e.g., a sub-agent in-progress card collapsed into a smaller finished card, or the content briefly fits in one screen), `resolve_top` returns `Self::to_bottom()` (TAIL_SENTINEL). * `is_at_tail()` on the post-resolve state returns `true`. * The auto-clear at line 208 fires → `user_scrolled_during_stream = false`. * Next `add_message` / sub-agent envelope sees `is_at_tail() && !user_scrolled_during_stream` and calls `scroll_to_bottom()`. The user is yanked off their position mid-read. `scrolled_by` has the same trapdoor: when `total_lines <= visible_ lines` it returns `to_bottom()` regardless of scroll direction (line 145-148 in scrolling.rs). A user scroll-up while content fits in one screen produces `to_bottom()` → `is_at_tail()` true → auto-clear → next chunk yanks. The fix ======= Snapshot whether the user's PRIOR state was deliberately tail (`is_at_tail()` BEFORE `resolve_top`), and only clear the lock when: 1. Prior state was already TAIL_SENTINEL (deliberate, set by `scrolled_by` reaching `max_start` while scrolling DOWN, or by `scroll_to_bottom()`). 2. AND `total_lines > visible_lines` (so "tail" is meaningful — if the whole transcript fits, "is_at_tail" is trivially true and clearing the lock would yank the user back to bottom on the next chunk despite their explicit scroll-up). This preserves all the legitimate clear paths: * `TurnComplete` event clears the lock at the per-turn boundary (`ui.rs:879`). * User invokes `scroll_to_bottom()` explicitly via key/menu (`app.rs:2459`). * User scrolls down enough that `scrolled_by` reaches `max_start` in a transcript with real scroll room — state goes through `to_bottom()` BEFORE resolve, so `was_explicit_tail = true` and the lock clears. What it stops: * Render-time resolve clamping `at_line(N)` to tail when content shrunk doesn't quietly revoke the user's intent. * `scrolled_by` collapsing a scroll-up to `to_bottom()` when content briefly fits in one screen no longer triggers the auto-clear (the prior state wasn't tail). Verified locally: * `cargo fmt --all -- --check` clean. * `cargo clippy --workspace --all-targets --all-features --locked -- -D warnings` clean. * `cargo test --workspace --all-features --locked` — 2038 passed, 2 ignored, 0 failed (a snapshot::repo flake unrelated to scroll; passes in isolation). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
🐳 DeepSeek TUI
This terminal-native coding agent is built around DeepSeek V4's 1M-token context window and prefix cache capability. It is distributed as a single binary and requires no Node.js or Python runtime. It also includes an MCP client, a sandbox, and a durable task queue out of the box.
Before proceeding, ensure that Node.js and npm are installed.
node --version
npm --version
If Node.js and npm are not installed in your environment, refer to the installation guides provided below:
- English version: https://nodejs.org/en/download
- Chinese version: https://nodejs.org/zh-cn/download
Select the version that best matches your device specifications and operating system.
Install the deepseek-tui now:
npm install -g deepseek-tui
# When installing deepseek-tui in China's internet environment, you can use an npm mirror to speed up the installation process.
# npm install -g deepseek-tui@latest --registry=https://registry.npmmirror.com
What Is It?
DeepSeek TUI is a coding agent that runs entirely in your terminal. It gives DeepSeek's frontier models direct access to your workspace — reading and editing files, running shell commands, searching the web, managing git, and orchestrating sub-agents — all through a fast, keyboard-driven TUI.
Built for DeepSeek V4 (deepseek-v4-pro / deepseek-v4-flash) with 1M-token context window and native thinking-mode (chain-of-thought) streaming.
Key Features
- Native RLM (
rlm_query) — fans out 1–16 cheapdeepseek-v4-flashchildren in parallel for batched analysis and parallel reasoning, all against the existing API client - Thinking-mode streaming — watch the model's chain-of-thought unfold in real time as it works through your tasks
- Full tool suite — file ops, shell execution, git, web search/browse, apply-patch, sub-agents, MCP servers
- 1M-token context — automatic intelligent compaction when context fills up; prefix-cache aware for cost efficiency
- Three modes — Plan (read-only explore), Agent (interactive with approval), YOLO (auto-approved)
- Reasoning-effort tiers — cycle through
off → high → maxwithShift + Tab - Session save/resume — checkpoint and resume long-running sessions
- Workspace rollback — side-git pre/post-turn snapshots with
/restoreandrevert_turn, without touching your repo's.git - Durable task queue — background tasks survive restarts; think scheduled automation, long-running reviews
- HTTP/SSE runtime API —
deepseek serve --httpfor headless agent workflows - MCP protocol — connect to Model Context Protocol servers for extended tooling; please see docs/MCP.md
- LSP diagnostics — inline error/warning surfacing after every edit via rust-analyzer, pyright, typescript-language-server, gopls, clangd
- User memory — optional persistent note file injected into the system prompt for cross-session preferences
- Localized UI —
en,ja,zh-Hans,pt-BRwith auto-detection - Live cost tracking — per-turn and session-level token usage and cost estimates; cache hit/miss breakdown
- Skills system — composable, installable instruction packs from GitHub with no backend service required
How It's Wired
deepseek (dispatcher CLI) → deepseek-tui (companion binary) → ratatui interface ↔ async engine ↔ OpenAI-compatible streaming client. Tool calls route through a typed registry (shell, file ops, git, web, sub-agents, MCP, RLM) and results stream back into the transcript. The engine manages session state, turn tracking, the durable task queue, and an LSP subsystem that feeds post-edit diagnostics into the model's context before the next reasoning step.
See docs/ARCHITECTURE.md for the full walkthrough.
Quickstart
npm install -g deepseek-tui
deepseek --version
deepseek
Prebuilt binaries are published for Linux x64, Linux ARM64 (v0.8.8+), macOS x64, macOS ARM64, and Windows x64. For other targets (musl, riscv64, FreeBSD, etc.), see Install from source or docs/INSTALL.md.
On first launch you'll be prompted for your DeepSeek API key. The key is saved to ~/.deepseek/config.toml so it works from any directory without OS credential prompts.
You can also set it ahead of time:
deepseek auth set --provider deepseek # saves to ~/.deepseek/config.toml
export DEEPSEEK_API_KEY="YOUR_KEY" # env var alternative; use ~/.zshenv for non-interactive shells
deepseek
deepseek doctor # verify setup
To rotate or remove a saved key:
deepseek auth clear --provider deepseek.
Linux ARM64 (Raspberry Pi, Asahi, Graviton, HarmonyOS PC)
npm i -g deepseek-tui works on glibc-based ARM64 Linux from v0.8.8 onward. You can also download prebuilt binaries from the Releases page and place them side by side on your PATH.
China / Mirror-friendly Installation
If GitHub or npm downloads are slow from mainland China, use a Cargo registry mirror:
# ~/.cargo/config.toml
[source.crates-io]
replace-with = "tuna"
[source.tuna]
registry = "sparse+https://mirrors.tuna.tsinghua.edu.cn/crates.io-index/"
Then install both binaries (the dispatcher delegates to the TUI at runtime):
cargo install deepseek-tui-cli --locked # provides `deepseek`
cargo install deepseek-tui --locked # provides `deepseek-tui`
deepseek --version
Prebuilt binaries can also be downloaded from GitHub Releases. Use DEEPSEEK_TUI_RELEASE_BASE_URL for mirrored release assets.
Install from source
Works on any Tier-1 Rust target — including musl, riscv64, FreeBSD, and older ARM64 distros.
# Linux build deps (Debian/Ubuntu/RHEL):
# sudo apt-get install -y build-essential pkg-config libdbus-1-dev
# sudo dnf install -y gcc make pkgconf-pkg-config dbus-devel
git clone https://github.com/Hmbown/DeepSeek-TUI.git
cd DeepSeek-TUI
cargo install --path crates/cli --locked # requires Rust 1.85+; provides `deepseek`
cargo install --path crates/tui --locked # provides `deepseek-tui`
Both binaries are required. Cross-compilation and platform-specific notes: docs/INSTALL.md.
Other API Providers
# NVIDIA NIM
deepseek auth set --provider nvidia-nim --api-key "YOUR_NVIDIA_API_KEY"
deepseek --provider nvidia-nim
# Fireworks
deepseek auth set --provider fireworks --api-key "YOUR_FIREWORKS_API_KEY"
deepseek --provider fireworks --model deepseek-v4-pro
# Self-hosted SGLang
SGLANG_BASE_URL="http://localhost:30000/v1" deepseek --provider sglang --model deepseek-v4-flash
What's New In v0.8.10
A patch release: hotfixes, UX polish, and runtime API additions for the whalescale desktop integration. No breaking changes. Full changelog.
- Stacked toast overlay — status toasts queue and render together instead of overwriting each other
- File @-mention frecency — file mention suggestions learn from recent selections (
~/.deepseek/file-frecency.jsonl) - Runtime API expansion — CORS origins config, full thread editing (
PATCH /v1/threads/{id}),archived_onlyquery filter, aggregate usage endpoint (GET /v1/usage?group_by=day|model|provider|thread) - Language picker on first run — new onboarding step selects locale before entering the API key
- OPENCODE shell.env hook — lifecycle hooks can inject shell environment into spawned commands
- Cache-aware compaction — compaction calls reuse cached prompt prefixes, cutting
/compactcosts significantly - glibc 2.28 baseline — prebuilts now target glibc 2.28 (via
cargo zigbuild), covering older distros; npm postinstall fails fast with a clear source-build message when incompatible - Better markdown rendering — transcript now handles tables, bold/italic, and horizontal rules; no more infinite loops on unclosed markers
- MCP SIGTERM on shutdown — stdio servers receive SIGTERM with a 2-second grace period instead of SIGKILL
- Shell-child PDEATHSIG on Linux — children auto-SIGTERM when the parent exits, closing a leak window
- Windows Terminal paste fix — Ctrl/Cmd+V during onboarding now works correctly
- Terminal startup repaint — no more stale background rows above the first frame
- Slash-prefix Enter activation — typing
/moand pressing Enter activates the first match - Shell
cwdboundary validation — path escape returnsPathEscapeon out-of-workspacecwd, consistent with file tools
6 first-time contributors: @staryxchen (#556), @shentoumengxin (#524), @Vishnu1837 (#565), @20bytes (#569), @loongmiaow-pixel (#578), @WyxBUPT-22 (#579). Thanks also to @lloydzhou, @jeoor, @toi500, @xsstomy, and @melody0709 for bug reports.
Usage
deepseek # interactive TUI
deepseek "explain this function" # one-shot prompt
deepseek --model deepseek-v4-flash "summarize" # model override
deepseek --yolo # auto-approve tools
deepseek auth set --provider deepseek # save API key
deepseek doctor # check setup & connectivity
deepseek doctor --json # machine-readable diagnostics
deepseek setup --status # read-only setup status
deepseek setup --tools --plugins # scaffold tool/plugin dirs
deepseek models # list live API models
deepseek sessions # list saved sessions
deepseek resume --last # resume latest session
deepseek serve --http # HTTP/SSE API server
deepseek pr <N> # fetch PR and pre-seed review prompt
deepseek mcp list # list configured MCP servers
deepseek mcp validate # validate MCP config/connectivity
deepseek mcp-server # run dispatcher MCP stdio server
Keyboard Shortcuts
| Key | Action |
|---|---|
Tab |
Complete / or @ entries; while running, queue draft as follow-up; otherwise cycle mode |
Shift+Tab |
Cycle reasoning-effort: off → high → max |
F1 |
Searchable help overlay |
Esc |
Back / dismiss |
Ctrl+K |
Command palette |
Ctrl+R |
Resume an earlier session |
Alt+R |
Search prompt history and recover cleared drafts |
Ctrl+S |
Stash current draft (/stash list, /stash pop to recover) |
@path |
Attach file/directory context in composer |
↑ (at composer start) |
Select attachment row for removal |
Alt+↑ |
Edit last queued message |
Full shortcut catalog: docs/KEYBINDINGS.md.
Modes
| Mode | Behavior |
|---|---|
| Plan 🔍 | Read-only investigation — model explores and proposes a plan (update_plan + checklist_write) before making changes |
| Agent 🤖 | Default interactive mode — multi-step tool use with approval gates; model outlines work via checklist_write |
| YOLO ⚡ | Auto-approve all tools in a trusted workspace; still maintains plan and checklist for visibility |
Configuration
User config: ~/.deepseek/config.toml. Project overlay: <workspace>/.deepseek/config.toml (denied: api_key, base_url, provider, mcp_config_path). config.example.toml has every option.
Key environment variables:
| Variable | Purpose |
|---|---|
DEEPSEEK_API_KEY |
API key |
DEEPSEEK_BASE_URL |
API base URL |
DEEPSEEK_MODEL |
Default model |
DEEPSEEK_PROVIDER |
deepseek (default), nvidia-nim, fireworks, sglang |
DEEPSEEK_PROFILE |
Config profile name |
DEEPSEEK_MEMORY |
Set to on to enable user memory |
NVIDIA_API_KEY / FIREWORKS_API_KEY / SGLANG_API_KEY |
Provider auth |
SGLANG_BASE_URL |
Self-hosted SGLang endpoint |
NO_ANIMATIONS=1 |
Force accessibility mode at startup |
SSL_CERT_FILE |
Custom CA bundle for corporate proxies |
UI locale is separate from model language — set locale in settings.toml, use /config locale zh-Hans, or rely on LC_ALL/LANG. See docs/CONFIGURATION.md and docs/MCP.md.
Models & Pricing
| Model | Context | Input (cache hit) | Input (cache miss) | Output |
|---|---|---|---|---|
deepseek-v4-pro |
1M | $0.003625 / 1M* | $0.435 / 1M* | $0.87 / 1M* |
deepseek-v4-flash |
1M | $0.0028 / 1M | $0.14 / 1M | $0.28 / 1M |
Legacy aliases deepseek-chat / deepseek-reasoner map to deepseek-v4-flash. NVIDIA NIM variants use your NVIDIA account terms.
DeepSeek Pro rates currently reflect a limited-time 75% discount, which remains valid until 15:59 UTC on 5 May 2026. After that time, the TUI cost estimator will revert to the base Pro rates.
Publishing Your Own Skill
DeepSeek TUI discovers skills from workspace directories (.agents/skills → skills → .opencode/skills → .claude/skills) and the global ~/.deepseek/skills. Each skill is a directory with a SKILL.md file:
~/.deepseek/skills/my-skill/
└── SKILL.md
Frontmatter required:
---
name: my-skill
description: Use this when DeepSeek should follow my custom workflow.
---
# My Skill
Instructions for the agent go here.
Commands: /skills (list), /skill <name> (activate), /skill new (scaffold), /skill install github:<owner>/<repo> (community), /skill update / uninstall / trust. Community installs from GitHub require no backend service. Installed skills appear in the model-visible session context; the agent can auto-select relevant skills via the load_skill tool when your task matches their descriptions.
Documentation
| Doc | Topic |
|---|---|
| ARCHITECTURE.md | Codebase internals |
| CONFIGURATION.md | Full config reference |
| MODES.md | Plan / Agent / YOLO modes |
| MCP.md | Model Context Protocol integration |
| RUNTIME_API.md | HTTP/SSE API server |
| INSTALL.md | Platform-specific install guide |
| MEMORY.md | User memory feature guide |
| SUBAGENTS.md | Sub-agent role taxonomy and lifecycle |
| KEYBINDINGS.md | Full shortcut catalog |
| RELEASE_RUNBOOK.md | Release process |
| OPERATIONS_RUNBOOK.md | Ops & recovery |
Full Changelog: CHANGELOG.md.
Thanks
Earlier releases shipped with help from these contributors:
- Hafeez Pizofreude — SSRF protection in
fetch_urland Star History chart - Unic (YuniqueUnic) — Schema-driven config UI (TUI + web)
- Jason — SSRF security hardening
Contributing
See CONTRIBUTING.md. Pull requests welcome — check the open issues for good first contributions.
Note
Not affiliated with DeepSeek Inc.
