plan(ssh): add transparent SSH support design & implementation plan

Research and plan transparent SSH execution so an agent runs commands on a remote computer as if local — the agent never learns it is using SSH. Covers: - How the cwd → ToolExecuteContext pipeline works today and where a computerId threads in (mirroring cwd end-to-end) - The ExecBackend abstraction (spawn + fs) behind which tool-shell/ read-file/write-file/edit-file are refactored, with LocalExecBackend (node) and SshExecBackend (ssh2) implementations - Computer data model + workspace defaultComputerId + per-conversation override, mirroring the getEffectiveCwd resolution ladder (null = local) - SSH connection pooling (one per computer, lazy connect, keep-alive, idle reaping), auth via SecretsAccess/gopass, host-key verification - Turn loop / dispatch integration (additive optional computerId field, backward-compatible — absent = today's local behavior) - LSP/MCP degrade by dropping those tools on remote turns (future: remote server spawn over SSH) - API surface (computer CRUD, per-conv + workspace-default endpoints, chat.send gains computerId), frontend impact - Security, edge cases, phased implementation, contract gaps reported to unit owners (one-owner-per-unit honored — planner does not edit others) No code changed; planning document only. No merge or push.
author: Adam Malczewski <[email protected]> 2026-06-25 10:38:18 +0900
committer: Adam Malczewski <[email protected]> 2026-06-25 10:38:18 +0900
commit: a4c4b4a7ea6ead10ee969c5cc474768e29fc2558 (patch)
tree: edad9b824ea6ea6a120f97683301e0cb2eaff27e /notes
parent: 8a74335c21a57ee63c9a0658754430d6520dbc87 (diff)
download: dispatch-a4c4b4a7ea6ead10ee969c5cc474768e29fc2558.tar.gz
dispatch-a4c4b4a7ea6ead10ee969c5cc474768e29fc2558.zip
1 files changed, 917 insertions, 0 deletions
diff --git a/notes/ssh-support-plan.md b/notes/ssh-support-plan.md
new file mode 100644
index 0000000..a4a19b3
--- /dev/null
+++ b/notes/ssh-support-plan.md
@@ -0,0 +1,917 @@
+# SSH Support — Design & Implementation Plan
+
+> **Status:** Planning. No implementation has begun.
+> **Branch:** `feature/ssh-support`
+> **Scope:** Transparent SSH execution so an agent runs commands on a remote
+> computer as if local — the agent never learns it is using SSH.
+> This plan follows the architecture rules in `AGENTS.md` (kernel → core →
+> standard tiers; effects at the edges; contracts are the only cross-unit
+> surface; one owner per unit).
+
+---
+
+## 0. Goals (from the feature brief)
+
+1. **Remote computer selection** — alongside `cwd`, a user can select another
+   computer to connect to via SSH. When an agent runs commands, they execute on
+   that remote computer transparently. The agent must NOT know it is using SSH;
+   it just runs commands and they happen on the remote machine.
+2. **Workspace-level defaults** — a user can set a computer as the default for a
+   workspace. Any agent summoned in that workspace without an explicitly
+   assigned computer inherits the workspace's configured computer.
+3. **Per-conversation override** — a conversation can specify its own computer,
+   overriding the workspace default.
+
+The non-functional requirement that shapes everything below: **transparency.**
+The model sees identical tools with identical descriptions whether execution is
+local or remote. Only the tool *implementation* routes differently per call.
+
+---
+
+## 1. How execution works today (the seam we plug into)
+
+### 1.1 The cwd → tool pipeline
+
+`cwd` is already threaded end-to-end through a pure, injected path. SSH support
+mirrors this exact path with a `computerId`:
+
+```
+ChatRequest.cwd / ChatRequest.workspaceId        (transport-contract)
+  → StartTurnInput.cwd / workspaceId             (session-orchestrator)
+    → runTurnDetached: getEffectiveCwd(...)        (resolve per-turn)
+      → RunTurnInput.cwd                          (kernel contract)
+        → StepContext.cwd                         (run-turn.ts)
+          → createStepDispatcher(..., cwd)        (dispatch.ts)
+            → executeToolCall(..., cwd)
+              → ToolExecuteContext.cwd            (contracts/tool.ts)
+                → tool.execute(args, ctx)        uses ctx.cwd
+```
+
+Key files (the single-owner units this feature touches):
+
+| Unit (package) | Role | Current local-only behavior |
+|---|---|---|
+| `kernel` (contracts) | `RunTurnInput.cwd`, `ToolExecuteContext.cwd` | Threads a string; never interprets |
+| `kernel` (runtime) | `dispatch.ts` `executeToolCall` builds `ToolExecuteContext` | Forwards `cwd` verbatim |
+| `session-orchestrator` | `getEffectiveCwd` resolution; builds `RunTurnInput` | Resolves cwd against workspace `defaultCwd` |
+| `conversation-store` | `Workspace.defaultCwd`, per-conv cwd, `getEffectiveCwd` | Stores cwd + workspace |
+| `tool-shell` | `run_shell` tool; `SpawnShell` interface | `realSpawn` = `node:child_process` |
+| `tool-read-file` | `read_file` tool | `node:fs/promises` directly (readdir/readFile/stat) |
+| `tool-write-file` | `write_file` tool | `node:fs/promises` directly (access/stat/writeFile) |
+| `tool-edit-file` | `edit_file` tool | `node:fs` directly |
+| `lsp` | spawns language servers, reads/watches files | `Bun.spawn`, `Bun.file`, `node:fs.watch` |
+| `mcp` | spawns MCP servers | injectable `spawn` adapter |
+| `transport-http` | `POST /chat`, cwd/workspace endpoints | — |
+| `transport-ws` | `chat.send` message (cwd/workspaceId) | — |
+| `host-bin` | wires extensions + `process.cwd()` into tools | — |
+
+### 1.2 The critical finding: tools hardcode `node:fs`/`node:child_process`
+
+The shell tool already has an injectable seam — `SpawnShell`:
+
+```ts
+// packages/tool-shell/src/shell.ts
+export type SpawnShell = (params: {
+  readonly command: string;
+  readonly cwd: string;
+  readonly signal: AbortSignal;
+  readonly timeout: number;
+  readonly onOutput: (data: string, stream: "stdout" | "stderr") => void;
+}) => Promise<SpawnResult>;
+```
+
+But it is bound **once at activation** with a fixed local spawn:
+
+```ts
+// packages/tool-shell/src/extension.ts
+host.defineTool(createRunShellTool({ workdir: process.cwd(), spawn: realSpawn }));
+```
+
+The filesystem tools (`read_file`, `write_file`, `edit_file`) are **worse**: they
+call `node:fs/promises` *directly inside `execute()`*, with no injection seam at
+all. The only injection is the `workdir` (bound to `process.cwd()` at boot).
+
+**Implication:** transparency is not free. To run a tool remotely, the tool must
+resolve its execution backend *per call* from `ToolExecuteContext` — not from a
+binding fixed at activation. This requires:
+
+1. A new `ExecBackend` (a.k.a. host backend) abstraction over spawn + fs.
+2. Threading a `computerId` through `ToolExecuteContext` (mirroring `cwd`).
+3. Refactoring the filesystem tools to use the injected backend instead of
+   `node:fs` directly.
+
+### 1.3 The workspace model we mirror
+
+`getEffectiveCwd` (conversation-store) is the exact resolution pattern to clone
+for computers:
+
+1. **Absolute per-conversation cwd** → used outright
+2. **Relative per-conversation cwd** → resolved against workspace `defaultCwd`
+3. **No per-conversation cwd** → workspace `defaultCwd`
+4. **Neither** → `serverDefaultCwd` (`process.cwd()`)
+
+For computers the ladder is:
+
+1. **Per-conversation `computerId`** → used outright
+2. **No per-conversation `computerId`** → workspace `defaultComputerId`
+3. **Neither** → `null` = **local** (no SSH; today's behavior)
+
+`null` (local) is the "server default" equivalent — and it is the ONLY level
+that requires no SSH, so the feature degrades cleanly to today's behavior when no
+computer is configured anywhere.
+
+---
+
+## 2. Core design: the `ExecBackend` abstraction
+
+### 2.1 The contract (new, lives in a new core extension `exec-backend`)
+
+The central abstraction is an `ExecBackend`: the union of spawn + filesystem
+operations a tool needs, expressed against **paths and bytes**, never against
+`node:fs`/`child_process`. There are exactly two implementations:
+
+- `LocalExecBackend` — wraps `node:fs/promises` + `node:child_process` (today's
+  behavior, factored out).
+- `SshExecBackend` — wraps `ssh2` `exec` + `sftp` (new, in the `ssh` extension).
+
+```ts
+// packages/exec-backend/src/backend.ts  (NEW core extension)
+
+/** A spawned process's stdio handles + lifecycle, transport-agnostic. */
+export interface ExecResult {
+  readonly exitCode: number | null;
+  readonly timedOut: boolean;
+  readonly aborted: boolean;
+}
+
+export interface SpawnParams {
+  readonly command: string;
+  readonly cwd: string;
+  readonly signal: AbortSignal;
+  readonly timeout: number;
+  readonly onOutput: (data: string, stream: "stdout" | "stderr") => void;
+}
+
+/** Stat result — the subset read_file/write_file/edit_file need. */
+export interface StatResult {
+  readonly isFile: boolean;
+  readonly isDirectory: boolean;
+}
+
+/**
+ * The execution backend: spawn + a minimal filesystem surface.
+ * Tools program against THIS, never against node:fs. Two implementations:
+ * local (node) and ssh (ssh2). Resolved per-call from ToolExecuteContext.
+ *
+ * Deliberately a SMALL surface (only what the bundled tools use) so a remote
+ * implementation is tractable. New operations are added here, not ad hoc.
+ */
+export interface ExecBackend {
+  /** Run a shell command, streaming stdout/stderr. The shell-tool seam. */
+  readonly spawn: (params: SpawnParams) => Promise<ExecResult>;
+
+  // --- filesystem (the read_file / write_file / edit_file surface) ---
+  readonly readFile: (path: string) => Promise<string>;
+  readonly writeFile: (path: string, content: string) => Promise<void>;
+  readonly stat: (path: string) => Promise<StatResult>;
+  readonly readdir: (path: string) => Promise<readonly { readonly name: string; readonly isDirectory: boolean }[]>;
+  /** Check existence without throwing. */
+  readonly exists: (path: string) => Promise<boolean>;
+}
+```
+
+### 2.2 Resolution: `computerId` on `ToolExecuteContext` + a resolver
+
+The tool cannot reach the Host API at `execute` time (it only gets
+`ToolExecuteContext`). So resolution flows through the context, mirroring `cwd`:
+
+**Kernel contract change** (additive, optional field — backward compatible):
+
+```ts
+// packages/kernel/src/contracts/tool.ts  (MODIFIED)
+export interface ToolExecuteContext {
+  readonly toolCallId: string;
+  readonly onOutput: (data: string, stream: "stdout" | "stderr") => void;
+  readonly signal: AbortSignal;
+  readonly log: Logger;
+  readonly cwd?: string;
+  readonly conversationId?: string;
+  /**
+   * The computer this tool-call executes on (NEW). When omitted/undefined,
+   * execution is LOCAL (today's behavior). When set, tools resolve a remote
+   * ExecBackend via the injected resolver. The kernel never interprets it —
+   * it forwards verbatim from RunTurnInput, like cwd.
+   */
+  readonly computerId?: string;
+}
+```
+
+```ts
+// packages/kernel/src/contracts/runtime.ts  (MODIFIED — RunTurnInput)
+export interface RunTurnInput {
+  // ...existing fields...
+  readonly cwd?: string;
+  /**
+   * The computer to execute this turn's tools on (NEW). Omitted = local.
+   * Forwarded verbatim to each ToolExecuteContext.computerId. Like cwd, it
+   * never enters the model prompt (no prompt-cache impact).
+   */
+  readonly computerId?: string;
+  // ...
+}
+```
+
+The dispatch runtime (`executeToolCall`) threads `computerId` exactly as it does
+`cwd` today (one extra optional arg / field).
+
+### 2.3 How a tool resolves its backend
+
+Each affected tool is constructed with an injected **backend resolver** — a
+function `(computerId?) => ExecBackend`. At `execute` time it calls
+`resolveBackend(ctx.computerId)`:
+
+```ts
+// packages/tool-shell/src/shell.ts  (MODIFIED factory signature)
+export function createRunShellTool(deps: {
+  readonly workdir: string;
+  /** Resolve the execution backend for a call. computerId undefined = local. */
+  readonly resolveBackend: (computerId?: string) => ExecBackend;
+  readonly outputCap?: number;
+}): ToolContract {
+  // ...
+  async execute(args, ctx) {
+    // ...
+    const backend = deps.resolveBackend(ctx.computerId);
+    spawnResult = await backend.spawn({ command, cwd: effectiveCwd, signal, timeout, onOutput });
+    // ...
+  }
+}
+```
+
+The `LocalExecBackend` ignores `computerId`; the `SshExecBackend` is built for a
+specific connection (see §4). The resolver (provided by the `exec-backend`
+extension via a service handle, wired in `host-bin`) returns the right one.
+
+> **Why a resolver function and not `host.getService` inside execute?** Tools
+> don't receive `host` at execute time — only `ToolExecuteContext`. Injecting the
+> resolver at construction (like `spawn` today) keeps the tool pure-ish and
+> testable (a test injects a fake resolver), consistent with the existing
+> `SpawnShell`/`McpExtensionDeps` injection patterns. The resolver is the ONE
+> piece of ambient-ish wiring; it is owned by the `exec-backend` extension and
+> is reproducible from inputs (computerId → backend).
+
+### 2.4 The filesystem tools must be refactored
+
+`read_file`/`write_file`/`edit_file` currently call `node:fs/promises` inline.
+They must be rewritten to call `backend.readFile(...)` etc. through the same
+injected resolver. The pure logic (validate args, slice lines, diff, decide
+overwrite) stays pure and untouched — only the I/O calls move behind the
+backend. This is the bulk of the mechanical work but is low-risk: the contract is
+a strict subset of `node:fs`.
+
+---
+
+## 3. Data model: the `Computer` entity
+
+### 3.1 The `Computer` type (wire)
+
+Mirrors `Workspace` — a backend-owned, slug-addressed entity. Lives in
+`@dispatch/wire` so the frontend can depend on the wire alone.
+
+```ts
+// packages/wire/src/index.ts  (NEW)
+export interface Computer {
+  /** URL slug (immutable). Lowercase [a-z0-9-], 1–40 chars. */
+  readonly id: string;
+  /** Display title (editable). Defaults to id on creation. */
+  readonly title: string;
+  /** SSH hostname or IP. */
+  readonly host: string;
+  /** SSH port. Default 22. */
+  readonly port: number;
+  /** SSH username. */
+  readonly username: string;
+  /**
+   * Auth method: "key" (a private key reference) | "password" | "agent".
+   * The secret itself is NEVER stored here — only a reference into the secret
+   * store (see §7). "agent" uses the local ssh-agent (no secret reference).
+   */
+  readonly authMethod: "key" | "password" | "agent";
+  /**
+   * Reference to the credential in the secret store (a gopass-style key),
+   * when authMethod is "key" or "password". Absent for "agent".
+   */
+  readonly secretRef?: string;
+  readonly createdAt: number;
+  readonly lastActivityAt: number;
+}
+
+export interface ComputerEntry extends Computer {
+  /** Number of conversations/workspaces using this computer. */
+  readonly usageCount: number;
+}
+```
+
+### 3.2 Storage — a new `computer-store` extension
+
+Following the existing pattern, a `ComputerStore` (parallel to the workspace
+methods already on `ConversationStore`). Two ownership options:
+
+- **Option A (recommended):** a new `computer-store` extension owning `Computer`
+  CRUD + the `defaultComputerId` on workspaces + per-conversation `computerId`.
+  Keeps `conversation-store` unchanged (one owner per unit). The
+  `getEffectiveComputer` resolution lives here.
+- Option B: extend `conversation-store` with computer methods. Fewer packages
+  but violates one-owner-per-unit and grows an already-large store.
+
+**Recommendation: Option A.** `computer-store` is the single owner of the
+`Computer` entity and the `getEffectiveComputer` resolution. It depends on
+`conversation-store` only for reading `workspaceId` (via the existing
+`getWorkspaceId`) — or, cleaner, it owns a parallel `Workspace.defaultComputerId`
+field. Since `WorkspaceRow` is owned by `conversation-store`, the cleanest split
+is:
+
+- `conversation-store` gains a `defaultComputerId: string | null` field on
+  `WorkspaceRow`/`Workspace` (it already owns `defaultCwd` — symmetric).
+- `computer-store` owns the `Computer` entity, the per-conversation `computerId`
+  persistence, and `getEffectiveComputer` (which reads
+  `conversation-store.getWorkspaceId` + `getWorkspace` for the default, then
+  its own per-conversation value).
+
+```ts
+// packages/computer-store/src/store.ts  (NEW)
+export interface ComputerStore {
+  readonly getComputer: (id: string) => Promise<Computer | null>;
+  readonly ensureComputer: (id: string, opts?: ComputerCreateOpts) => Promise<Computer>;
+  readonly setComputerTitle: (id: string, title: string) => Promise<Computer>;
+  readonly deleteComputer: (id: string) => Promise<{ closedCount: number }>;
+  readonly listComputers: () => Promise<readonly ComputerEntry[]>;
+
+  // per-conversation
+  readonly getComputerId: (conversationId: string) => Promise<string | null>;
+  readonly setComputerId: (conversationId: string, computerId: string | null) => Promise<void>;
+
+  // resolution — mirrors getEffectiveCwd
+  readonly getEffectiveComputer: (
+    conversationId: string,
+    overrideComputerId?: string,
+  ) => Promise<Computer | null>; // null = local
+}
+```
+
+### 3.3 The resolution ladder (`getEffectiveComputer`)
+
+```
+1. overrideComputerId (per-turn, from chat.send)      → resolve + return
+2. per-conversation computerId (persisted)             → resolve + return
+3. workspace defaultComputerId                         → resolve + return
+4. none of the above                                   → null  (LOCAL)
+```
+
+`null` is the deliberate "local" sentinel — no SSH connection, today's behavior.
+This is the key to clean degradation and the workspace-default inheritance:
+a new conversation in a workspace with a `defaultComputerId` inherits it without
+persisting anything itself.
+
+### 3.4 `Workspace` gains `defaultComputerId` (conversation-store)
+
+```ts
+// packages/wire/src/index.ts  (MODIFIED)
+export interface Workspace {
+  readonly id: string;
+  readonly title: string;
+  readonly defaultCwd: string | null;
+  /** NEW: default computer for conversations in this workspace. null = local. */
+  readonly defaultComputerId: string | null;
+  readonly createdAt: number;
+  readonly lastActivityAt: number;
+}
+```
+
+`conversation-store` gains `setWorkspaceDefaultComputerId(id, computerId | null)`
+(parallel to `setWorkspaceDefaultCwd`). This is the **one contract gap** this
+plan reports to the `conversation-store` owner: a new field on `WorkspaceRow` +
+`Workspace` + a setter. (Per the constitution, the planner does not edit
+conversation-store; it reports the needed change.)
+
+---
+
+## 4. SSH connection management (the `ssh` extension)
+
+### 4.1 Library
+
+**`ssh2`** (mscdex/ssh2, v1.17.0) — the standard pure-JS SSH2 client, MIT, 5.8k
+stars, 2k+ dependents. It provides:
+
+- `client.exec(command, opts)` → a stream with `stdout`/`stderr` `'data'` events
+  and an `'exit'` event (exit code). This maps directly onto `SpawnShell`.
+- `client.sftp()` → an SFTP session with `readFile`, `writeFile`, `stat`,
+  `readdir`, `createReadStream`/`createWriteStream`. This implements the fs half
+  of `ExecBackend`.
+- Auth: `privateKey`, `password`, `agent` (`ssh-agent`), `keyboard-interactive`.
+
+**Bun compatibility:** `ssh2` relies on Node's `crypto`/`Stream` APIs. The
+project runs on **Bun**. Two paths:
+
+- **Primary:** use `ssh2` directly and verify under Bun (Bun implements most
+  Node compat; ssh2's native addon `cpu-features` is optional). Validate with a
+  smoke test early in implementation.
+- **Fallback:** the `bun-ssh2` fork (artokun/bun-ssh2) is a Bun-patched variant.
+  If `ssh2` fails under Bun, swap the import — the API surface is identical.
+
+> **Action for the user (package install):** SSH support needs the `ssh2` (or
+> `bun-ssh2`) dependency added to `packages/ssh/package.json`. Per the package
+> install policy, I will not install system-wide; the implementation agent adds
+> it as a project-local dependency (`bun add ssh2` in the package). No system
+> package is required (ssh2 ships its own crypto; OpenSSH is not needed on the
+> Dispatch host — the library IS the SSH client).
+
+### 4.2 Connection pooling
+
+A single `ssh2` `Client` connection can run **many** `exec` calls and **one**
+SFTP session concurrently, so we pool **one connection per `Computer`** (keyed by
+computer id, since a computer's connection params are stable). This is the
+`SshConnectionPool`, owned by the `ssh` extension:
+
+```ts
+// packages/ssh/src/pool.ts  (NEW)
+export interface SshConnection {
+  /** Acquire the live ssh2 Client (connects lazily on first acquire). */
+  readonly getClient: () => Promise<ssh2.Client>;
+  /** Acquire a shared SFTP session (lazily created, reused). */
+  readonly getSftp: () => Promise<ssh2.SFTPWrapper>;
+  readonly close: () => Promise<void>;
+  /** Live status for the frontend status endpoint. */
+  readonly state: "disconnected" | "connecting" | "connected" | "error";
+}
+
+export interface SshConnectionPool {
+  /** Get-or-connect the pooled connection for a computer. */
+  readonly acquire: (computerId: string) => Promise<SshConnection>;
+  /** Close + drop a single computer's connection (on error / manual disconnect). */
+  readonly drop: (computerId: string) => Promise<void>;
+  /** Close all (shutdown). */
+  readonly closeAll: () => Promise<void>;
+  /** Status snapshot for all known computers. */
+  readonly status: () => readonly { readonly computerId: string; readonly state: SshConnection["state"]; readonly error?: string }[];
+}
+```
+
+**Lifecycle / pooling rules:**
+
+- **Lazy connect:** the first `acquire(computerId)` for a computer opens the
+  connection. Subsequent acquires reuse it (no reconnect per command — this is
+  the transparency + performance win over spawning `ssh` per call).
+- **Keep-alive:** ssh2 supports `keepaliveInterval` / `keepaliveCountMax`.
+  Configure (e.g. 30s interval, 3 misses) so idle pooled connections detect
+  dead peers without a user-visible hang.
+- **Idle reaping:** a periodic sweep closes connections unused for N minutes
+  (configurable; default ~15m) to avoid holding sockets on remote hosts. The
+  next `acquire` reconnects transparently.
+- **Per-computer single connection** is the MVP. If a remote host becomes a
+  bottleneck (many concurrent tool calls — note the default dispatch is
+  `maxConcurrent: 1`, so this is unlikely), the pool can grow to a small cap
+  (e.g. 3) per computer later. SFTP is a single session per connection; if fs
+  contention appears, open additional SFTP sessions on the same connection.
+
+### 4.3 The `SshExecBackend`
+
+Built per `acquire`, wrapping the pooled connection's `exec` + `sftp`:
+
+```ts
+// packages/ssh/src/backend.ts  (NEW)
+export function createSshExecBackend(conn: SshConnection, computer: Computer): ExecBackend {
+  return {
+    async spawn(params) {
+      const client = await conn.getClient();
+      // sh -c on the remote, cwd via `cd <cwd> && ...` or exec opts.
+      // ssh2 exec has no cwd option → prefix `cd "$cwd" && ` (shell-quoted).
+      const wrapped = `cd ${shellQuote(params.cwd)} && ${params.command}`;
+      return new Promise((resolve) => {
+        client.exec(wrapped, { pty: false }, (err, stream) => {
+          if (err) return resolve({ exitCode: 1, timedOut: false, aborted: false });
+          // wire stream.stdout/stderr 'data' → params.onOutput
+          // wire stream 'exit'/'close' → resolve({exitCode, ...})
+          // wire params.signal abort → stream.end(); resolve aborted
+          // wire params.timeout → stream.end(); resolve timedOut
+        });
+      });
+    },
+    async readFile(path) { const sftp = await conn.getSftp(); return sftp.readFile(path, "utf8"); /* throws ENOENT → map */ },
+    async writeFile(path, content) { const sftp = await conn.getSftp(); return sftp.writeFile(path, content, "utf8"); },
+    async stat(path) { const sftp = await conn.getSftp(); const s = await sftp.stat(path); return { isFile: s.isFile(), isDirectory: s.isDirectory() }; },
+    async readdir(path) { const sftp = await conn.getSftp(); const list = await sftp.readdir(path); /* map → {name, isDirectory} */ },
+    async exists(path) { try { await sftp.stat(path); return true; } catch { return false; } },
+  };
+}
+```
+
+**Error mapping:** `node:fs` throws `ENOENT` etc. with `.code`. ssh2/SFTP errors
+have different shapes. The `SshExecBackend` maps them to the `node:fs`-style
+errors the existing tool pure-logic expects (e.g. `(err as NodeJS.ErrnoException).code
+=== "ENOENT"`), so the tools' existing error branches (`read_file`'s "File not
+found") work unchanged. This mapping lives in the backend, not the tools.
+
+### 4.4 Auth & secrets
+
+- `computer.secretRef` is a key into the secret store (the `SecretsAccess` Host
+  API surface; production wired to gopass per the secrets-management skill).
+- The `ssh` extension resolves the secret at connect time (lazily, on first
+  `acquire`): `host.secrets.get(computer.secretRef)` → the private key PEM or
+  password string. The secret never leaves the `ssh` extension and is never
+  persisted in the `Computer` row.
+- `agent` auth forwards to the local `ssh-agent` (ssh2's `agent: process.env.SSH_AUTH_SOCK`)
+  — no secret reference needed.
+- `keyboard-interactive` / `password` auth is supported but discouraged for
+  production; key/agent is the default recommendation.
+
+---
+
+## 5. Integration with the turn loop & tool dispatch
+
+### 5.1 Threading `computerId` end-to-end
+
+The change is a strict superset of the cwd threading — one more optional field
+at each hop:
+
+```
+ChatRequest.computerId (NEW) / Workspace.defaultComputerId (NEW)
+  → StartTurnInput.computerId (NEW)
+    → runTurnDetached: getEffectiveComputer(...) (NEW resolution)
+      → RunTurnInput.computerId (NEW)
+        → StepContext.computerId (NEW)
+          → createStepDispatcher(..., computerId) (NEW arg)
+            → executeToolCall(..., computerId) (NEW arg)
+              → ToolExecuteContext.computerId (NEW)
+                → tool.execute resolves backend from ctx.computerId
+```
+
+Every one of these is **additive and optional** — when `computerId` is absent
+everywhere, behavior is byte-identical to today (local). This is the
+backward-compatibility invariant.
+
+### 5.2 session-orchestrator changes
+
+`runTurnDetached` already resolves `effectiveCwd` via a chained promise. It
+gains a parallel `effectiveComputer` resolution (mirroring the cwd promise),
+then a `resolveBackend` is wired so tools get the right backend. Concretely:
+
+- Add `computerId?: string` to `StartTurnInput`.
+- Resolve `effectiveComputerId` = `computerStore.getEffectiveComputer(convId, override)`.
+- Persist per-conversation `computerId` on first turn (like cwd).
+- Thread `computerId` into `RunTurnInput` (line ~589 where `opts` is built).
+- The `TurnLifecyclePayload` gains `computerId` (for cache-warming symmetry —
+  a warm probe must assemble tools under the same computer so the *tool
+  descriptions* match; see §5.4).
+
+### 5.3 The `exec-backend` extension wires the resolver
+
+A new core extension `exec-backend` provides a service handle
+`execBackendHandle: ServiceHandle<ExecBackendResolver>` where
+`ExecBackendResolver = (computerId?: string) => ExecBackend`. Its implementation:
+
+```ts
+function resolveBackend(computerId?: string): ExecBackend {
+  if (computerId === undefined) return localBackend;          // local
+  const ssh = sshPool.acquire(computerId);                     // remote (async!)
+  return sshBackendFor(computerId);
+}
+```
+
+**Subtlety: `acquire` is async.** The resolver must return a backend whose
+methods are async (they already are — `spawn`/`readFile` return Promises), so
+the connection is acquired lazily *inside* the first backend method call, not
+at resolver-call time. The resolver stays synchronous; the `SshExecBackend`
+captures the `computerId` + a lazy `acquire` thunk. This keeps the resolver
+side-effect-free (no connection opened merely by resolving a backend — only
+when a tool actually executes).
+
+The tool extensions (`tool-shell`, `tool-read-file`, `tool-write-file`,
+`tool-edit-file`) gain a `resolveBackend` dep injected at activation
+(`host-bin` wires `host.getService(execBackendHandle)`).
+
+### 5.4 Cache-warming & prompt-cache safety
+
+`cache-warming` replays the conversation's prefix to warm the provider cache. It
+assembles tools via `applyToolsFilter` under the *same cwd* today. With SSH, the
+**tool descriptions are unchanged** (transparency!), so the prompt-cache prefix
+is unaffected by the computer — UNLESS a tools-filter changes the tool *set*
+based on computer (e.g. dropping LSP tools when remote; see §6). The plan:
+`WarmService.warm` and the tools-filter must thread `computerId` so any
+computer-sensitive filtering is byte-stable between warm and real turns. This
+is the same invariant the codebase already enforces for cwd.
+
+### 5.5 System prompt
+
+The system prompt is cwd-aware today (it may include the cwd). For transparency,
+the prompt should NOT reveal "you are on a remote machine" — the agent must not
+know. The cwd shown to the model is the *remote* cwd (a path on the remote
+machine), which is already what `ctx.cwd` would be. No system-prompt change is
+required for transparency. (Optionally, a future `{{computer}}` template variable
+could be added, but that would *break* transparency — out of scope / discouraged.)
+
+---
+
+## 6. LSP, MCP, and other spawned-process extensions
+
+### 6.1 LSP — the hard case
+
+The LSP extension spawns a **language server process** (e.g. `typescript-language-server`)
+rooted at the workspace, communicating over stdio. For full transparency, this
+process would need to run on the remote machine and Dispatch would bridge its
+stdio over SSH. ssh2 supports this (`client.exec` with a shell that runs the
+server, forwarding its stdio) — but it is significantly more complex than file
+ops (long-lived process, framing, file-watching over SFTP).
+
+**MVP decision: degrade gracefully.** When `effectiveComputer !== null` (remote):
+
+- The `lsp` tool's per-edit diagnostics are **skipped** (the `edit_file` tool
+  already degrades to no-diagnostics when LSP is unavailable — the existing
+  try/catch path).
+- The LSP status endpoint reports "disabled on remote computers" for that
+  conversation.
+
+**Future phase:** a `RemoteLspManager` that spawns the language server over SSH
+and bridges stdio + uses SFTP for `didOpen`/file-watching. This is a large,
+separate unit of work and is **out of scope** for the initial SSH feature. The
+plan records it as a known limitation; the `lsp` extension owner gets a
+change-request when remote LSP is prioritized.
+
+This is enforced cleanly via the **tools filter**: the session-orchestrator's
+`toolsFilter` (owned by session-orchestrator) drops the `lsp` tool from the
+turn's tool set when `effectiveComputer !== null`. The model simply doesn't see
+the `lsp` tool on remote turns — consistent with how MCP drops disconnected
+servers' tools today.
+
+### 6.2 MCP
+
+MCP servers are configured per-cwd (`.dispatch/mcp.json`). They spawn local
+processes. For a remote conversation, the MCP servers should be **discovered on
+the remote machine** (read the remote `.dispatch/mcp.json` via SFTP) and spawned
+remotely. This is also complex (long-lived remote processes).
+
+**MVP decision:** MCP tools are **also dropped** via the tools filter when
+remote (same mechanism as LSP). A future phase adds remote MCP server spawn
+over SSH. Recorded as a known limitation.
+
+### 6.3 Tools unaffected by SSH
+
+`web_search`, `youtube_transcript` — these hit the network from the Dispatch host
+(not the remote machine), so they are **unaffected** and remain available on
+remote turns. `todo` is in-memory. These need no changes.
+
+---
+
+## 7. Security considerations
+
+1. **Secrets never in the `Computer` row.** Only `secretRef` (a store key) is
+   persisted; the key/password is fetched from `SecretsAccess` at connect time
+   and held only in the `ssh` extension's process memory (on the pooled
+   connection). This mirrors how `credential-store` holds provider keys.
+2. **Secrets transit gopass → env → container** per the secrets-management
+   skill. The `Computer.secretRef` maps to a gopass entry; `host-bin` (or the
+   `ssh` extension) resolves it. The secret is never logged, never returned by
+   any API.
+3. **Host key verification.** ssh2 supports `hostVerifier` / `hostVerifier
+   callback`. The MVP **should** implement host-key pinning: on first connect,
+   the host key fingerprint is recorded (stored on the `Computer` or a separate
+   `known_hosts`-style field); subsequent connects verify it matches. A mismatch
+   → refuse connection + surface "HOST KEY CHANGED" to the user (never silently
+   connect). This prevents MITM. (A config flag `strictHostKey: true` (default)
+   vs `acceptNew` for first-seen is the OpenSSH analog.)
+4. **No agent-forwarding** by default (avoids credential leakage to the remote).
+5. **No PTY by default** for `exec` (`pty: false`) — commands run non-interactively,
+   output captured as today. PTY would risk leaking control chars / interactive
+   prompts hanging.
+6. **Command injection** — the shell tool already passes the model's `command`
+  to `sh -c` locally; SSH does not change this threat model (the agent is already
+  trusted to run arbitrary commands). The `cd "$cwd" && ` prefix must
+  **shell-quote** the cwd to avoid a cwd containing shell metachars breaking
+  out — use a proper quoting helper, not string concat.
+7. **Port exposure** — SSH is outbound from the Dispatch host; no inbound ports
+   opened. No change to the existing TLS/cert posture.
+8. **Auth method policy** — `agent` and `key` are preferred; `password` is
+   allowed but the UI should warn. No `password` stored in plaintext anywhere
+   (always via `secretRef` → gopass).
+9. **Auditability** — every remote `exec`/fs op should be logged via the
+   injected `Logger` (the `ssh` extension spans each operation with computerId),
+   so remote activity is traceable. Existing observability (trace-store) covers
+   this if spans are opened.
+
+---
+
+## 8. Edge cases
+
+| Case | Handling |
+|---|---|
+| **Connection drop mid-turn** | The pooled connection errors. The in-flight `spawn`/fs call rejects; the tool returns an error result (`isError: true`) with a clear message ("remote computer connection lost: …"). The model sees a normal tool error and can retry. The pool drops the dead connection; next `acquire` reconnects. The turn is NOT aborted (unlike a signal abort) — the model continues. |
+| **Remote machine offline (connect fails)** | First `acquire` rejects with a connect error → tool error result. A `GET /computers/:id/status` (or extend the existing LSP-status-style endpoint) lets the FE show "offline" before the user sends. |
+| **Timeout** | Each `spawn` carries its own `timeout` (existing tool param, default 120s). The backend enforces it over SSH (close the stream on timeout) — same `timedOut` result as local. Connect itself has a separate (shorter, e.g. 10s) connect timeout so an unreachable host fails fast. |
+| **Auth failure** | Connect rejects with auth error. Surface a specific error ("SSH authentication failed for computer X") via the tool result + the status endpoint. Never retry in a tight loop (avoid account lockout) — fail and let the user fix the secret. |
+| **cwd doesn't exist on remote** | `cd <cwd>` fails on the remote shell → the command exits non-zero with stderr "no such directory". The tool returns an error result; the model can `cd`/`ls` to recover. Same UX as a bad local cwd. |
+| **Path semantics differ (Windows remote)** | MVP assumes POSIX remotes (ssh2 + sh -c). A Windows remote would need `cmd.exe` + path translation — **out of scope**; documented as POSIX-only. |
+| **Long output** | The existing `OUTPUT_CAP` (50k chars) truncation in the shell tool applies identically — the backend streams stdout; the tool caps. No change. |
+| **Concurrent tool calls to same remote** | Default dispatch `maxConcurrent: 1` serializes, so one command at a time. With parallelism enabled, the pooled connection handles concurrent `exec` (ssh2 supports it); SFTP ops are serialized within the single SFTP session or open additional sessions. |
+| **Computer deleted while in use** | `deleteComputer` closes pooled connections + clears `defaultComputerId`/per-conv overrides (mirror `deleteWorkspace` closing conversations). In-flight calls get a dropped-connection error. |
+| **Aborted turn** | `ctx.signal` is threaded into the backend (`spawn` params already take `signal`). On abort, the backend closes the remote stream (best-effort `stream.end()`); the promise resolves `aborted`. The pooled connection stays alive for reuse. |
+| **Secret rotated/removed** | Next `acquire` after a drop fetches the current secret. If removed, connect fails with auth error. |
+
+---
+
+## 9. API surface (transport-contract + transport-http + transport-ws)
+
+All **additive**. Existing endpoints/messages unchanged.
+
+### 9.1 New computer CRUD endpoints
+
+```
+GET    /computers                          → { computers: ComputerEntry[] }
+PUT    /computers/:id                      → ensure/create (host, port, user, auth, secretRef)
+GET    /computers/:id                      → Computer
+PUT    /computers/:id/title                → rename
+DELETE /computers/:id                      → { computerId, closedCount }
+GET    /computers/:id/status               → { computerId, state: "disconnected"|"connecting"|"connected"|"error", error? }
+POST   /computers/:id/test                 → probe-connect (opens a test connection, reports ok/error)
+```
+
+### 9.2 Per-conversation + workspace-default endpoints (mirror cwd)
+
+```
+GET    /conversations/:id/computer         → { conversationId, computerId: string | null }
+PUT    /conversations/:id/computer         → { computerId: string | null }   (null = clear → inherit/local)
+DELETE /conversations/:id/computer         → clear (same as PUT null)
+
+PUT    /workspaces/:id/default-computer    → { computerId: string | null }  (mirror /workspaces/:id/default-cwd)
+```
+
+`GET /workspaces/:id` and `GET /workspaces` return the new `defaultComputerId`
+field (additive).
+
+### 9.3 Chat request gains `computerId`
+
+```ts
+// transport-contract ChatRequest  (MODIFIED — additive optional field)
+export interface ChatRequest {
+  readonly conversationId?: string;
+  readonly message: string;
+  readonly model?: string;
+  readonly cwd?: string;
+  readonly reasoningEffort?: ReasoningEffort;
+  readonly workspaceId?: string;
+  /** NEW: computer to execute this turn's tools on. Omit = inherit (workspace default → local). */
+  readonly computerId?: string;
+}
+```
+
+`POST /chat` body parsing, the WS `chat.send` router (`handleChatSend`), and
+`POST /conversations/:id/queue` (`QueueRequest`) all gain the optional
+`computerId`, threaded identically to `cwd`/`workspaceId`.
+
+### 9.4 Secret handling on the API
+
+`PUT /computers/:id` accepts `secretRef` (a string key) but **never** the secret
+value itself. The actual key material is managed out-of-band (gopass). A
+separate admin path (or env) populates the secret store. This keeps secrets out
+of the HTTP API entirely.
+
+---
+
+## 10. Frontend impact (dispatch-web / worktrees/ssh-support/frontend)
+
+The frontend is a Svelte app; cwd is managed in `src/app/store.svelte.ts` and
+`src/features/workspace/`. The changes mirror the cwd UI:
+
+1. **Computer management view** (new feature folder `src/features/computer/`):
+   list/create/edit/delete computers (host, port, user, auth method, secret ref).
+   A "Test connection" button hitting `POST /computers/:id/test`.
+2. **Per-conversation computer selector** — a `ComputerField.svelte` next to the
+   existing `CwdField.svelte` in the workspace sidebar. A dropdown of computers +
+   "Local (none)". Saves via `PUT /conversations/:id/computer`.
+3. **Workspace default computer** — in the workspace settings, a
+   `default-computer` selector (mirror the `default-cwd` control). Saves via
+   `PUT /workspaces/:id/default-computer`.
+4. **Connection status badge** — near the computer selector, showing the live
+   `state` from `GET /computers/:id/status` (connected/connecting/error/offline).
+   Poll or surface via the existing surface-registry mechanism.
+5. **Store** (`store.svelte.ts`) gains `computerId` reactive state +
+   `setComputer`/`refetchComputer` (parallel to `cwd`/`setCwd`).
+6. **`chat.send`** — the chat store's `send()` does not currently pass cwd per-
+   send (cwd is persisted, not per-message). `computerId` follows the same model:
+   persisted per-conversation, set via the sidebar, NOT per-message. So `chat.send`
+   needs no change for the MVP (computer is resolved server-side from the
+   persisted value). A per-send `computerId` override is a later option (the
+   contract supports it; the UI need not expose it initially).
+
+> **Transparency note for the FE:** the FE shows the computer to the *user* (so
+> they know where commands run), but the *agent* never sees it (not in the system
+> prompt, not in tool descriptions). The FE computer selector is a user-facing
+> control, not an agent-facing one.
+
+---
+
+## 11. New packages / units summary
+
+| New package | Tier | Owns | Depends on |
+|---|---|---|---|
+| `exec-backend` | core | `ExecBackend` contract, `LocalExecBackend`, `execBackendHandle` service, the resolver wiring | kernel |
+| `computer-store` | core | `Computer` entity CRUD, per-conv `computerId`, `getEffectiveComputer`, `computerStoreHandle` | conversation-store (reads workspaceId/defaultComputerId), wire |
+| `ssh` | standard | `SshConnectionPool`, `SshExecBackend`, ssh2 wiring, secret resolution, host-key verify | exec-backend, computer-store, kernel (SecretsAccess) |
+
+Modified units (contract changes, reported to owners — planner does NOT edit
+these directly per one-owner-per-unit):
+
+| Unit | Change |
+|---|---|
+| `kernel` (contracts) | `+ computerId` on `ToolExecuteContext` + `RunTurnInput` (additive optional) |
+| `kernel` (runtime dispatch) | thread `computerId` through `executeToolCall`/`createStepDispatcher` |
+| `wire` | `+ Computer`, `ComputerEntry`; `+ defaultComputerId` on `Workspace` |
+| `conversation-store` | `+ defaultComputerId` on `WorkspaceRow`/`Workspace`; `+ setWorkspaceDefaultComputerId` |
+| `tool-shell` | factory takes `resolveBackend`; `execute` uses `backend.spawn` |
+| `tool-read-file` | refactor to `backend.readFile/readdir/stat` |
+| `tool-write-file` | refactor to `backend.access/stat/writeFile` |
+| `tool-edit-file` | refactor to backend fs ops |
+| `session-orchestrator` | `+ computerId` on `StartTurnInput`/`TurnLifecyclePayload`; resolve `effectiveComputer`; thread into `RunTurnInput`; tools-filter drops `lsp`/`mcp` when remote |
+| `transport-contract` | `+ computerId` on `ChatRequest`/`ChatSendMessage`/`QueueRequest`; computer/workspace-computer response types |
+| `transport-http` | computer CRUD + per-conv/workspace-computer endpoints; thread `computerId` in `/chat` |
+| `transport-ws` | thread `computerId` in `handleChatSend`/`handleChatQueue` |
+| `host-bin` | wire `exec-backend` + `computer-store` + `ssh` extensions; inject `resolveBackend` into tool extensions |
+| `cache-warming` | thread `computerId` into warm tool assembly (cache-safe) |
+| frontend | computer management + selectors + status badge |
+
+---
+
+## 12. Implementation phases
+
+### Phase 0 — Contracts (no behavior change)
+- Add `computerId` to `ToolExecuteContext` + `RunTurnInput` (kernel contracts).
+- Add `Computer`/`ComputerEntry` + `Workspace.defaultComputerId` to `@dispatch/wire`.
+- Add `ExecBackend` contract + `execBackendHandle` in a new `exec-backend`
+  package; `LocalExecBackend` wraps today's node calls (behavior-identical).
+- Thread `computerId` through dispatch runtime (forwards `undefined` → no-op).
+- **Verify:** `bun run typecheck` + `bun run test` green, behavior unchanged.
+
+### Phase 1 — Refactor tools behind `ExecBackend` (still local-only)
+- `tool-shell`/`read-file`/`write-file`/`edit-file` factories take
+  `resolveBackend`; `LocalExecBackend` injected. Pure logic untouched.
+- `host-bin` wires the local resolver.
+- **Verify:** full test suite green; tools behave identically (this de-risks
+  the refactor before any SSH).
+
+### Phase 2 — Computer store + API (no SSH yet)
+- `computer-store` package: `Computer` CRUD + per-conv `computerId` +
+  `getEffectiveComputer` (returns `null` = local until a computer is set).
+- `conversation-store`: `defaultComputerId` field + setter.
+- transport-http/ws: computer endpoints + `computerId` on chat.
+- `session-orchestrator`: resolve + thread `computerId`.
+- **Verify:** can configure a computer per-conversation/workspace; with no `ssh`
+  extension loaded, a configured computer yields a clear "no SSH backend"
+  error (degraded) — local conversations unchanged.
+
+### Phase 3 — SSH execution
+- `ssh` package: `SshConnectionPool` + `SshExecBackend` (ssh2), secret
+  resolution, host-key verification, error mapping.
+- `exec-backend` resolver returns `SshExecBackend` for a `computerId`.
+- tools-filter drops `lsp`/`mcp` on remote turns.
+- **Verify:** integration test against a real (or dockerized) sshd — run_shell,
+  read_file, write_file, edit_file execute remotely; agent is unaware.
+
+### Phase 4 — Frontend
+- Computer management view, per-conv + workspace-default selectors, status badge.
+- Wire store + chat flow.
+
+### Phase 5 — Hardening
+- Connection drop/offline/timeout edge tests.
+- Host-key-pinning UX (first-connect trust prompt).
+- Idle reaping + keep-alive tuning.
+- Observability spans for remote ops.
+- Remote LSP/MCP (future — out of scope for initial feature).
+
+---
+
+## 13. Open questions / decisions for the user
+
+1. **ssh2 vs bun-ssh2** — verify `ssh2` runs under Bun early (Phase 3 kickoff).
+   If not, swap to `bun-ssh2` (identical API). (Requires adding the dependency
+   to `packages/ssh/package.json` — a project-local install, not system-wide.)
+2. **Host-key trust model** — on first connect, auto-trust-and-pin (record the
+   fingerprint), or require explicit user approval via the FE? Recommendation:
+   auto-trust-and-pin on first connect (like `StrictHostKeyChecking=accept-new`),
+   then verify on subsequent; surface changes loudly. Confirm with user.
+3. **`Computer` storage location** — confirm `computer-store` as a separate
+   extension (recommended) vs folding into `conversation-store`.
+4. **Remote LSP/MCP scope** — confirm these are out-of-scope for the initial
+   feature (degrade by dropping the tools), to be revisited later.
+5. **Per-send `computerId`** — confirm the MVP persists `computerId` per-
+   conversation (like cwd) rather than sending it on every `chat.send` (the
+   contract supports both; the UI uses persistence).
+
+---
+
+## 14. Glossary additions (proposed, for `GLOSSARY.md`)
+
+| Term | Meaning | Aliases to avoid |
+|---|---|---|
+| **computer** | A named SSH target (host+port+user+auth) a conversation can execute tools on. Stored as a `Computer` entity; referenced by `computerId`. `null`/absent = local execution (no SSH). | host (when meaning the SSH target — clashes with "host" the runtime), remote, machine |
+| **ExecBackend** | The transport-agnostic spawn+fs abstraction tools program against. Two implementations: `LocalExecBackend` (node) and `SshExecBackend` (ssh2). Resolved per-call from `ToolExecuteContext.computerId`. | backend, executor |
+| **computerId** | The identifier of the `Computer` a turn's tools execute on. Threaded like `cwd` (per-turn override → persisted per-conversation → workspace `defaultComputerId` → `null`/local). | hostId, machineId, remoteId |
+| **defaultComputerId** | A workspace's default computer, inherited by conversations with no per-conversation `computerId`. The computer analog of `defaultCwd`. | — |
author	Adam Malczewski <[email protected]>	2026-06-25 10:38:18 +0900
committer	Adam Malczewski <[email protected]>	2026-06-25 10:38:18 +0900
commit	a4c4b4a7ea6ead10ee969c5cc474768e29fc2558 (patch)
tree	edad9b824ea6ea6a120f97683301e0cb2eaff27e /notes
parent	8a74335c21a57ee63c9a0658754430d6520dbc87 (diff)
download	dispatch-a4c4b4a7ea6ead10ee969c5cc474768e29fc2558.tar.gz dispatch-a4c4b4a7ea6ead10ee969c5cc474768e29fc2558.zip