Show HN: Baton – A desktop app for developing with AI agents
TL;DR Highlight
A desktop app that lets you run multiple AI coding agents (Claude Code, Gemini CLI, etc.) simultaneously in separate git worktrees and monitor them all in one place — ideal for developers who want to split work by feature and develop in parallel.
Who Should Read
Developers who want to run multiple AI coding agents like Claude Code or Codex CLI simultaneously and manage the progress of each task from a single interface. Especially suited for those who want to develop multiple features in parallel without branch conflicts.
Core Mechanics
- Baton is a desktop app for running and managing multiple AI coding agents simultaneously (supports all CLI-based agents including Claude Code, Codex CLI, OpenCode, and Gemini CLI), available as a free download for Mac, Windows, and Linux.
- Each task (workspace) is fully isolated via git worktree (a git feature that maintains multiple independent working directories within a single repository), so agents never interfere with or conflict with each other — each works on its own branch without needing to switch branches or use stash.
- The dashboard displays each agent's status with badges: a blue 'Input' badge when waiting for input, a green 'Done' badge when the task is complete, and a red 'Error' badge when an error occurs — no need to check each tab individually. Best support is provided for Claude Code.
- When starting a task, you describe what you want to build and the AI automatically generates a branch name, workspace title, and description. Enabling 'Accept Edits' mode lets the agent start working immediately without permission prompts.
- A built-in diff viewer based on Monaco editor (the code editor component used in VS Code) lets you review agent-made changes file by file before opening a PR, with the ability to roll back individual files. A 'Live follow mode' is also supported for tracking changes in real time while the agent is working.
- A built-in MCP (Model Context Protocol, the standard protocol for AI agents to call external tools) server allows agents to directly create new workspaces or launch parallel tasks during a conversation.
- Additional code review utilities are built in, including fuzzy file search and full-text content search powered by fzf and ripgrep, git blame, and per-file commit history. Frequently used shell commands or agent prompts can be saved as 'Actions' for reuse.
Evidence
- "There were criticisms that Baton's differentiators weren't clear given the large number of similar open-source agent managers emerging, with tools like Conductor, superset.sh, t3.codes, and cmux mentioned as alternatives — one commenter even noted that Claude Desktop itself has supported worktree-based parallel agents for over a month. There was also criticism that these agent managers are essentially rebuilding IDEs, with the argument that improving VS Code would be more practical since it already runs as a web app in containers, supports workspaces, and has an extension ecosystem (visualJJ, a worktree/workspace manager, was also mentioned). Practical questions arose about the cost of running multiple Claude Code agents simultaneously, with comments asking whether users were expensing it to their company — indicating that cost is a significant real-world barrier. More fundamental questions were raised about what people are actually building with agents, worktrees, and harnesses. Commenters shared that most use cases stay at the level of generating boilerplate components for frameworks like React or Laravel, or small personal apps, with one person describing using agents to remove dead code from large codebases as a time-saving task. There was also UX feedback about the site's design — one commenter said they gave up reading within 30 seconds due to a TV-noise background effect and flickering thin blue lines — and separately, someone shared a similar terminal-based tool they had built and published on GitHub (agent-storm)."
How to Apply
- "If you need to develop multiple features simultaneously with Claude Code, install Baton and create a workspace per feature — each agent works on its own independent git branch, enabling parallel development without conflicts, and you can review changes with the diff viewer and open a PR when done. If you find yourself constantly switching terminal tabs to check whether an agent has finished, use Baton's status badges and dock notifications — you'll be alerted the moment an agent reaches a completed, error, or input-waiting state, so you can check back while doing other work. If you have frequently used agent run options (e.g., flags like --dangerously-skip-permissions) or project initialization commands, save them with Custom Agent Presets and Workspace Setup so you don't have to re-enter them every time you create a new workspace."
Code Example
# Installing via AppImage on Linux
sudo apt install fuse libfuse2 # Debian/Ubuntu
sudo dnf install fuse fuse-libs # Fedora
chmod +x baton-*.AppImage
./baton-*.AppImage
# Verifying download integrity
# macOS
shasum -a 256 [file]
# Linux
sha256sum [file]
# Windows (PowerShell)
Get-FileHash [file] -Algorithm SHA256Terminology
Related Papers
Show HN: adamsreview – better multi-agent PR reviews for Claude Code
Claude Code에서 최대 7개의 병렬 서브 에이전트가 각각 다른 관점으로 PR을 리뷰하고, 자동 수정까지 해주는 오픈소스 플러그인이다. 기존 /review나 CodeRabbit보다 실제 버그를 더 많이 잡는다고 주장하지만 커뮤니티에서는 복잡도와 실효성에 대한 회의론도 나왔다.
How Fast Does Claude, Acting as a User Space IP Stack, Respond to Pings?
Claude Code에게 IP 패킷을 직접 파싱하고 ICMP echo reply를 구성하도록 시켜서 실제로 ping에 응답하게 만든 실험으로, 'Markdown이 곧 코드이고 LLM이 프로세서'라는 아이디어를 네트워크 스택 수준까지 밀어붙인 재미있는 사례다.
Show HN: Git for AI Agents
AI 코딩 에이전트(Claude Code 등)가 수행한 모든 툴 호출을 자동으로 추적하고, 어떤 프롬프트가 어느 코드 줄을 작성했는지 blame까지 가능한 버전 관리 도구다.
Principles for agent-native CLIs
AI 에이전트가 CLI 도구를 더 잘 사용할 수 있도록 설계하는 원칙들을 정리한 글로, 에이전트가 CLI를 도구로 활용하는 빈도가 높아지면서 이 설계 방식이 실용적으로 중요해지고 있다.
Agent-harness-kit scaffolding for multi-agent workflows (MCP, provider-agnostic)
여러 AI 에이전트가 서로 역할을 나눠 협업할 수 있도록 조율하는 scaffolding 도구로, Vite처럼 설정 없이 빠르게 멀티 에이전트 파이프라인을 구성할 수 있다.
Show HN: Tilde.run – Agent sandbox with a transactional, versioned filesystem
AI 에이전트가 실제 프로덕션 데이터를 건드려도 롤백할 수 있는 격리된 샌드박스 환경을 제공하는 도구로, GitHub/S3/Google Drive를 하나의 버전 관리 파일시스템으로 묶어준다.