Troubleshooting

Last updated: 2026-06-27

When something isn’t working, start with microagent doctor. It checks the host backend, virtualization support, the supervisor binary, the default kernel, and console support, and tells you where the gap is. Most of the entries below are conditions doctor will flag.

Each symptom type has a tool that answers it fastest: host problems (missing KVM, binaries, permissions) are doctor’s job; boot problems and anything the guest printed are in logs. Questions about what state a workspace is in belong to status, and when you need the history of how it got there, read events.

This page is indexed by symptom - search for whatever you’re seeing.

Host setup

`microagent doctor` reports KVM unavailable on Linux

The host doesn’t have /dev/kvm, or the current process can’t see it. KVM is required for Firecracker; without it, the workspace can’t boot.

Common causes and fixes:

No virtualization support. You’re on a host without hardware virtualization (some VMs, some cloud instances). Move to a host with KVM-capable virtualization (most bare metal, most VPS providers, nested virtualization on supported hypervisors).
kvm kernel module not loaded. lsmod | grep kvm. Load with sudo modprobe kvm (plus kvm_intel or kvm_amd for your CPU vendor).
User not in kvm group. ls -l /dev/kvm should show group kvm. Add your user with sudo usermod -aG kvm $USER and log back in.
Sandboxed shell masking the device. Some agent environments and container shells hide /dev/kvm from the process. Run microagent directly on the host, outside any sandboxed wrapper.

`microagent doctor` reports KVM available, but `start` fails with permission denied

Your user can see /dev/kvm but can’t open it. Same kvm group fix as above. Verify with cat /dev/kvm - if you get “Operation not permitted” it’s a permissions problem; if you get “Invalid argument” you’re fine (kernel just rejects raw reads).

`firecracker` binary not found (Linux)

Homebrew and make install install the pinned upstream Firecracker VMM under the microagent prefix as libexec/firecracker. If doctor cannot find it, reinstall microagent or install Firecracker from its releases and put it on PATH. Alternatively:

Install under <prefix>/libexec/firecracker (Homebrew puts it there automatically).
Set MICROAGENT_FIRECRACKER=/path/to/firecracker in your environment.

`mke2fs` not found (rootfs builds fail)

The rootfs builder needs mke2fs to format ext4 disks. On Linux it’s usually installed (e2fsprogs); on macOS it isn’t shipped by default.

# macOS
brew install e2fsprogs
# Pass the path explicitly the first time:
microagent rootfs build --mke2fs /opt/homebrew/opt/e2fsprogs/sbin/mke2fs ...

Any microagent rootfs build or microagent run that uses non-default sizing accepts --mke2fs <path>.

Default kernel not installed

First microagent run installs the default kernel automatically. If you’d rather do it explicitly:

microagent kernel install

For a custom kernel or air-gapped install, see microagent kernel.

`microagent kernel verify` reports a SHA mismatch

The kernel file on disk doesn’t match its expected SHA-256. Either it’s corrupted or someone replaced it. Don’t ignore this - booting from a kernel you don’t trust is the textbook supply-chain failure.

microagent kernel install   # reinstall from the trusted source
microagent kernel verify --path ~/.microagent/kernels/<backend>/<arch>/Image \
                         --sha256 <expected>

If install doesn’t produce the expected SHA either, the source you’re pulling from is wrong. Reinstall from a trusted kernel URL and pass the expected --sha256 explicitly.

`microagent doctor --backend windows-hyperv` reports HCS access denied

Windows Hyper-V uses Windows Host Compute Service directly. The current user must be able to create and manage HCS compute systems.

Common fixes:

Run from an elevated shell.
Add the user to the local Hyper-V Administrators group, then sign out and back in so the new token has the group.
Confirm Windows features for Hyper-V and Windows Hypervisor Platform are enabled.

`microagent doctor --backend windows-hyperv` reports HCN/HNS unavailable

The Windows networking service used by Hyper-V is not reachable. Start with the Windows services for Host Network Service and Host Compute Service, then rerun:

microagent --json doctor --backend windows-hyperv

Published TCP networking, mediation, and guest-to-host listener targets use Hyper-V sockets. If HCN/HNS is unavailable, Windows Hyper-V fails closed before attaching a network endpoint.

Workspace lifecycle

`microagent delete` refuses while the VM is running

The recorded VM process is still alive. By design, delete won’t tear down a workspace whose VM hasn’t been stopped - it’d orphan the process.

microagent halt <name>     # clean disk-preserving stop
# or
microagent stop <name>     # graceful shutdown request
microagent kill <name>     # hard termination if stop doesn't return
microagent delete <name>

`microagent start` fails because the workspace is `quarantined`

Quarantined workspaces preserve disk and event history while host-side network and mediation paths are severed. They can’t be started directly - that’s the whole point of the state.

microagent halt <name>     # or stop / kill
microagent start <name>    # boots the preserved disk back up

See glossary for the full lifecycle vocabulary.

Workspace boots but the entrypoint exits immediately

Look at the serial log:

microagent logs <name>

Usual causes:

files: source missing. Check the microagent.yaml files: paths - they’re resolved relative to the spec file’s directory. A typo or wrong relative path means the file never lands in the rootfs.
Entrypoint references a path that wasn’t created. setup: runs first; mkdir -p your target directories there if they don’t exist in the base image.
Missing dependency. If setup: did a pip install and a dependency is wrong, the entrypoint sees ImportError. The serial log shows the Python traceback.
Wrong shebang or interpreter. A shell script entrypoint without #!/bin/sh and without an explicit bash/sh wrapper won’t execute.

`microagent --json status` shows `verification.divergence`

Recorded hashes don’t match what’s currently on disk. Treat kernel or injected-init divergence as suspicious until you understand it. Rootfs hashes are enforced while the workspace is still prepared; after a workspace has started, the rootfs is the writable VM disk and normal guest boot activity can change it.

microagent --json status <name> | jq '.verification.divergence'

The field and expected / actual values tell you which artifact diverged. Common case: someone reinstalled the kernel without re-creating the workspace, so the workspace still references the old kernel hash.

For repeatable deployments, prefer digest-pinned image refs such as docker.io/library/python@sha256:..., install kernels with an explicit --sha256, and check .verification.ok before start.

Networking

Firecracker `user` mode workspace won’t start

user mode needs three things:

pasta on PATH (from the passt package - apt install passt on Debian/Ubuntu, dnf install passt on Fedora; Homebrew installs it as a microagent dependency).
Unprivileged user namespaces enabled. Check sysctl user.max_user_namespaces (returns a non-zero count when enabled). Some distros also gate this via kernel.unprivileged_userns_clone - set both to 1 if either is 0. On Ubuntu 23.10+ (including stock Ubuntu 24.04 and GitHub-hosted runners), AppArmor additionally blocks unprivileged user namespace creation even when those sysctls look permissive - pasta fails with Couldn't write to /proc/self/uid_map: Operation not permitted. Set sysctl kernel.apparmor_restrict_unprivileged_userns=0 or install an AppArmor profile that grants the microagent binaries userns creation.
/dev/net/tun readable by the calling user.

microagent doctor reports each of these - start there to find the missing piece.

If your host doesn’t allow unprivileged user namespaces and you can’t change that policy, use --network isolated when the guest does not need network access.

Use microagent --json network <name> to inspect the runtime IP, subnet, gateway, DNS, and route that were assigned to the guest.

Required mediation channel fails closed

error: mediation channel required but unreachable

The workspace declared a mediation channel as required (the default) but the host listener isn’t reachable. The workspace starts, but readiness reports a mediationReady error and the channel is severed until a listener connects - no traffic can flow until then.

Fixes:

Stand up the mediation listener at the declared host:port before microagent start.
For development only, pass --mediation-optional on microagent create to allow startup without the channel. Don’t ship this - the channel is fail-closed for a reason (security).

Egress mediation

A `guarded` or `strict` workspace fails to start with a TPROXY error

egress: UDP mediation (TPROXY) unavailable for workspace research — load the TPROXY kernel modules or use --egress off

Egress mediation runs inside the workspace’s own user namespace and mediates UDP and DNS via Linux TPROXY, which needs kernel modules (nft_tproxy, nf_tproxy_ipv4, xt_socket, nf_socket_ipv4) that a rootless workspace can’t load itself. When they’re missing, a guarded or strict workspace fails closed - it refuses to start rather than run with an unmediated UDP/DNS channel.

Fixes:

Load the kernel modules once, as root: sudo modprobe nft_tproxy nf_tproxy_ipv4 xt_socket nf_socket_ipv4. Once the modules are present the workspace’s netns can install its own TPROXY rules. microagent doctor reports whether they’re in place.
Drop mediation if you genuinely don’t want it: --egress off.

This is the intended fail-closed behavior - an enforcement gap never silently widens what the agent can reach. See doctor.

An allowed host’s TLS connection fails

A destination you allowlisted (--egress-allow) still fails its TLS handshake, typically surfacing as a client-side certificate error in the guest, or an egress_mitm_handshake_error / egress_mitm_upstream_error record in microagent egress <name>.

Cause: under the default guarded mode (and for any allowed host under strict), microagent MITMs the TLS with a per-workspace CA. Some clients reject the injected CA’s leaf certificate:

Certificate pinning - the client only trusts a specific certificate or key, not the per-workspace CA.
Mutual TLS - the upstream demands a client certificate the mediator can’t present.
A client with its own root store - it ignores the CA microagent installed in the system trust store, so the mediator’s leaf is untrusted.

Fix: mark the host passthrough so it is allowed and audited but not intercepted - the original server certificate reaches the client untouched:

microagent create research --egress strict \
  --egress-allow api.openai.com \
  --egress-passthrough pinned.example.com

The trade-off: a passthrough connection is forwarded as an opaque L4 byte stream, so microagent records that the connection happened (and how much data crossed it) but cannot inspect the payload. You’re trading content visibility for compatibility. See allow vs passthrough for the full discussion and microagent egress for the audit records.

Console

`microagent connect` hangs waiting for a shell prompt

By default, connect waits for the guest shell to be ready before attaching or sending input. If the guest hasn’t reached its shell (still booting, entrypoint hasn’t returned to a shell, or never will), it waits.

Fixes:

Look at the serial log to see what the guest is doing: microagent logs <name>.
Disable the wait if you know the workspace doesn’t expose an interactive shell: microagent connect <name> --ready-timeout 0. You’ll see raw serial output without the readiness gate.
Check workspace state with microagent --json status <name> - if it’s failed, the guest never reached a usable state and connect won’t help.

`microagent connect` says “console input is unavailable”

The backend’s console capability is fine but this specific workspace doesn’t have a console input endpoint yet. Common causes:

The workspace is prepared but not running. Start it first.
The backend created the runtime files but the serial input FIFO hasn’t appeared yet (race window during start). Wait a moment and retry.

Image and rootfs

`microagent rootfs build` rejects a mutable tag

error: image reference is mutable, pass --allow-mutable to override

rootfs build defaults to refusing tag references (e.g. ubuntu:24.04) because they’re not reproducible - the same tag can resolve to different content tomorrow. Two paths:

Pin by digest (recommended for production): microagent rootfs build --image docker.io/library/ubuntu@sha256:...
Override (development only): pass --allow-mutable.

microagent create and microagent run are looser - they accept tags by default and record the resolved digest in the workspace’s verification record. See security for the trust-boundary discussion.

`microagent image pull` is slow or fails

Slow: the OCI registry is slow, or the layers are large. Look at the registry/network rather than microagent.
Disk space: pulls land under ~/.microagent/images/. Check disk space; prune old records with microagent image prune --delete.

Still stuck?

Run microagent --json doctor for the full host capability report.
Run microagent --json status <name> for the workspace state plus verification details.
Check microagent logs <name> for serial output - most guest-side issues appear there.
File an issue at github.com/geoffbelknap/microagent/issues with the doctor output and the failing command.

Troubleshooting

Host setup

microagent doctor reports KVM unavailable on Linux

microagent doctor reports KVM available, but start fails with permission denied

firecracker binary not found (Linux)

mke2fs not found (rootfs builds fail)

Default kernel not installed

microagent kernel verify reports a SHA mismatch

microagent doctor --backend windows-hyperv reports HCS access denied

microagent doctor --backend windows-hyperv reports HCN/HNS unavailable

Workspace lifecycle

microagent delete refuses while the VM is running

microagent start fails because the workspace is quarantined

Workspace boots but the entrypoint exits immediately

microagent --json status shows verification.divergence

Networking

Firecracker user mode workspace won’t start

Required mediation channel fails closed

Egress mediation

A guarded or strict workspace fails to start with a TPROXY error

An allowed host’s TLS connection fails

Console

microagent connect hangs waiting for a shell prompt

microagent connect says “console input is unavailable”

Image and rootfs

microagent rootfs build rejects a mutable tag

microagent image pull is slow or fails

Still stuck?

`microagent doctor` reports KVM unavailable on Linux

`microagent doctor` reports KVM available, but `start` fails with permission denied

`firecracker` binary not found (Linux)

`mke2fs` not found (rootfs builds fail)

`microagent kernel verify` reports a SHA mismatch

`microagent doctor --backend windows-hyperv` reports HCS access denied

`microagent doctor --backend windows-hyperv` reports HCN/HNS unavailable

`microagent delete` refuses while the VM is running

`microagent start` fails because the workspace is `quarantined`

`microagent --json status` shows `verification.divergence`

Firecracker `user` mode workspace won’t start

A `guarded` or `strict` workspace fails to start with a TPROXY error

`microagent connect` hangs waiting for a shell prompt

`microagent connect` says “console input is unavailable”

`microagent rootfs build` rejects a mutable tag

`microagent image pull` is slow or fails