SparkVM

SparkVM is a Firecracker microVM runner for Dockerfile rollouts usefull for agents long running task and inspired by composer-2 Async RL.

The goal of SparkVM is simple: Run thousands of agent rollouts efficiently on your own machine without needing a large Kubernetes cluster.

SparkVM scales better than Kubernetes for local, single host agent rollout execution because it avoids cluster level orchestration overhead and directly schedules Firecracker workers based on host capacity.

Quick Start

# 1) Prepare host once
sparkvm setup

# 2) Create a rollout
sparkvm rollout create --name my-agent --dockerfile Dockerfile

# 3) Run it
sparkvm workers run <rollout-id>

Python Quick Example:

from sparkvm import Rollouts, SparkVM

rollout = Rollouts().create(
    name="my-agent",
    runtime="Dockerfile",
    dockerfile="Dockerfile",
    deleteOnSuccess=False,
)

vm = SparkVM(vcpu=2, memory="2G", disk="4G", timeout=60.0, network=True, env={})
result = vm.run(rollout.id)
print(result.status, result.exit_code, result.passed)

Why SparkVM?

SparkVM allocates and manages agent rollouts efficiently by assigning available system resources to each microVM based on the host machine.

This means you can freely run agent rollouts without hesitation.

You do not need a big Kubernetes cluster for triggering thousands of rollouts anymore. SparkVM will do that for you, just deploy it on your machine. SparkVM will track, manage, and run the VMs efficiently.

How SparkVM Works ?

SparkVM runs workloads inside lightweight Firecracker microVMs, each rollout can be isolated, tracked, paused, restored, and managed based on the available resources of the host machine and designed to make large-scale agent rollouts simpler deployments.

What SparkVM Can Do ?

Container-based deployment
Run Dockerfile-based rollouts inside Firecracker microVMs
Allocate host resources efficiently across microVMs
Store snapshots
Restore a VM from where it left off
Manage long-running agent tasks
Control what your agent can access through network egress policies
Track and manage thousands of rollouts from one machine

SparkVM supports both SDK and CLI usage, you can use the SDK to integrate SparkVM into your own agent systems, rollout pipelines, or automation tools.

You can also use the CLI to trigger and manage rollouts directly from your terminal example Use Cases

Agent rollout execution
Async RL workloads
Long-running task isolation
Dockerfile-based experiments
MicroVM sandboxing
Snapshot and restore workflows
Controlled network access for agents

Setup SparkVM

Use this when you are preparing a machine for SparkVM for the first time:

sparkvm setup

What sparkvm setup does:

Creates SparkVM directories under your home (~/.sparkvm by default): bin, images, rollouts, workers, scheduler, cache.
Validates host requirements: Linux host, supported arch (x86_64 or aarch64), and required setup tools.
Installs the managed Firecracker binary into ~/.sparkvm/bin/firecracker when needed.
Creates ~/.sparkvm/bin/kvm symlink pointing to /dev/kvm.
Downloads the managed kernel image to ~/.sparkvm/images/vmlinux when needed.
Prepares SparkVM-managed CNI paths under ~/.sparkvm/cni/{bin,conf} and writes sparkvm.conflist.
Auto-installs required CNI binaries into ~/.sparkvm/cni/bin when possible:
- ptp, host-local, firewall from official CNI plugin releases
- cnitool from CNI release archive (with Go build fallback)
- tc-redirect-tap via go install when needed
Initializes the SQLite DB and default machine policy.
Migrates old rollout metadata into SQLite when legacy data exists.

If you run it again:

It is mostly safe and idempotent.
Existing managed assets are reused.
Use --force to reinstall/re-download managed assets.

Useful setup flags:

sparkvm setup --force
sparkvm setup --owner <user> (requires root, then chowns SparkVM home recursively)

To wipe everything and start fresh:

sparkvm reset

What sparkvm reset does:

Prompts for confirmation unless --force is provided.
Unmounts mounted paths under worker folders first.
Deletes everything inside SparkVM home (~/.sparkvm by default), including DB state, rollouts, workers, images, binaries, kernel, logs, and cache.
Recreates only an empty SparkVM home directory.

Canonical Python API

from sparkvm import Rollouts, SparkVM, SparkScheduler, MachineConfig

rollout = Rollouts().create(
    name="my-agent",
    runtime="Dockerfile",
    dockerfile="Dockerfile",
    deleteOnSuccess=False,
)

# Option A: run immediately (single rollout execution)
vm = SparkVM(vcpu=2, memory="2G", disk="4G", timeout=60.0, network=True, env={})
result = vm.run(rollout.id)
print(result.status, result.exit_code, result.passed)

# Option B: scheduler-managed queue execution
MachineConfig.set_policy(poll_interval=2.0)
scheduler = SparkScheduler()
summary = scheduler.tick()  # one scheduling cycle
print(summary["tick_id"], summary["spawned"])

CLI Usage (All Available Args)

# Global option (available on every command)
sparkvm [--home-dir <path>] <command> ...

# Setup / diagnostics
sparkvm setup [--force] [--owner <user>]
sparkvm doctor
sparkvm start
sparkvm cleanup {rollouts|workers|all} [--force]
sparkvm reset [--force]

# Rollouts
sparkvm rollout create \
  --name <name> \
  [--dockerfile Dockerfile] \
  [--delete-on-success] \
  [--vcpu 2] \
  [--memory 2G] \
  [--disk 4G] \
  [--timeout 60.0] \
  [--network | --no-network] \
  [--env KEY=VALUE --env KEY2=VALUE2]
sparkvm rollout list
sparkvm rollout view <rollout-id>
sparkvm rollout <rollout-id>   # alias for: sparkvm rollout view <rollout-id>

# Workers
sparkvm workers run <rollout-id> \
  [--vcpu 2] \
  [--memory 2G] \
  [--disk 4G] \
  [--timeout 60.0] \
  [--network | --no-network] \
  [--env KEY=VALUE --env KEY2=VALUE2]
sparkvm workers list
sparkvm workers view <worker-id> \
  [--tail <n>] [--live] [--result] [--failure] [--results] [--path]

Name		Name	Last commit message	Last commit date
Latest commit History 46 Commits
.tmp-sparkvm-home		.tmp-sparkvm-home
.vscode		.vscode
assets		assets
examples		examples
prompts		prompts
scripts		scripts
src		src
.gitignore		.gitignore
AGENT.md		AGENT.md
README.md		README.md
design.md		design.md
kernel.md		kernel.md
prompts.txt		prompts.txt
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SparkVM

Quick Start

Why SparkVM?

How SparkVM Works ?

What SparkVM Can Do ?

Setup SparkVM

Canonical Python API

CLI Usage (All Available Args)

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

SparkVM

Quick Start

Why SparkVM?

How SparkVM Works ?

What SparkVM Can Do ?

Setup SparkVM

Canonical Python API

CLI Usage (All Available Args)

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages