Releases: defilantech/LLMKube
v0.8.1
0.8.1 (2026-06-01)
⚠ BREAKING CHANGES
- foreman: Agent.spec.requestTimeoutSeconds changes meaning from a per-request HTTP timeout to a loop-wide wall-clock budget, and its default moves from 600 to 3600. The former per-request bound is now the new Agent.spec.requestTurnTimeoutSeconds (default 120). Re-apply your Agent CRs after upgrade so existing Agents pick up explicit values.
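The split between the two fields can be read as a simple rule: the loop has one wall-clock budget, and each turn gets its own shorter bound. The following is a minimal sketch of that rule, not Foreman's actual implementation. The `AgentTimeouts` class and `next_turn_timeout` helper are hypothetical names, and clamping the per-turn bound to the remaining budget is an assumption that the release notes do not confirm.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class AgentTimeouts:
    # Hypothetical container; field names mirror the CR fields and the
    # defaults are the 0.8.1 values quoted in the breaking-change note.
    request_timeout_seconds: float = 3600.0      # loop-wide wall-clock budget
    request_turn_timeout_seconds: float = 120.0  # bound on a single turn


def next_turn_timeout(t: AgentTimeouts, elapsed_seconds: float) -> Optional[float]:
    """Seconds the next turn may take, or None once the loop budget is spent.

    Assumption: a turn is never allowed to outlive the loop-wide budget, so
    the per-turn bound is clamped to whatever budget remains.
    """
    remaining = t.request_timeout_seconds - elapsed_seconds
    if remaining <= 0:
        return None
    return min(t.request_turn_timeout_seconds, remaining)


if __name__ == "__main__":
    t = AgentTimeouts()
    print(next_turn_timeout(t, 0))     # 120.0
    print(next_turn_timeout(t, 3550))  # 50.0
    print(next_turn_timeout(t, 3600))  # None
```

Under this reading, an Agent CR that still carries an explicit requestTimeoutSeconds of 600 from before the upgrade would cap the whole loop at ten minutes rather than each request, which is why re-applying CRs with explicit values matters.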
Bug Fixes
- foreman: recover orphaned phase=Running tasks on agent restart (#542) (#598) (6dd2c44)
- foreman: split per-turn timeout from loop-wide budget (#532) (#602) (41e7663)
- foreman: warm-path reviewer scheduling on macOS (#578, #579) (#597) (a94d1ef)
- metal-agent: prefer routable interface for host-IP auto-detect (#526) (#599) (c780795)
Documentation
- foreman: absolute paths in overview README cross-refs (fix llmkube-web prerender) (#596) (b5f6f94)
- foreman: move docs/foreman to docs/site/foreman + register in site nav (#594) (9fd85bb)
llmkube-0.8.1
A Helm chart for LLMKube - Kubernetes operator for GPU-accelerated LLM inference
foreman-0.8.1
Foreman is an opt-in add-on for LLMKube that schedules agentic workloads (Workload, AgenticTask) across a fleet of nodes (FleetNode). Installing LLMKube alone does not install or require Foreman. Foreman is a SIBLING chart to llmkube, not a subchart: install llmkube first (helm install llmkube defilantech/llmkube), then install foreman alongside it. They share no Helm relationship at packaging or install time; the only coupling is that the foreman-operator's RBAC reads inference.llmkube.dev CRDs that llmkube installs.
v0.8.0
0.8.0 (2026-05-28)
Features
- foreman/api: structured AgenticTaskFailureReason taxonomy (#565) (6e72e85)
- foreman/loop: observation masking for context-window management (#563) (d17c3e0)
- foreman/loop: stuck-loop detector with nudge-then-force protocol (#544) (#569) (2172ece)
- foreman/reviewer: fetch_issue tool replaces gh issue view subshell (#581) (0253e43)
- foreman/tools: distinguish whitelist-excluded from unknown tool calls (#564) (089e9ca)
- foreman/v0.2: hybrid cloud reviewer Agent + sovereignty toggles (#553) (65a7cb8)
- foreman/v0.2: WorkloadSpec.reviewerAgentRefs (plural) + third pipeline stage (#551) (831ae8c)
- foreman: add repo-map localization for coder Agents (#560) (#566) (f6bf8c0)
- foreman: AgenticTask branches include workload name (#573) (#574) (2986906)
- foreman: executor fetches GitHub issue body when payload prompt is empty (#571) (#572) (2b5bd31)
- foreman: post-M4 stability follow-ups for v5-batch readiness (#535) (a841612)
- foreman: v0.4 reviewer agent — tool-using reviewer with sharpened prompt + structured findings (#575) (#576) (06091a9)
- foreman: workspace-scoped bash + WORKSPACE_ROOT contract (#567) (#568) (061eb41)
- gpu: add Intel GPU (oneAPI/SYCL) support across controller, CLI, docs, and e2e (#557) (741ef5d)
- metal-agent: InferenceService name allowlist for multi-Mac fleets (#555) (67361f3)
Bug Fixes
- foreman/build: include gate_job_template.yaml in Docker context (#554) (def535d)
- foreman/executor: resolve InferenceService port from live Endpoints, not stale install-time override (#550) (4351608)
- foreman/executor: route reviewer-role GO through modelDecidedResult (#545) (16943a5)
- foreman/loop: force-terminate returns clean Terminal envelope (#544 follow-up) (#570) (19500c9)
- foreman/oai: always emit content on non-assistant messages (#562) (f6dc8e1)
- foreman/reviewer: ground-truth filesTouched + bump qwen maxTurns + tighten confabulation defenses (#584) (b3e21f0)
- foreman/reviewer: ground-truth issueAsk + cap qwen Section D (#587) (b66006c)
- foreman/reviewer: role-aware stuck-loop detector + non-empty reviewer user prompt (rerun-7 follow-up) (#577) (19590f6)
- foreman/tools: cmd.WaitDelay + process-group kill so BashTool can't deadlock on grandchild-held pipes (#547) (c12f6f8)
- foreman: cascade + Workload rollup gate on phase AND verdict, not phase alone (#548) (1b72a7c)
Documentation
- foreman: v0.8.0 release-prep docs + README Foreman section (#591) (5e41df1)
- mention make lint-all in AGENTS.md and CONTRIBUTING.md (#588) (39da983)
llmkube-0.8.0
foreman-0.8.0
v0.7.12
llmkube-0.7.12
foreman-0.7.12