Releases: defilantech/LLMKube

v0.8.1

01 Jun 08:14
1c4ce2f

0.8.1 (2026-06-01)

⚠ BREAKING CHANGES

  • foreman: Agent.spec.requestTimeoutSeconds now means a loop-wide wall-clock budget instead of a per-request HTTP timeout, and its default rises from 600 to 3600. The former per-request bound moves to the new field Agent.spec.requestTurnTimeoutSeconds (default 120). Re-apply your Agent CRs after upgrading so that existing Agents pick up explicit values.
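
To make the split between the two fields concrete, here is a minimal sketch of how a loop-wide budget and a per-turn bound could interact. It is not Foreman's implementation: the `AgentTimeouts` class and the `turn_timeout` helper are hypothetical names, and clamping each turn to the remaining budget is an assumption that the release note does not confirm. Only the two field names and their defaults (3600 and 120) come from the note above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AgentTimeouts:
    """Hypothetical mirror of the two Agent.spec fields, with the 0.8.1 defaults."""

    request_timeout_seconds: float = 3600.0      # loop-wide wall-clock budget
    request_turn_timeout_seconds: float = 120.0  # bound on a single request (turn)


def turn_timeout(timeouts: AgentTimeouts, elapsed_seconds: float) -> float:
    """Return the timeout for the next turn, or 0.0 once the budget is spent.

    ASSUMPTION: clamping the per-turn bound to the remaining budget is
    illustrative only; the release note does not describe how the two combine.
    """
    remaining = timeouts.request_timeout_seconds - elapsed_seconds
    if remaining <= 0:
        return 0.0
    return min(timeouts.request_turn_timeout_seconds, remaining)


if __name__ == "__main__":
    defaults = AgentTimeouts()
    for elapsed in (0, 3500, 3590, 3600):
        print(f"elapsed={elapsed}s -> next turn timeout={turn_timeout(defaults, elapsed)}s")
```

Under this reading, an Agent that previously set requestTimeoutSeconds: 600 expecting a per-request limit would, after the upgrade, instead get a 600-second cap on the whole loop. That is why the note recommends re-applying CRs with explicit values.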

Features

  • inferenceservice: typed spec.ropeScaling for RoPE/YaRN context extension (#507) (#600) (a554aee)

Bug Fixes

  • foreman: recover orphaned phase=Running tasks on agent restart (#542) (#598) (6dd2c44)
  • foreman: split per-turn timeout from loop-wide budget (#532) (#602) (41e7663)
  • foreman: warm-path reviewer scheduling on macOS (#578, #579) (#597) (a94d1ef)
  • metal-agent: prefer routable interface for host-IP auto-detect (#526) (#599) (c780795)

Documentation

  • foreman: absolute paths in overview README cross-refs (fix llmkube-web prerender) (#596) (b5f6f94)
  • foreman: move docs/foreman to docs/site/foreman + register in site nav (#594) (9fd85bb)

Miscellaneous

  • pin next release to 0.8.1 (Release-As) (#605) (a876cc6)

llmkube-0.8.1

01 Jun 08:15
1c4ce2f

A Helm chart for LLMKube - Kubernetes operator for GPU-accelerated LLM inference

foreman-0.8.1

01 Jun 08:15
1c4ce2f

Foreman is an opt-in add-on for LLMKube that schedules agentic workloads (Workload, AgenticTask) across a fleet of nodes (FleetNode). Installing LLMKube alone does not install or require Foreman. Foreman is a SIBLING chart to llmkube, not a subchart: install llmkube first (helm install llmkube defilantech/llmkube), then install foreman alongside it. They share no Helm relationship at packaging or install time; the only coupling is that the foreman-operator's RBAC reads inference.llmkube.dev CRDs that llmkube installs.

v0.8.0

28 May 08:41
ea9e0af

0.8.0 (2026-05-28)

Features

  • foreman/api: structured AgenticTaskFailureReason taxonomy (#565) (6e72e85)
  • foreman/loop: observation masking for context-window management (#563) (d17c3e0)
  • foreman/loop: stuck-loop detector with nudge-then-force protocol (#544) (#569) (2172ece)
  • foreman/reviewer: fetch_issue tool replaces gh issue view subshell (#581) (0253e43)
  • foreman/tools: distinguish whitelist-excluded from unknown tool calls (#564) (089e9ca)
  • foreman/v0.2: hybrid cloud reviewer Agent + sovereignty toggles (#553) (65a7cb8)
  • foreman/v0.2: WorkloadSpec.reviewerAgentRefs (plural) + third pipeline stage (#551) (831ae8c)
  • foreman: add repo-map localization for coder Agents (#560) (#566) (f6bf8c0)
  • foreman: AgenticTask branches include workload name (#573) (#574) (2986906)
  • foreman: executor fetches GitHub issue body when payload prompt is empty (#571) (#572) (2b5bd31)
  • foreman: post-M4 stability follow-ups for v5-batch readiness (#535) (a841612)
  • foreman: v0.4 reviewer agent — tool-using reviewer with sharpened prompt + structured findings (#575) (#576) (06091a9)
  • foreman: workspace-scoped bash + WORKSPACE_ROOT contract (#567) (#568) (061eb41)
  • gpu: add Intel GPU (oneAPI/SYCL) support across controller, CLI, docs, and e2e (#557) (741ef5d)
  • metal-agent: InferenceService name allowlist for multi-Mac fleets (#555) (67361f3)

Bug Fixes

  • foreman/build: include gate_job_template.yaml in Docker context (#554) (def535d)
  • foreman/executor: resolve InferenceService port from live Endpoints, not stale install-time override (#550) (4351608)
  • foreman/executor: route reviewer-role GO through modelDecidedResult (#545) (16943a5)
  • foreman/loop: force-terminate returns clean Terminal envelope (#544 follow-up) (#570) (19500c9)
  • foreman/oai: always emit content on non-assistant messages (#562) (f6dc8e1)
  • foreman/reviewer: ground-truth filesTouched + bump qwen maxTurns + tighten confabulation defenses (#584) (b3e21f0)
  • foreman/reviewer: ground-truth issueAsk + cap qwen Section D (#587) (b66006c)
  • foreman/reviewer: role-aware stuck-loop detector + non-empty reviewer user prompt (rerun-7 follow-up) (#577) (19590f6)
  • foreman/tools: cmd.WaitDelay + process-group kill so BashTool can't deadlock on grandchild-held pipes (#547) (c12f6f8)
  • foreman: cascade + Workload rollup gate on phase AND verdict, not phase alone (#548) (1b72a7c)

Documentation

  • foreman: v0.8.0 release-prep docs + README Foreman section (#591) (5e41df1)
  • mention make lint-all in AGENTS.md and CONTRIBUTING.md (#588) (39da983)

Miscellaneous

  • release Foreman debut as 0.8.0 (take 2) (#593) (a8f0368)

llmkube-0.8.0

28 May 08:42
ea9e0af

foreman-0.8.0

28 May 08:42
ea9e0af

v0.7.12

24 May 20:18
2bf8ad6

0.7.12 (2026-05-24)

Features

  • foreman/m6: Workload reconciler with stub planner (explicit pipeline + issue-batch shortcut) (#533) (dbdcd46)

Bug Fixes

  • foreman/m4: gate Job honors payload.branch + clones from --git-remote-url (#529) (905a269)
  • foreman: chart wires --workspace-dir + tightens docs for native-mode required values (#534) (1c43c69)

llmkube-0.7.12

24 May 20:19
2bf8ad6

foreman-0.7.12

24 May 20:19
2bf8ad6

v0.7.11

23 May 01:04
4c2480b

0.7.11 (2026-05-23)

Bug Fixes

  • foreman: drop chart-level subchart dep on llmkube (unblock v0.7.11 chart-releaser) (#519) (207ddc6)