Skip to content

πŸ₯ Repository Health DashboardΒ #695

Description

@github-actions

πŸ₯ Daily Health Check β€” 2026-06-26

Status: πŸ”΄ 3 critical Β· 🟑 2 warnings Β· πŸ”΅ 1 info
Since yesterday: πŸ†• 3 new Β· βœ… 0 resolved Β· πŸ“Œ 3 unchanged

πŸ“Œ Maintainer action needed: please pin this issue as the canonical health dashboard and unpin/close any stale duplicate.


πŸ†• New Findings (3)

These appeared since the last health check (2026-06-25).

πŸ”΄ [P1] Evaluation build-validator job failed: GitHub.Copilot.SDK build error


πŸ”΄ [P5] Evaluation failure rate critical: ~33% across all branches in last 24h


🟑 [P2] Evaluation cancelled after 102 min on main (issue_comment trigger)

  • Fingerprint: pipeline:evaluation:evaluate:timeout
  • First seen: 2026-06-26
  • Details: An issue_comment-triggered evaluation run ("Add eval coverage for dotnet-test/filter-syntax") was cancelled on main at 2026-06-25T08:34Z after running for ~102 minutes. No job failed β€” the run was cancelled externally (likely by a concurrency group eviction when a newer run started). This is consistent with the chronic eval duration issue (runs taking 99–130 min).
  • Links: Cancelled run #4804
  • Suggested action: The 102-min runtime is consistent with the chronic duration issue (P3/resource:eval-duration:critical). Reducing eval runtime (parallelizing dotnet-test plugin evaluation) would also reduce concurrency-group cancellations.

πŸ” Investigation Results

Deep investigations are dispatched for new critical/warning findings.
The grooming workflow links results ~3 hours after this run.

Finding Severity Investigation First Seen Result
Evaluation avg duration critical (110+ min) πŸ”΄ Critical βœ… Done 2026-06-03 The evaluate (dotnet-test) job is the sole critical bottleneck at 100 minutes, driven by 175 sequential eval scenarios; the regression traces directly to PR #707 (merged June 1) which added polyglot scenarios to three skills.
Evaluation build-validator failed (GitHub.Copilot.SDK CS0234) πŸ”΄ Critical πŸ”„ Dispatched 2026-06-26 ⏳ Investigation dispatched β€” results arriving shortly...
Evaluation failure rate critical (33% in 24h) πŸ”΄ Critical πŸ”„ Dispatched 2026-06-26 ⏳ Investigation dispatched β€” results arriving shortly...

βœ… Resolved Since Yesterday (0)

No findings resolved since the last health check (2026-06-25).


πŸ“Œ Existing Findings (3)

These have been present since before today. Sorted by severity then age.

πŸ”΄ Critical β€” Evaluation average duration critical (~99–130 min avg, threshold: 55 min) Β· first seen 2026-06-03 Β· 12 occurrences
  • Fingerprint: resource:eval-duration:critical
  • First seen: 2026-06-03 Β· Occurrences: 12 (chronic β€” 3+ weeks)
  • Details: The 14-day average for substantial evaluation schedule runs remains well above the 55-min critical threshold. Today's scheduled run took 99 minutes (down from the 121–130 min recent average), but still 1.8Γ— the threshold. The previous run (yesterday's) was 85 min, suggesting some variance.
  • 7-day summary (schedule, main): Today: 99 min βœ… success; yesterday: 85 min βœ…; Jun 23–24: 2 failures. Avg ~118 min (est.)
  • Root cause (from investigation): The evaluate (dotnet-test) job is the bottleneck, driven by 175 sequential eval scenarios added by PR Make dotnet-test analysis skills and auditor agent polyglotΒ #707 (merged 2026-06-01).
  • Links: Today's run (99 min, success) Β· Investigation result Β· evaluation.yml
  • Suggested action: Parallelize eval scenarios in evaluate (dotnet-test) β€” split 175 sequential scenarios across parallel jobs, or introduce a fast-path for PR-triggered evaluations vs. scheduled full runs.
🟑 Warning β€” Orphan plugin: dotnet-experimental not listed in marketplace.json Β· first seen 2026-05-14 Β· 31 occurrences
  • Fingerprint: infra:orphan-plugin:dotnet-experimental
  • First seen: 2026-05-14 Β· Occurrences: 31 (chronic β€” 6+ weeks)
  • Details: plugins/dotnet-experimental/ exists on disk with a valid plugin.json and skills (exp-mock-usage-analysis, exp-test-maintainability, exp-simd-vectorization), but no entry in .github/plugin/marketplace.json (which lists 14 plugins). The plugin is not discoverable by consumers.
  • Links: marketplace.json Β· plugins/dotnet-experimental/
  • Suggested action: Either add { "name": "dotnet-experimental", "source": "./plugins/dotnet-experimental", "description": "..." } to marketplace.json when ready to publish, or remove the directory if not intended for publication.
πŸ”΅ Info β€” evaluation.yml uses --verdict-warn-only mode Β· first seen 2026-05-16 Β· 29 occurrences
  • Fingerprint: infra:verdict-warn-only
  • First seen: 2026-05-16 Β· Occurrences: 29 (intentional configuration)
  • Details: evaluation.yml passes --verdict-warn-only to the skill-validator, treating skill validation failures as warnings rather than hard failures. This is intentional.
  • Link: evaluation.yml

πŸ“Š Trends (7-day)

Metric Today 7d Avg Ξ” Trend
Eval duration β€” schedule/main (min) 99 ~118 -19 βœ…
Eval success rate β€” main schedule (7d) 100% (1/1) ~71% +29% βœ…
Eval success rate β€” all branches (24h) 67% (2/3) 100% -33% ⚠️
Eval scheduled cancellation rate (24h) 0% (0/1) 0% 0% ➑️
Workflow failure rate β€” main (24h) ~3% (1 failure) ~0% +3% ↗️
Compute hours/day ~3.5h ~2.0h +1.5h ↗️

⚠️ Eval pipeline degraded today: 1 evaluation build failure + 1 cancellation on main in 24h.
βœ… Scheduled eval succeeded (99 min, latest main commit). The build failure was on an older PR commit and appears resolved.
⚠️ P1 + P5 are correlated β€” the SDK build error in PR #818's skill-validator code drove both findings. The fix (PR #832, merged 22:35 UTC) appears to have resolved the incompatibility.
⚠️ Eval duration remains a chronic concern (118 min est. 7d avg). Concurrency cancellations will continue until runtime is reduced.
⚠️ Skipped I5 check (Pages deployment): GitHub Pages API not accessible via available tools.
i️ I3 (validate-skills): No validate-skills.yml workflow found β€” check not applicable.
i️ I6 (unpinned actions): All workflow action references use first-party (actions/*) or SHA-pinned third-party actions β€” no unpinned third-party actions detected.


πŸ€– Generated by DevOps Health Check agentic workflow Β· Run #28217288553 Β· 2026-06-26T04:34 UTC

Generated by DevOps Daily Health Check Β· 1.2K AIC Β· ⊞ 36.5K Β· β—·

Metadata

Metadata

Assignees

No one assigned

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions