Skip to content

Add ShopPay Audit Benchmark#16

Open
Dmatut7 wants to merge 1 commit into
Vvkmnn:mainfrom
Dmatut7:add-shoppay-audit-benchmark
Open

Add ShopPay Audit Benchmark#16
Dmatut7 wants to merge 1 commit into
Vvkmnn:mainfrom
Dmatut7:add-shoppay-audit-benchmark

Conversation

@Dmatut7

@Dmatut7 Dmatut7 commented Jun 5, 2026

Copy link
Copy Markdown

Adds ShopPay Audit Benchmark to the Agent benchmark section.

It is a compact OSS benchmark for evaluating AI coding agents on spec-grounded business-logic audits, with seeded defects, baseline tests, an answer key, and a 100-point scoring rubric.

Repo: https://github.com/Dmatut7/shoppay-audit-benchmark
Landing page: https://dmatut7.github.io/shoppay-audit-benchmark/
Release: https://github.com/Dmatut7/shoppay-audit-benchmark/releases/tag/v0.1.1

Validation: README-only addition; lint skipped because dependencies were not installed in the shallow clone.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant