evalkit-action

Eval-as-CI for AuraOne EvalKit. The action accepts rubric-path, responses-path, judge-config, and threshold, installs auraone-evalkit, runs score/report commands, and can fail checks below threshold.

What This Is Not

Examples contain no paid or customer data.

Example

name: evalkit
on: [pull_request]
jobs:
  eval:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: auraoneai/evalkit-action@v0.1.1
        with:
          rubric-path: evals/rubric.jsonl
          responses-path: evals/model_outputs.jsonl
          threshold: "0.75"
          github-token: ${{ secrets.GITHUB_TOKEN }}

The action installs auraone-evalkit, writes report-ready score JSON, generates a Markdown report, comments on pull requests when a token and PR context are available, and fails the check when the average score is below threshold. judge-config must be a JSON object. The action validates it, writes it to a temporary file, and exposes it to EvalKit subprocesses as EVALKIT_JUDGE_CONFIG and EVALKIT_JUDGE_CONFIG_PATH.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
.github/workflows		.github/workflows
dist		dist
docs		docs
examples/.github/workflows		examples/.github/workflows
src		src
CHANGELOG.md		CHANGELOG.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
action.yml		action.yml
package.json		package.json
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

evalkit-action

What This Is Not

Example

About

Uh oh!

Releases 2

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

evalkit-action

What This Is Not

Example

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages