Actions: EricLBuehler/candle-vllm

Showing runs from all workflows
412 workflow runs

Remove rust config for CI build
.github/workflows/ci.yml #844: Commit 8c34057 pushed by guoqingbao
Failure master
Improve robustness of multirank inference (#428)
.github/workflows/ci.yml #840: Commit 6f4b87f pushed by guoqingbao
Failure master
Optimize CUDA graph (#426)
.github/workflows/ci.yml #838: Commit 76c2abe pushed by guoqingbao
Failure master
Improve ChatUI (smooth and fast response)
.github/workflows/ci.yml #837: Commit 185caf0 pushed by guoqingbao
Failure master
Support quantized kvcache (#425)
.github/workflows/ci.yml #836: Commit 8cb03bf pushed by guoqingbao
Failure master
Improve graph capture (#424)
.github/workflows/ci.yml #835: Commit bb15a14 pushed by guoqingbao
Failure master
Fully support FP8 KVCache (#423)
.github/workflows/ci.yml #834: Commit ae0e148 pushed by guoqingbao
Failure master
Fix image prefix cache miss (#420)
.github/workflows/ci.yml #831: Commit 6e0bd12 pushed by guoqingbao
Failure master
Improve continuous batching (#419)
.github/workflows/ci.yml #830: Commit f09e3fd pushed by guoqingbao
Failure master
Improve prefix-cache eviction (#411)
.github/workflows/ci.yml #825: Commit f3b890b pushed by guoqingbao
Failure master
Improve tool calling across various models (#410)
.github/workflows/ci.yml #824: Commit 68c54fa pushed by guoqingbao
Failure master
Typo fix (#409)
.github/workflows/ci.yml #822: Commit 27bc6d1 pushed by guoqingbao
Failure master
Support Minimax M2.5/M2.7 models (#408)
.github/workflows/ci.yml #820: Commit 769d78b pushed by guoqingbao
Failure master