Metal: FP8-packed compressed-KV cache + long-context memory optimizations #416

Open
lixiangnlp wants to merge 1 commit into antirez:main from lixiangnlp:fp8-kv-cache-memory-opt

Commits

Commits on Jun 15, 2026