Popular repositories Loading
-
DeepSeek-V4-Flash-Dual-DGX-Spark-1M-Context
DeepSeek-V4-Flash-Dual-DGX-Spark-1M-Context PublicDeploy DeepSeek V4 Flash (MoE reasoning model) on dual DGX Spark nodes with 1M token context, InfiniBand, and FP8 KV-cache
-
-
DGX_Spark_Qwen_3.6_27b_35b_GGUF_start_script
DGX_Spark_Qwen_3.6_27b_35b_GGUF_start_script PublicShell 14
-
Qwen3.6-35B-A3B-NVFP4-vLLM
Qwen3.6-35B-A3B-NVFP4-vLLM PublicSelf-hosted vLLM inference for Qwen3.6-35B-A3B-NVFP4
Jinja 13
-
Qwen3.6-27B-NVFP4-vLLM
Qwen3.6-27B-NVFP4-vLLM PublicProduction-ready vLLM deployment wrapper for Qwen3.6-27B (NVFP4) — self-hosted OpenAI-compatible inference
-
DeepSeek-v4-Flash-vs-Step-3.7-Flash-Tool-Call-Benchmark
DeepSeek-v4-Flash-vs-Step-3.7-Flash-Tool-Call-Benchmark PublicHead-to-head comparison of DeepSeek-V4-Flash vs Step-3.7-Flash on tool-eval-bench v2.0.6 (69 scenarios). Full results, summary, and analysis.
If the problem persists, check the GitHub status page or contact support.
