wuli666/README.md

Hi, I'm wuli666 👋

Building in the LLM serving & agent space — inference, quantization, and a bit of agent RL.
vLLM ecosystem · now heading deeper into infra 🛠️



🧰 Stack & Focus

FP8 / INT8 quantization · efficient inference & serving · multi-agent systems · RLVR for small models


🚀 Featured Projects

| Project | What it is |
| --- | --- |
| langextract-vllm | A vLLM provider plugin for LangExtract: run structured extraction on a local vLLM backend |
| claude-code-architecture | Deep reverse-engineering of the Claude Code CLI (v2.1.88) internals from sourcemaps |
| mobileground-r1 | A small-VLM phone-GUI grounding agent, trained with RLVR (GRPO) |
| vantage | AI Job Decision Copilot: scan, score, advise, decide |

🤝 Open-source Contributions

Where I contribute upstream:


📈 Activity

wuli666's activity graph


📫 421774554@qq.com

Pinned

  1. langextract-vllm Public

    Add vLLM provider plugin for LangExtract

    Python · 31 stars · 5 forks

  2. huizai Public

    JavaScript

  3. news_agent_system Public

    Python 1

  4. vllm-omni Public

    Forked from vllm-project/vllm-omni

    A framework for efficient model inference with omni-modality models

    Python

  5. claude-code-architecture Public

    Deep architecture analysis of the Anthropic Claude Code CLI (v2.1.88) source, based on sourcemap reverse engineering

    2 stars · 2 forks