Building local LLM infrastructure, benchmarking models, publishing results
I build local LLM inference stacks from source on consumer hardware, benchmark models systematically, and publish datasets on HuggingFace. I also build analytics dashboards and have scaled a tech community to 20,000+ members.
Current Focus:
- local inference optimisation (llama.cpp, CUDA)
- systematic benchmarks across dense, MoE, and hybrid architectures
- quantisation testing (GGUF Q4_K_M, IQ4_XS, turboquant turbo2/turbo3)
- context window scaling analysis and VRAM profiling
- publishing benchmark datasets on HuggingFace
Background:
- AI / ML Practitioner - local LLM inference, model evaluation, HuggingFace contributor
- Growth Lead @ Yari Finance - DeFi protocol growth, partnerships, on-chain analytics
- Founder @ BeraLand - built a 20K+ member blockchain community from zero
- 15+ Dune dashboards tracking $1B+ in trading volume
- Master's in Corporate & Market Finance - KPMG background
I write about AI infrastructure, local inference, and model evaluation.
