BrutalCaeser/README.md


👋 Hey, I'm Yash

I'm an MS AI student at Northeastern University (Silicon Valley), trying to understand how generative models learn the structure of language, video, and the physical world. I started out in mechanical engineering, so I care less about chasing leaderboards than about the math underneath. Calculus and Bayesian probability are the parts that feel right to me, and they happen to be the spine of diffusion and flow-based modeling.

Most of what I build doesn't work the first time. I've had research bets collapse (entropy-weighted masking, SIGReg representation lines), and the rigor of chasing them down is the real asset. So I write up the negative results too, and I'm still hunting for the problem that feels inevitable and mine.

  • 🔬 Working on diffusion / flow-matching language models, video world models, and representation collapse in self-supervised learning.
  • 🛠️ Currently an AI/GenAI Engineering Co-op at NovasIQ, building deterministic agentic systems on Claude + MCP.
  • 🎓 Reinforcement Learning teaching assistant at Khoury College (DQN, PPO, policy gradients).
  • 🌱 Long game: a startup in robotics / physical intelligence, or an AI layer over the legacy software that bottlenecks real industries.
  • 🔗 Full portfolio, projects & writing → brutalcaeser.github.io

🧭 What I'm working on now

🎯 reinforcing_dLLMs Reinforcement learning (diffu-GRPO) for reasoning in diffusion LLMs. I validated the one-step log-prob estimator the method hinges on, then showed RL lifts held-out Countdown ~4 pp on a single GPU (a faithful run is ~24 GPU-days).
🧩 block-diffusion-pareto Mapped the full quality↔throughput frontier for block-diffusion LMs and found generation throughput peaks at block size 32 β€” the unpublished value a leading commercial model reportedly runs.
πŸŒ€ phantom-gradients When a model's useful features live in far fewer dimensions than its embedding, training fights noise in the empty ones. A coherence-guided sampler recovers the structure without knowing the true dimension, beating even an oracle that does.
🌊 Flow-Language-Model Reproducing and extending Flow-Map language models (one-step text generation), where I found a quality curve the original paper missed.
πŸ€– physical_ai A 16-week sim-to-real track: Isaac Lab β†’ GR00T β†’ SO-ARM101.
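
For readers new to the GRPO family that diffu-GRPO builds on, here is a minimal sketch of the group-relative advantage, assuming the usual normalization: each reward minus its group mean, divided by the group standard deviation. The function name, the `eps` constant, and the reward values are illustrative and not taken from reinforcing_dLLMs. Population standard deviation is used here, though implementations differ on that choice. The one-step log-prob estimator is not shown.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Normalize each reward against its own sampling group (GRPO-style)."""
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population std over the group (an assumption)
    # eps keeps the division finite when every reward in the group is equal
    return [(r - mu) / (sigma + eps) for r in rewards]


# Hypothetical rewards for four completions sampled from the same prompt.
rewards = [1.0, 0.0, 0.0, 1.0]
advantages = group_relative_advantages(rewards)
print(advantages)
```

Completions that beat their group's average get a positive advantage and the rest get a negative one, so no separate value network is needed to form a baseline.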

✍️ Recent writing


📌 A few things I've built

Diffusion_Robot_Control_Policy
(no description)
microDLM ⭐ 1
From-scratch discrete diffusion language model on Tiny Shakespeare - 5 changes from GPT
spatial-jepa-sigreg
GAP 1: Distributional Regularization Meets Spatial Structure - SIGReg × Patch-Level JEPA Representations (N…
matrix-game-hpc
Matrix-Game-2.0 deployment on Northeastern Explorer HPC - docs, logs, and scripts
storyverse
Turn any children's storybook into an animated film - FLUX illustrations, Wan 2.2 animation, Edge TTS narra…
minigenie
Flow matching video world model for Procgen games - built from scratch in PyTorch
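
Several of these projects rest on flow matching, so a toy illustration of the core idea may help. This is a minimal sketch assuming the common straight-line path between a noise sample `x0` and a data sample `x1`, where the regression target for the velocity field is `x1 - x0`. All function names and values are hypothetical and not drawn from minigenie or any other repo listed here. A constant lambda stands in for a trained network.

```python
def interpolate(x0, x1, t):
    """Point on the straight-line path from x0 (noise) to x1 (data), t in [0, 1]."""
    return [(1.0 - t) * a + t * b for a, b in zip(x0, x1)]


def target_velocity(x0, x1):
    """Velocity regression target on the linear path: x1 - x0."""
    return [b - a for a, b in zip(x0, x1)]


def euler_sample(x0, velocity_fn, steps=10):
    """Integrate dx/dt = v(x, t) from t=0 to t=1 with fixed-step Euler."""
    x = list(x0)
    dt = 1.0 / steps
    for i in range(steps):
        t = i * dt
        v = velocity_fn(x, t)
        x = [xi + dt * vi for xi, vi in zip(x, v)]
    return x


# Toy pair: x0 plays the role of noise, x1 the role of a data point.
x0, x1 = [0.0, 2.0], [1.0, -1.0]
v = target_velocity(x0, x1)
# A real model would predict v from (x, t); here the exact target is reused.
x_end = euler_sample(x0, lambda x, t: v)
print(x_end)
```

With the exact velocity, Euler integration carries `x0` to `x1` up to floating-point error. In training, a network would be fit to `target_velocity` at points produced by `interpolate`.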

🧰 Tools I reach for

tech stack

+ JAX/Flax · CUDA · Hugging Face · Claude + MCP · ONNX


Most-used languages

github contribution snake

Away from the screen: chess, tennis, and a slow walk through a philosophy reading list. Right now it's To Kill a Mockingbird, with Russell's The Problems of Philosophy on deck. Borrowed beliefs bore me.

📊 The "things I've built" list and the snake both refresh themselves on a schedule, so this page is never quite stale.

Pinned

  1. microDLM (Public)

    From-scratch discrete diffusion language model on Tiny Shakespeare - 5 changes from GPT

    Python · ⭐ 1

  2. read_my_lips (Public)

    Forked from amanvirparhar/chaplin

    Read my Lips is an end-to-end assistive communication tool designed to empower mute individuals or those who prefer silent communication. It reads your lips in real time, understands what you are s…

    Python

  3. phantom-gradients (Public)

    Synthetic testbed for distributional SSL regularizers when intrinsic dim K << embedding dim D (WorldModel/RDMReg research line)

    Python

  4. reinforcing_dLLMs (Public)

    Independent, constrained-compute reproduction of d1's diffu-GRPO (RL for diffusion LLMs) + a planned novel block-diffusion port. WIP.

    Python

  5. Flow-Language-Model (Public)

    Forked from david3684/flm

    Official codebase for the paper "One-step Language Modeling via Continuous Denoising"

    Python

  6. minigenie (Public)

    Flow matching video world model for Procgen games - built from scratch in PyTorch

    Python