Robotic Reinforcement Learning Β· AI Agents Β· LLM Infrastructure Β· Sim-to-Real
PhD Student β Open to Research / Robotics / ML Internships.
PhD student working at the intersection of Robot Learning and Large-scale Systems β I train RL policies in simulation and push them toward real hardware, and I care about the infrastructure that makes both fast and correct.
I came to AI from an unusual direction: a B.Arch / architecture background that taught me to reason about complex systems, geometry, spatial relationships, and the tradeoffs between elegant ideas and real-world constraints. Over time, I became convinced that many of the most important problems in design and the physical world will be solved not by better static tools, but by intelligent systems that can learn, adapt, and act. That belief drew me to robot learning, simulation, and AI infrastructure. I am especially energized by unfamiliar, technically demanding problems and by the process of turning ideas across disciplines into scalable systems that work in practice.
-
π€ Robotics RL in Simulation β building and training policies in NVIDIA Isaac Sim / Isaac Lab (locomotion / manipulation / sim-to-real) with GPU-parallel environments and PPO/SAC-style training.
-
βοΈ LLM infrastructure β contributor to vLLM, the core LLM inference engine: 2 performance PRs merged.
-
π¦Ύ ROS 2 β bridging learned policies to real robot stacks (nodes, controllers, perception β action pipelines).
-
π¬ Research interests: Reinforcement Learning, AI Agent, Robot Manipulation/Locomotion, GPU-accelerated simulation and sim-to-real transfer.
π« lynnhe02@gmail.com Β· π Texas
- PR #46542 β
[Perf][LoRA](merged): Replaced a per-tokenlist.index()lookup inconvert_mappingβ an O(num_tokens Γ num_loras) hot path the code had flagged with a TODO β by building a reverse{lora_id: index}dict once for O(1) lookups, cutting mapping construction to O(num_tokens). 2.5Γβ6.5Γ faster in microbenchmarks (e.g. 64 LoRAs / 1024 tokens: 275Β΅s β 42Β΅s), with identical output verified against randomized + existing LoRA tests. - PR #46543 β
[Perf][Multimodal](merged): Removed a wasteful O(num_frames) timestamp-list allocation in GLM-4V / GLM video frame sampling, computing each timestamp inline asframe_index * duration_per_frame. Byte-for-byte identical behavior with lower memory on long videos.
Robotics & Simulation
AI / ML
Languages & Systems
| Project | Description | Stack |
|---|---|---|
| Franka Arm RL (Isaac Lab) | Reinforcement-learning training for a Franka Emika Panda manipulator in NVIDIA Isaac Lab β GPU-parallel environments for arm control (reaching / manipulation), with PPO-style policy training in simulation. | Isaac Lab, Isaac Sim, PyTorch, RL |
| Archiagents | End-to-end AI agent for architectural design (collaborative project). Ingests project briefs + CAD/DWG/IFC/Revit files, runs requirement dialogue, generates design schemes and photorealistic renders, and outputs IFC4 BIM models with an embedded Autodesk APS viewer. My role: brought the architecture-domain expertise (B.Arch background) β shaping the design-requirement logic, the agent's reasoning over building programs, and the IFC4 / BIM modeling that turns AI output into valid design deliverables. | Vercel AI SDK, shadcn/ui, Autodesk APS, IFC4 |
| Revit-Civil-AI-Estimator | Revit 2025 add-in that uses OpenAI to automate quantity takeoff and cost estimation for civil-engineering workflows. | C#, OpenAI API, Revit |

