he/him
Researcher focusing on Large Language Model Post-training, Reinforcement Learning, Multi-agent Systems, and Multimodal AI. Previously worked on Embodied AI.
Sorry, but the page you were trying to view does not exist.