he/him
Researcher focusing on Large Language Model Post-training, Reinforcement Learning, Multi-agent Systems, and Multimodal AI. Previously worked on Embodied AI.
This is a page not in the menu. You can use markdown in this page.