Oswin So

4th year PhD @ REALM, MIT AeroAstro

I am interested in developing structure-exploiting algorithms for the learning and control of dynamical systems. My research has mainly focused on improving safety in reinforcement learning using Hamilton-Jacobi reachability analysis and Control Barrier Functions, with applications to multi-agent systems and fixed-wing aircraft. I have also dabbled in topics such as scientific reasoning in (d)LLMs.

I’m currently at REALM at MIT, advised by Chuchu Fan. Previously, I did my undergrad at Georgia Tech, where I was very fortunate to do undergraduate research with Evangelos Theodorou and Molei Tao.

This past summer, I interned at Meta FAIR, mentored by Guan-Horng Liu and working with Ricky T. Q. Chen, on fine-tuning discrete generative models characterized by Continuous-Time Markov Chains (e.g., diffusion-based LLMs). I previously interned at Toyota Research Institute, where I worked on game-theoretic planning. I also worked at Aurora as a Behavior Planning Intern during the summer of 2021 under Paul Vernaza and Arun Venkatraman, on cost function learning via on-policy negative examples for autonomous driving.

See my full CV here (updated in February 2026).

Contact: oswinso [at] mit [dot] edu
Follow: Google Scholar | LinkedIn | oswinso | @oswinso

selected publications

(* Equal contribution. See Google Scholar for the full list.)
  1. In Submission
    Bellman Value Decomposition for Task Logic in Safe Optimal Control
    William Sharpless*, Oswin So*, Dylan Hirsch, Sylvia Herbert, and Chuchu Fan
    In Submission
  2. ICLR 2026
    Discrete Adjoint Matching
    Oswin So, Brian Karrer, Chuchu Fan, Ricky T. Q. Chen, and Guan-Horng Liu
    In The Fourteenth International Conference on Learning Representations (ICLR), 2026
  3. ICLR 2026
    Solving Parameter-Robust Avoid Problems with Unknown Feasibility using Reinforcement Learning
    Oswin So*, Eric Yang Yu*, Songyuan Zhang, Matthew Cleaveland, Mitchell Black, and Chuchu Fan
    In The Fourteenth International Conference on Learning Representations (ICLR), 2026
  4. ICLR 2026
    ReFORM: Reflected Flows for On-support Offline RL via Noise Manipulation
    Songyuan Zhang, Oswin So, H. M. Sabbir Ahmad, Eric Yang Yu, Matthew Cleaveland, Mitchell Black, and Chuchu Fan
    In The Fourteenth International Conference on Learning Representations (ICLR), 2026
  5. RSS 2025
    Solving Multi-Agent Safe Optimal Control with Distributed Epigraph Form MARL
    Songyuan Zhang*, Oswin So*, Mitchell Black, Zachary Serlin, and Chuchu Fan
    In Robotics: Science and Systems (RSS), 2025
    [Outstanding Student Paper Award, 0.6%]
  6. RSS 2025
    Safe Beyond the Horizon: Efficient Sampling-based MPC with Neural Control Barrier Functions
    Yin Ji*, Oswin So*, Eric Yu, Chuchu Fan, and Panagiotis Tsiotras
    In Robotics: Science and Systems (RSS), 2025
  7. ICLR 2025
    Discrete GCBF Proximal Policy Optimization for Multi-agent Safe Optimal Control
    Songyuan Zhang*, Oswin So*, Mitchell Black, and Chuchu Fan
    In The Thirteenth International Conference on Learning Representations (ICLR), 2025
  8. T-RO
    GCBF+: A neural graph control barrier function framework for distributed safe multi-agent control
    Songyuan Zhang*, Oswin So*, Kunal Garg, and Chuchu Fan
    IEEE Transactions on Robotics (T-RO), 2024
  9. NeurIPS 2024
    Solving Minimum-Cost Reach Avoid using Reinforcement Learning
    Oswin So*, Cheng Ge*, and Chuchu Fan
    In Thirty-Eighth Conference on Neural Information Processing Systems (NeurIPS), 2024
  10. ICRA 2024
    How to train your neural control barrier function: Learning safety filters for complex input-constrained systems
    Oswin So, Zachary Serlin, Makai Mann, Jake Gonzales, Kwesi Rutledge, Nicholas Roy, and Chuchu Fan
    In 2024 International Conference on Robotics and Automation (ICRA), 2024
  11. RSS 2023
    Solving Stabilize-Avoid Optimal Control via Epigraph Form and Deep Reinforcement Learning
    Oswin So and Chuchu Fan
    In Robotics: Science and Systems (RSS), 2023