All News
Complete timeline of recent updates and announcements.
09/25
Extremely happy to share that our work on reward hacking🔍 in large language models was accepted to NeurIPS 2025 as a Spotlight Paper! I am also thankful for the NeurIPS Scholar Award.
09/25
Just started the second year of my PhD!
09/25
I am at Amazon NYC, presenting our work on reward hacking🔍 at the NY Reinforcement Learning Workshop.
08/25
I am at Princeton attending the Machine Learning Theory Summer School.
07/25
I am at ICML, presenting our work on reward hacking🔍 at the Models of Human Feedback for AI Alignment Workshop. Grateful to have been awarded the Hudson River Trading travel grant.
06/25
I am at the University of Minnesota, attending the North American School of Information Theory.
05/25
I just finished the first year of my PhD at Harvard!
04/25
Our paper on discretion🔍 in AI alignment was accepted to ACM FAccT 2025!
03/25
I am at Yale, giving a talk on discretion🔍 in AI alignment. Happy to share that this work received the Best Paper Award at the New England NLP workshop! You can check out my slides here.
09/24
I joined Harvard as a PhD student in Flavio Calmon's group! Happy to be supported by the Harvard Graduate Prize Fellowship.