Check out LongGen and S2-Attention, simple and effective architectures that substantially reduce the KV-cache overhead of long-context LLMs. We have also released an efficient, easy-to-use CUDA kernel library that supports various types of sparse attention.
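
To give a feel for why sparse attention shrinks the KV cache, here is a minimal, self-contained PyTorch sketch of one common pattern, sliding-window attention, where each decode step only keeps the most recent `window` keys and values. This is purely illustrative and is **not** the S2-Attention or LongGen API; the function name, shapes, and `window` parameter are assumptions for the example.

```python
# Illustrative sketch only (not the S2-Attention / LongGen API):
# sliding-window sparse attention, one simple way to bound the KV cache.
import torch
import torch.nn.functional as F

def sliding_window_decode_step(q, k_cache, v_cache, k_new, v_new, window=1024):
    """Append the new key/value, keep only the last `window` entries,
    and attend the single query token over the bounded cache."""
    k_cache = torch.cat([k_cache, k_new], dim=1)[:, -window:]           # (B, <=W, D)
    v_cache = torch.cat([v_cache, v_new], dim=1)[:, -window:]           # (B, <=W, D)
    scores = (q @ k_cache.transpose(1, 2)) / k_cache.shape[-1] ** 0.5   # (B, 1, <=W)
    out = F.softmax(scores, dim=-1) @ v_cache                           # (B, 1, D)
    return out, k_cache, v_cache

# Toy usage: the cache never grows beyond `window`, so KV memory stays
# constant no matter how long the generated sequence gets.
B, D, W = 1, 64, 8
k_cache = torch.zeros(B, 0, D)
v_cache = torch.zeros(B, 0, D)
for _ in range(32):
    q = torch.randn(B, 1, D)
    out, k_cache, v_cache = sliding_window_decode_step(
        q, k_cache, v_cache, torch.randn(B, 1, D), torch.randn(B, 1, D), window=W)
print(k_cache.shape)  # torch.Size([1, 8, 64]) -- bounded by the window size
```

In a full model, only some layers or heads would typically use such a sparse pattern while others keep full attention; an optimized CUDA kernel then avoids materializing the masked-out positions entirely rather than slicing dense tensors as this toy version does.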