Hao Peng

3314 SC
201 North Goodwin Avenue
Urbana, IL 61801
I am an Assistant Professor in the Department of Computer Science at the University of Illinois at Urbana-Champaign (UIUC).
I received my Ph.D. from the University of Washington, where I was advised by Noah Smith, and my bachelor's degree from Peking University. I spent a year at the Allen Institute for Artificial Intelligence as a Young Investigator, and have also spent time at Microsoft Research, Google, and DeepMind.
My current research focuses on large language models (LLMs). Specifically, I work towards AI capable of:
- Solving complex reasoning problems in a generalizable way; I believe that learning from experience (e.g., through reinforcement learning) and insights from human cognition are crucial to this goal;
- Understanding and reasoning about the world causally;
- Positively impacting society;
- Advancing the frontier of human knowledge and contributing to scientific discovery, which I see as the ultimate demonstration of true generalization: going beyond the knowledge these models were trained on.
Outside work, I cater to the whims of a quartet of furry overlords: Meera, Loki, Sylvie, and Kea. When they release me from their service, I cycle in the summer, and (backcountry) ski in the winter.
Undergraduate and master’s students: We love hearing from motivated undergraduate and master’s students! If you’d like to collaborate with us, the best way to get in touch is by email. When reaching out, please read one of our recent papers that excites you and ask a few questions about it in your message. Please start your subject line with [Meera+Loki+Sylvie+Kea] to indicate that you have read this message.
recent publications
- From f(x) and g(x) to f(g(x)): LLMs Learn New Skills in RL by Composing Old Ones. 2025.
- Context Length Alone Hurts LLM Performance Despite Perfect Retrieval. In Findings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2025.
- The Best Instruction-Tuning Data are Those That Fit. In Advances in Neural Information Processing Systems (NeurIPS), 2025. (Spotlight)
- The Unreasonable Effectiveness of Entropy Minimization in LLM Reasoning. In Advances in Neural Information Processing Systems (NeurIPS), 2025.
- Reinforcement Learning Finetunes Small Subnetworks in Large Language Models. In Advances in Neural Information Processing Systems (NeurIPS), 2025.