publications
2025
- From f(x) and g(x) to f(g(x)): LLMs Learn New Skills in RL by Composing Old Ones. 2025
- Context Length Alone Hurts LLM Performance Despite Perfect Retrieval. In Findings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2025
- The Best Instruction-Tuning Data are Those That Fit. In Advances in Neural Information Processing Systems (NeurIPS), 2025 (spotlight)
- The Unreasonable Effectiveness of Entropy Minimization in LLM Reasoning. In Advances in Neural Information Processing Systems (NeurIPS), 2025
- Reinforcement Learning Finetunes Small Subnetworks in Large Language Models. In Advances in Neural Information Processing Systems (NeurIPS), 2025
- mCLM: A Modular Chemical Language Model that Generates Functional and Makeable Molecules. 2025
- Improving Influence-based Instruction Tuning Data Selection for Balanced Learning of Diverse Capabilities. In Findings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2025
- FactCheckmate: Preemptively Detecting and Mitigating Hallucinations in LMs. In Findings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2025
- A Little Goes a Long Way: Efficient Long Context Training and Inference with Partial Contexts. In Proceedings of the International Conference on Learning Representations (ICLR), 2025
- Retrieval Head Mechanistically Explains Long-Context Factuality. In Proceedings of the International Conference on Learning Representations (ICLR), 2025 (oral)
- OpenHands: An Open Platform for AI Software Developers as Generalist Agents. In Proceedings of the International Conference on Learning Representations (ICLR), 2025
2024
- Free Process Rewards without Process Labels. In Proceedings of the International Conference on Machine Learning (ICML), 2024
- PLUM: Preference Learning Plus Test Cases Yields Better Code Language Models. arXiv preprint, 2024
- Source-Aware Training Enables Knowledge Attribution in Language Models. In Proceedings of the Conference on Language Modeling (COLM), 2024
- Examining LLMs’ Uncertainty Expression Towards Questions Outside Parametric Knowledge. arXiv preprint, 2024
2022
- How Much Does Attention Actually Attend? Questioning the Importance of Attention in Pretrained Transformers. In Findings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
- Modeling Context With Linear Attention for Scalable Document-Level Translation. In Findings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
- Twist Decoding: Diverse Generators Guide Each Other. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
- ABC: Attention with Bounded-memory Control. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL), 2022
- Tailor: Generating and Perturbing Text with Semantic Controls. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL), 2022
2021
- Finetuning Pretrained Transformers into RNNs. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
- Random Feature Attention. In Proceedings of the International Conference on Learning Representations (ICLR), 2021 (spotlight)
- Deep Encoder, Shallow Decoder: Reevaluating Non-autoregressive Machine Translation. In Proceedings of the International Conference on Learning Representations (ICLR), 2021
- Contextualized Perturbation for Textual Adversarial Attack. In Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2021
- Infusing Finetuning with Semantic Dependencies. Transactions of the Association for Computational Linguistics (TACL), 2021
2020
- A Mixture of h - 1 Heads is Better than h Heads. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL), 2020
2019
- PaLM: A Hybrid Parser and Language Model. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019
- RNN Architecture Learning with Sparse Regularization. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019
- Text Generation with Exemplar-based Adaptive Decoding. In Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2019
2018
- Rational Recurrences. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2018
- Backpropagating through Structured Argmax using a SPIGOT. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL), 2018 (best paper honorable mention)
- Learning Joint Semantic Parsers from Disjoint Data. In Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2018
- "You Are No Jack Kennedy": On Media Selection of Highlights from Presidential Debates. In Proceedings of The Web Conference (WWW), 2018
2017
- Deep Multitask Learning for Semantic Dependency Parsing. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL), 2017
2016
- A Convolutional Attention Network for Extreme Summarization of Source Code. In Proceedings of the International Conference on Machine Learning (ICML), 2016
2015
- Discriminative Neural Sentence Modeling by Tree-Based Convolution. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2015
- Classifying Relations via Long Short Term Memory Networks along Shortest Dependency Paths. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2015