publications
2024
- PLUM: Preference Learning Plus Test Cases Yields Better Code Language ModelsarXiv preprint, 2024
- Source-Aware Training Enables Knowledge Attribution in Language ModelsIn Proceedings of the Conference on Language Modeling (COLM), 2024
- Examining LLMs’ Uncertainty Expression Towards Questions Outside Parametric KnowledgearXiv preprint, 2024
2023
2022
- How Much Does Attention Actually Attend? Questioning the Importance of Attention in Pretrained TransformersIn Findings of the Association for Computational Linguistics: EMNLP 2022, 2022
- Modeling Context With Linear Attention for Scalable Document-Level TranslationIn Findings of the Association for Computational Linguistics: EMNLP 2022, 2022
- Twist Decoding: Diverse Generators Guide Each OtherIn Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
- ABC: Attention with Bounded-memory ControlIn Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL), 2022
- Tailor: Generating and Perturbing Text with Semantic ControlsIn Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL), 2022
2021
- Finetuning Pretrained Transformers into RNNsIn In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
- spotlightRandom Feature AttentionIn Proceedings of the International Conference on Learning Representations (ICLR), 2021
- Deep Encoder, Shallow Decoder: Reevaluating Non-autoregressive Machine TranslationIn Proceedings of the International Conference on Learning Representations (ICLR), 2021
- Contextualized Perturbation for Textual Adversarial AttackIn Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2021
- Infusing Finetuning with Semantic DependenciesTransactions of the Association for Computational Linguistics (TACL), 2021
2020
- A Mixture of h - 1 Heads is Better than h HeadsIn Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL), 2020
2019
- PaLM: A Hybrid Parser and Language ModelIn Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019
- RNN Architecture Learning with Sparse RegularizationIn Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019
- Text Generation with Exemplar-based Adaptive DecodingIn Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2019
2018
- Rational RecurrencesIn Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2018
- best paper
honorable mentionBackpropagating through Structured Argmax using a SPIGOTIn Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL), 2018 - Learning Joint Semantic Parsers from Disjoint DataIn Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2018
- "You Are No Jack Kennedy": On Media Selection of Highlights from Presidential DebatesIn Proceedings of The Web Conference (WWW), 2018
2017
- Deep Multitask Learning for Semantic Dependency ParsingIn Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL), 2017
2016
- A Convolutional Attention Network for Extreme Summarization of Source CodeIn Proceedings of the International Conference on Machine Learning (ICML), 2016
2015
- Discriminative Neural Sentence Modeling by Tree-Based ConvolutionIn Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2015
- Classifying Relations via Long Short Term Memory Networks along Shortest Dependency PathsIn Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2015