Publications

("*" indicates equal contribution)

Ongoing Works

  1. [EAD-RLVR] {Chenghao Yang*, Lin Gui*, Chenxiao Yang*}, Victor Veitch, Lizhu Zhang, Zhuokai Zhao., Let it Calm: Exploratory Annealed Decoding for Verifiable Reinforcement Learning. [Arxiv] [Code]
  2. [LLM Branching Factor] Chenghao Yang, Sida Li, Ari Holtzman., LLM Probability Concentration: How Alignment Shrinks the Generative Horizon. [Arxiv] [Code]
  3. [LLM Persona Collapse] {Yunze Xiao*, Vivienne J. Zhang*}, Chenghao Yang, Ningshan Ma, Weihao Xuan, Jen-tse Huang., The Chameleon's Limit: Investigating Persona Collapse and Homogenization in Large Language Models. [paper] [Code]

Journal Papers

  1. {Chenghao Yang*, Xiangyang Mou*, Mo Yu*}, Bingsheng Yao, Xiaoxiao Guo, Saloni Potdar, Hui Su., Narrative Question Answering with Cutting-Edge Open-Domain QA Techniques: A Comprehensive Study, TACL 2021 [paper]
  2. Nabila El-Bassel, Karli R Hochstatter, Melissa Slavin, {Chenghao Yang*, Yudong Zhang*}, Smaranda Muresan., Harnessing the Power of Social Media to Understand the Impact of COVID-19 on People Who Use Drugs During Lockdown and Social Distancing. Journal of Addiction Medicine [PubMed Paper]

Conference Papers

  1. [BACo] {Yichen Wang*, Chenghao Yang*, Tenghao Huang*}, Muhao Chen, Jonathan May, Mina Lee., Optimizing Diversity and Quality through Base–Aligned Model Collaboration, ICML 2026. [paper]
  2. [AI-Realtor] {Jibang Wu*, Chenghao Yang*}, Yi Wu, Simon Mahns, Chaoqi Wang, Hao Zhu, Fei Fang, Haifeng Xu., AI Realtor: Towards Grounded Persuasive Language Generation for Automated Copywriting, CAIS 2026. [paper] [Code and Data]
  3. Suho Shin, Chenghao Yang, Haifeng Xu, MohammadTaghi Hajiaghayi., Tokenized Bandit for LLM Decoding and Alignment, ICML 2025. [paper]
  4. {Yanhong Li*, Chenghao Yang*}, Allyson Ettinger., When Hindsight is Not 20/20: Testing Limits on Reflective Thinking in Large Language Models, NAACL 2024 Findings [paper] [code]
  5. {Chenghao Yang*, Tuhin Chakrabarty*}, Karli R Hochstatter, Melissa N Slavin, Nabila El-Bassel, Smaranda Muresan., Identifying Self-Disclosures of Use, Misuse and Addiction in Community-based Social Media Posts, NAACL 2024 Findings [paper] [code]
  6. Chaoqi Wang, Yibo Jiang, Chenghao Yang, Han Liu, Yuxin Chen., Beyond Reverse KL: Generalizing Direct Preference Optimization with Diverse Divergence Constraints, ICLR 2024 Spotlight [paper]
  7. Chenghao Yang, Allyson Ettinger., Can You Follow Me? Testing Situational Understanding in ChatGPT, EMNLP 2023 (Award Nomination) [paper] [code]
  8. Chenghao Yang, Fan Yin, He He, Kai-Wei Chang, Xiaofei Ma and Bing Xiang., Efficient Shapley Values Estimation by Amortization for Text Classification, ACL 2023 [paper] [code] [video]
  9. {Shiqi Wang*, Zheng Li*}, Haifeng Qian, Chenghao Yang, Zijian Wang, Mingyue Shang, Varun Kumar, Samson Tan, Baishakhi Ray, Parminder Bhatia, Ramesh Nallapati, Murali Krishna Ramanathan, Dan Roth and Bing Xiang., ReCode: Robustness Evaluation of Code Generation Models, ACL 2023 (Best Paper Award Nomination) [paper] [codebase] [DL4C @ ICLR 2023 Version]
  10. Chenghao Yang, Xuezhe Ma., Improving Stability of Fine-Tuning Pretrained Language Models via Component-Wise Gradient Norm Clipping, EMNLP 2022 [paper] [codebase]
  11. Chenghao Yang, Hongyuan Mei, Jason Eisner., Transformer Embeddings of Irregularly Spaced Events and Their Participants, ICLR 2022 [full paper] [codebase]
  12. Chenghao Yang, Yudong Zhang, Smaranda Muresan., Weakly-Supervised Methods for Suicide Risk Assessment: Role of Related Domains, ACL 2021 (Short) [paper] [codebase]
  13. {Chenghao Yang*, Yuan Zang*, Fanchao Qi*}, Zhiyuan Liu, Meng Zhang, Qun Liu, Maosong Sun., Word-level Textual Adversarial Attacking as Combinatorial Optimization, ACL 2020 (Long) [paper] [codebase]
  14. {Fanchao Qi*, Junjie Huang*}, Chenghao Yang, Zhiyuan Liu et al., Modeling Semantic Compositionality with Sememe Knowledge, ACL 2019 (Long & Oral) [paper] [codebase]

Workshop Papers

  1. Chaoqi Wang, Yibo Jiang, Chenghao Yang, Han Liu, Yuxin Chen., Beyond Reverse KL: Generalizing Direct Preference Optimization with Diverse Divergence Constraints, SoLAR@NeurIPS 2023, Instruction@NeurIPS 2023 [paper]
  2. {Chenghao Yang*, Yuhui Zhang*, Zhengping Zhou*}, Zhiyuan Liu., Enhancing Transformer with Sememe Knowledge, RepL4NLP@ACL 2020 [paper]
  3. Xiangyang Mou, Mo Yu, Bingsheng Yao, Chenghao Yang, Xiaoxiao Guo, Saloni Potdar, Hui Su., Frustratingly Hard Evidence Retrieval for QA Over Books, NUSE@ACL 2020 [paper]