Research
I am broadly interested in natural language processing, specifically how to encourage factuality in language models.
Thanks to my amazing mentors and collaborators! ☺
* denotes equal contribution
|
OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs
Akari Asai, Jacqueline He*, Rulin Shao*, Weijia Shi, Amanpreet Singh, Joseph Chee Chang, Kyle Lo, Luca Soldaini, Sergey Feldman, Mike D'Arcy, David Wadden, Matt Latzke, Minyang Tian, Pan Ji, Shengyan Liu, Hao Tong, Bohao Wu, Yanyu Xiong, Luke Zettlemoyer, Graham Neubig, Dan Weld, Doug Downey, Wen-tau Yih, Pang Wei Koh, Hannaneh Hajishirzi
Preprint 2024
abstract /
paper /
code /
blog /
demo
Scientific progress depends on researchers' ability to synthesize the growing body of literature. Can large language models (LMs) assist scientists in this task? We introduce OpenScholar, a specialized retrieval-augmented LM that answers scientific queries by identifying relevant passages from 45 million open-access papers and synthesizing citation-backed responses. To evaluate OpenScholar, we develop ScholarQABench, the first large-scale multi-domain benchmark for literature search, comprising 2,967 expert-written queries and 208 long-form answers across computer science, physics, neuroscience, and biomedicine. On ScholarQABench, OpenScholar-8B outperforms GPT-4o by 5% and PaperQA2 by 7% in correctness, despite being a smaller, open model. While GPT-4o hallucinates citations 78 to 90% of the time, OpenScholar achieves citation accuracy on par with human experts. OpenScholar's datastore, retriever, and self-feedback inference loop also improve off-the-shelf LMs: for instance, OpenScholar-GPT4o improves GPT-4o's correctness by 12%. In human evaluations, experts preferred OpenScholar-8B and OpenScholar-GPT4o responses over expert-written ones 51% and 70% of the time, respectively, compared to GPT-4o's 32%. We open-source all of our code, models, datastore, and data, along with a public demo.
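To make the pipeline concrete, here is a minimal, hypothetical sketch of a retrieval-augmented answering loop with self-feedback in the spirit of the abstract; retrieve, generate, and critique are stand-in functions invented for illustration, not OpenScholar's actual API.

```python
# A schematic sketch of retrieval-augmented answering with a self-feedback loop.
# All three functions are hypothetical stand-ins, not OpenScholar's implementation.
def retrieve(query, k=5):
    # Stand-in: the real system searches a passage index over open-access papers.
    return [f"[passage about: {query}]"] * k

def generate(query, passages, feedback=None):
    # Stand-in for an LM call that writes a citation-backed answer from the passages.
    note = f" (revised per feedback: {feedback})" if feedback else ""
    return f"Answer to '{query}' citing {len(passages)} passages{note}"

def critique(answer):
    # Stand-in for the self-feedback step: the LM inspects its own draft and returns
    # a suggestion, or None when it is satisfied with the draft.
    return None if "revised" in answer else "support every claim with a citation"

def answer_with_feedback(query, max_rounds=3):
    passages, feedback = retrieve(query), None
    for _ in range(max_rounds):
        draft = generate(query, passages, feedback)
        feedback = critique(draft)
        if feedback is None:
            return draft
        passages += retrieve(feedback)   # optionally gather more evidence for the fix
    return draft

print(answer_with_feedback("How do retrieval-augmented LMs reduce hallucination?"))
```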
|
Scaling Retrieval-Based Language Models with a Trillion-Token Datastore
Rulin Shao, Jacqueline He, Akari Asai, Weijia Shi, Tim Dettmers, Sewon Min, Luke Zettlemoyer, Pang Wei Koh
NeurIPS 2024
abstract /
paper /
code
Scaling laws with respect to the amount of training data and the number of parameters allow us to predict the cost-benefit trade-offs of pretraining language models (LMs) in different configurations. In this paper, we consider another dimension of scaling: the amount of data available at inference time. Specifically, we find that increasing the size of the datastore used by a retrieval-based LM monotonically improves language modeling and several downstream tasks without obvious saturation, such that a smaller model augmented with a large datastore outperforms a larger LM-only model on knowledge-intensive tasks. By plotting compute-optimal scaling curves with varied datastore, model, and pretraining data sizes, we show that using larger datastores can significantly improve model performance for the same training compute budget. We carry out our study by constructing a 1.4 trillion-token datastore named MassiveDS, which is the largest and most diverse open-sourced datastore for retrieval-based LMs to date, and designing an efficient pipeline for studying datastore scaling in a computationally accessible manner. Finally, we analyze the effect of improving the retriever, datastore quality filtering, and other design choices on our observed scaling trends. Overall, our results show that datastore size should be considered an integral part of LM efficiency and performance trade-offs. To facilitate future research, we open-source our datastore and code at https://github.com/RulinShao/retrieval-scaling.
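As an illustration of the kind of sweep the abstract describes, the toy sketch below subsamples a datastore at increasing sizes and evaluates the same retrieval metric on each; the corpus, index, and recall metric are simplified stand-ins, and the real pipeline lives in the linked repository.

```python
# Toy datastore-scaling sweep: build an index over progressively larger subsamples
# and evaluate the same queries on each. Everything here is an illustrative stand-in.
import random

def build_index(passages):
    return list(passages)                      # stand-in for a dense or sparse index

def evaluate(index, queries):
    # Stand-in metric: fraction of queries with any matching passage in the index.
    hits = sum(any(q in p for p in index) for q in queries)
    return hits / len(queries)

datastore = [f"document {i} about topic {i % 50}" for i in range(10_000)]
queries = [f"topic {i}" for i in range(50)]

for frac in (0.01, 0.1, 0.5, 1.0):             # sweep datastore sizes
    subsample = random.sample(datastore, int(frac * len(datastore)))
    score = evaluate(build_index(subsample), queries)
    print(f"{len(subsample):>6} passages -> recall {score:.2f}")
```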
|
Challenges in Context-Aware Neural Machine Translation
Linghao Jin*, Jacqueline He*, Jonathan May, Xuezhe Ma
EMNLP 2023
abstract /
paper /
code
Context-aware neural machine translation, which leverages information beyond the sentence-level context to resolve inter-sentential discourse dependencies and improve document-level translation quality, has given rise to a number of recent techniques. However, despite well-reasoned intuitions, most context-aware translation models show only modest improvements over sentence-level systems. In this work, we investigate several challenges that impede progress in this field, relating to discourse phenomena, context usage, model architectures, and document-level evaluation. To address these problems, we propose a more realistic setting for document-level translation, called paragraph-to-paragraph (para2para) translation, and collect a new dataset of Chinese-English novels to promote future research.
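As a toy illustration of the para2para setting, the sketch below contrasts translating each sentence in isolation with translating a whole paragraph in one pass; the translate function is a hypothetical stand-in for an NMT model, not the systems studied in the paper.

```python
# Toy contrast between sentence-level and paragraph-to-paragraph (para2para) translation.
# `translate` is a hypothetical stand-in for an NMT model: in the sentence-level setting
# it never sees surrounding context, so inter-sentential dependencies (pronouns, ellipsis,
# lexical cohesion) cannot be resolved.
def translate(text, context=""):
    # Stand-in MT call; a real system would condition on `context` or the full paragraph.
    return f"<translation of: {text!r} | context tokens: {len(context.split())}>"

paragraph = ["他打开了那本书。", "它已经很旧了。"]   # "He opened the book." / "It was already very old."

# Sentence-level: each sentence is translated independently ("它"/"it" loses its antecedent).
sent_level = [translate(s) for s in paragraph]

# para2para: the whole source paragraph is translated as a single unit.
para_level = translate(" ".join(paragraph))

print(sent_level)
print(para_level)
```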
|
MABEL: Attenuating Gender Bias using Textual Entailment Data
Jacqueline He, Mengzhou Xia, Christiane Fellbaum, Danqi Chen
EMNLP 2022
abstract /
paper /
code
Pre-trained language models encode undesirable social biases, which are further exacerbated in downstream use. To address this, we propose MABEL (a Method for Attenuating Gender Bias using Entailment Labels), an intermediate pre-training approach for mitigating gender bias in contextualized representations. Key to our approach is the use of a contrastive learning objective on counterfactually augmented, gender-balanced entailment pairs from natural language inference (NLI) datasets. We also introduce an alignment regularizer that pulls identical entailment pairs along opposite gender directions closer together. We extensively evaluate our approach on intrinsic and extrinsic metrics, and show that MABEL outperforms previous task-agnostic debiasing approaches in terms of fairness. It also preserves task performance after fine-tuning on downstream tasks. Together, these findings demonstrate that NLI data is an effective means of bias mitigation, as opposed to the unlabeled sentences used in prior work. Finally, we find that existing approaches often use evaluation settings that are insufficient or inconsistent. We therefore reproduce and compare previous methods, and call for unified evaluation settings across gender debiasing methods to enable better future comparisons.
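Since the abstract describes the training objective only at a high level, here is a minimal sketch, under assumed simplifications, of what a contrastive loss over entailment pairs plus an alignment regularizer could look like; the random tensors, hyperparameters, and exact loss form are illustrative stand-ins, not MABEL's released code.

```python
# A minimal sketch (not the authors' implementation) of the two losses described in the
# abstract: an in-batch contrastive objective over entailment pairs, plus an alignment
# term between an example and its gender-swapped counterfactual. The encoder is omitted;
# random tensors stand in for sentence embeddings.
import torch
import torch.nn.functional as F

def contrastive_loss(premise_emb, hypothesis_emb, temperature=0.05):
    """InfoNCE over the batch: each premise should be closest to its own entailed hypothesis."""
    p = F.normalize(premise_emb, dim=-1)
    h = F.normalize(hypothesis_emb, dim=-1)
    logits = p @ h.T / temperature              # (B, B) cosine similarities
    labels = torch.arange(p.size(0))            # positives sit on the diagonal
    return F.cross_entropy(logits, labels)

def alignment_loss(orig_emb, counterfactual_emb):
    """Pull each embedding toward its gender-swapped counterpart."""
    diff = F.normalize(orig_emb, dim=-1) - F.normalize(counterfactual_emb, dim=-1)
    return diff.pow(2).sum(dim=-1).mean()

# Toy batch: random vectors stand in for encoder outputs of gender-balanced NLI pairs.
B, d = 8, 768
premises, hypotheses = torch.randn(B, d), torch.randn(B, d)
swapped_premises = premises + 0.01 * torch.randn(B, d)   # stand-in for counterfactual encodings
total = contrastive_loss(premises, hypotheses) + alignment_loss(premises, swapped_premises)
print(f"toy training loss: {total.item():.3f}")
```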
|
Can Rationalization Improve Robustness?
Howard Chen, Jacqueline He, Karthik Narasimhan, Danqi Chen
NAACL 2022
abstract /
paper /
code
A growing line of work has investigated the development of neural NLP models that can produce rationales, subsets of the input that explain their predictions. In this paper, we ask whether such rationale models can also provide robustness to adversarial attacks in addition to their interpretable nature. Since these models need to first generate rationales ("rationalizer") before making predictions ("predictor"), they have the potential to ignore noise or adversarially added text by simply masking it out of the generated rationale. To this end, we systematically generate various types of 'AddText' attacks for both token- and sentence-level rationalization tasks, and perform an extensive empirical evaluation of state-of-the-art rationale models across five different tasks. Our experiments reveal that rationale models show promise in improving robustness, but struggle in certain scenarios, such as when the rationalizer is sensitive to positional bias or to the lexical choices of the attack text. Further, leveraging human rationales as supervision does not always translate to better performance. Our study is a first step towards exploring the interplay between interpretability and robustness in the rationalize-then-predict framework.
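To make the rationalize-then-predict setup and an AddText-style attack concrete, here is a toy sketch; the word-overlap rationalizer and string-matching predictor are invented stand-ins, not the neural models evaluated in the paper.

```python
# Toy rationalize-then-predict pipeline under an "AddText"-style attack (appending a
# distractor sentence). Both components below are illustrative stand-ins only.
def rationalizer(question, sentences, k=1):
    """Select the k sentences with the highest word overlap with the question."""
    q = set(question.lower().split())
    ranked = sorted(sentences, key=lambda s: len(q & set(s.lower().split())), reverse=True)
    return ranked[:k]

def predictor(question, rationale):
    """Toy predictor: answer 'yes' if the rationale mentions the question's final word."""
    target = question.lower().rstrip("?").split()[-1]
    return "yes" if target in " ".join(rationale).lower() else "no"

question = "Is the Eiffel Tower in Paris?"
context = ["The Eiffel Tower is in Paris.", "It was completed in 1889."]
attacked = context + ["The Statue of Liberty is in New York."]   # adversarially added text

for name, sents in (("clean", context), ("attacked", attacked)):
    rationale = rationalizer(question, sents)
    print(name, rationale, "->", predictor(question, rationale))
# If the rationalizer masks out the added sentence, the prediction is unchanged under attack.
```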