Zhiqing Sun
孙之清

Research Scientist
at OpenAI
PhD Candidate at
CMU -> SCS -> LTI
Hey there, welcome!
I am a final-year Ph.D. candidate at CMU LTI, advised by Prof. Yiming Yang. My research is generously supported by the Google PhD Fellowship in Natural Language Processing (2023) and the OpenAI Superalignment Fast Grants (2024). I received my B.S. in Computer Science from Peking University.
Update (Feb 2025): I’ve joined OpenAI, where I trained the LLM that powers Deep Research, our latest AI agent that Sam estimates can do “a single-digit percentage of all economically valuable tasks in the world.”
Besides Deep Research, my recent research interests include scalable reasoning beyond human supervision, scalable alignment, and scalable training of more capable and trustworthy AI agents.
Note: The following content may be outdated.
Research Interests
I am generally interested in machine learning and artificial intelligence. My recent research focuses on scalable alignment of foundation models. I am particularly interested in enhancing the reliability of foundation models, including large language models (LLMs) and large multimodal models (LMMs), through minimal human supervision and scalable oversight. This can be achieved using human-defined principles, factual feedback from real-world interactions, or easy-to-hard generalization. A few of my recent projects include:
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision: Guided by the observation that evaluation is easier than generation, we enabled large language models to excel on hard math problems beyond human evaluation capabilities through the easy-to-hard generalization of evaluators (e.g., process reward models).
- SALMON: Self-Alignment with Instructable Reward Models: We developed an Instructable Reward Model that helps RLAIF fully replace RLHF to align language models from scratch (enhancing both their alignment and capabilities)!
- Aligning Large Multimodal Models with Factually Augmented RLHF: We proposed Factually Augmented RLHF (Fact-RLHF) that augments the reward model with additional factual information to alleviate the reward hacking phenomenon in RLHF.
News
- Apr. 2024: Received the OpenAI Superalignment Fast Grants ($100,000) to support our research on easy-to-hard generalization.
- Feb. 2024: Gave an invited talk at UIUC on scalable alignment.
- Jan. 2024: TAing for 11-741 Machine Learning with Graphs.
- Nov. 2023: Gave an invited talk at Caltech on scalable alignment, hosted by Prof. Yisong Yue.
- Oct. 2023: Selected as a 2023 Rising Star in Data Science and gave a talk on scalable alignment at the Rising Stars workshop at UChicago.
- Sept. 2023: Received the Microsoft Accelerate Foundation Models Research (AFMR) Initiative ($20,000).
- Sept. 2023: Received the Google PhD Fellowship in Natural Language Processing.
Education
Language Technologies Institute, Carnegie Mellon University
- Aug. 2019 - Present, M.S. / Ph.D. in Language Technologies
School of Electrical Engineering & Computer Science (EECS), Peking University
- Sept. 2015 - July 2019, B.S. in Computer Science (Summa Cum Laude)
Selected Publications
For a more complete list, including preprints, see the publications page or my Google Scholar page.
(*=equal contribution)
(Internship) Experience
- Allen Institute for Artificial Intelligence (AI2), Spring 2024
- MIT-IBM Watson AI Lab, Summer 2023
- Google Brain, Summer 2022
- Google Brain, Summer 2019
- Microsoft Research Asia, Spring 2019
- Mila & University of Montreal, Summer 2018