Build practical AI systems, by mapping general-purpose AIs to the right specific use cases.

Recent Publications

2024
What You Say = What You Want? Teaching Humans to Articulate Requirements for LLMs
Qianou Ma, Weirui Peng, Hua Shen, Kenneth Koedinger, Tongshuang Wu
ArXiv 2024
LLMs as Workers in Human-Computational Algorithms? Replicating Crowdsourcing Pipelines with LLMs
Tongshuang Wu, Haiyi Zhu, Maya Albayrak, Alexis Axon, Amanda Bertsch, Wenxing Deng, Ziqi Ding, Bill Guo, Sireesh Gururaja, Tzu-Sheng Kuo, Jenny T Liang, Ryan Liu, Ihita Mandal, Jeremiah Milbauer, Xiaolin Ni, Namrata Padmanabhan, Subhashini Ramkumar, Alexis Sudjianto, Jordan Taylor, Ying-Jui Tseng, Patricia Vaidos, Zhijin Wu, Wei Wu, Chenyang Yang
ArXiv 2024
Do LLMs Exhibit Human-Like Response Biases? A Case Study in Survey Design
Lindia Tjuatja, Valerie Chen, Tongshuang Wu, Ameet Talwalkar, Graham Neubig
TACL 2024
A Large Scale Audit of Dataset Licensing and Attribution in AI
Shayne Longpre, Robert Mahari, Anthony Chen, Naana Obeng-Marnu, Damien Sileo, William Brannon, Niklas Muennighoff, Nathan Khazam, Jad Kabbara, Kartik Perisetla, Xinyi (Alexis) Wu, Enrico Shippole, Kurt Bollacker, Tongshuang Wu, Luis Villa, Sandy Pentland, Deb Roy, Sara Hooker
Nature Machine Intelligence 2024
Wikibench: Community-Driven Data Curation for AI Evaluation on Wikipedia
Tzu-Sheng Kuo, Aaron Halfaker, Zirui Cheng, Jiwoo Kim, Meng-Hsin Wu, Tongshuang Wu, Ken Holstein, Haiyi Zhu
CHI 2024
How to Teach Programming in the AI Era? Using LLMs as a Teachable Agent for Debugging Best Paper
Qiaomu Ma, Hua Shen, Kenneth Koedinger, Tongshuang Wu
AIED 2024
Large Language Models Help Humans Verify Truthfulness – Except When They are Convincingly Wrong
Chenglei Si, Navita Goyal, Tongshuang Wu, Chen Zhao, Shi Feng, Hal Daumé III, Jordan Boyd-Graber
NAACL 2024
Better Synthetic Data by Retrieving and Transforming Existing Datasets
Saumya Gandhi, Ritu Gala, Vijay Viswanathan, Tongshuang Wu, Graham Neubig
ACL Findings 2024
Beyond Relevance: Evaluate and Improve Retrievers on Perspective Awareness
Xinran Zhao, Tong Chen, Sihao Chen, Hongming Zhang, Tongshuang Wu
CoLM 2024
"Merge Conflicts!" Exploring the Impacts of External Distractors to Parametric Knowledge Graphs
Cheng Qian, Xinran Zhao, Tongshuang Wu
CoLM 2024
Generating Situated Reflection Triggers About Alternative Solution Paths: A Case Study in Generative AI for Computer-Supported Collaborative Learning
Atharva Naik, Jessica Ruhan Yin, Anusha Kamath, Qianou Ma, Sherry Tongshuang Wu, Charles Murray, Majd Sakr, Carolyn P. Rose
AIED 2024
Selenite: Scaffolding Online Sensemaking with Comprehensive Overviews Elicited from Large Language Models
Michael Xieyang Liu, Tongshuang Wu, Tianying Chen, Franklin Mingzhe Li, Aniket Kittur, Brad A. Myers
CHI 2024
Self-Guide: Better Task-Specific Instruction Following via Self-Synthetic Finetuning
Chenyang Zhao, Xueying Jia, Vijay Viswanathan, Graham Neubig, Tongshuang Wu
CoLM 2024
What Is Wrong with My Model? Identifying Systematic Problems with Semantic Data Slicing
Chenyang Yang, Yining Hong, Grace A. Lewis, Tongshuang Wu, Christian Kästner
ASE 2024
Fact-and-Reflection (FaR) Improves Confidence Calibration of Large Language Models
Xinran Zhao, Hongming Zhang, Xiaoman Pan, Wenlin Yao, Dong Yu, Tongshuang Wu, Jianshu Chen
ACL Findings 2024
2023
Tool Learning with Foundation Models
Yujia Qin, Shengding Hu, Yankai Lin, Weize Chen, Ning Ding, Ganqu Cui, Zheni Zeng, Yufei Huang, Chaojun Xiao, Chi Han, Yi Ren Fung, Yusheng Su, Huadong Wang, Cheng Qian, Runchu Tian, Kunlun Zhu, Shihao Liang, Xingyu Shen, Bokai Xu, Zhen Zhang, Yining Ye, Bowen Li, Ziwei Tang5, Jing Yi, Yuzhang Zhu, Zhenning Dai, Lan Yan, Xin Cong, Yaxi Lu, Weilin Zhao, Yuxiang Huang, Junxi Yan, Xu Han, Xian Sun, Dahai Li, Jason Phang, Cheng Yang, Tongshuang Wu, Heng Ji, Zhiyuan Liu, Maosong Sun
Computing Surveys (Submitted) 2023
Bridging the Gap: A Survey on Integrating (Human) Feedback for Natural Language Generation
Patrick Fernandes, Aman Madaan, Emmy Liu, António Farinhas, Pedro Henrique Martins, Amanda Bertsch, José G. C. de Souza, Shuyan Zhou, Tongshuang Wu, Graham Neubig, André F. T. Martins
TACL 2023
Large Language Models Enable Few-Shot Clustering
Vijay Viswanathan, Kiril Gashteovski, Carolin Lawrence, Tongshuang Wu, Graham Neubig
TACL 2023
From Nuisance to News Sense: Augmenting the News with Cross-document Evidence and Context
Jeremiah Milbauer, Ziqi Ding, Zhijin Wu, Tongshuang Wu
EMNLP Demo Track 2023
Beyond Testers' Biases: Guiding Model Testing with Knowledge Bases using LLMs
Chenyang Yang, Rishabh Rustogi, Rachel Brower-Sinning, Grace Lewis, Christian Kaestner, Tongshuang Wu
EMNLP Findings 2023
DataFinder: Scientific Dataset Recommendation from Natural Language Descriptions
Vijay Viswanathan, Luyu Gao, Tongshuang Wu, Pengfei Liu, Graham Neubig
ACL 2023
Seeing Seeds Beyond Weeds: Green Teaming Generative AI for Beneficial Uses
Logan Stapleton, Jordan Taylor, Sarah Fox, Tongshuang Wu, Haiyi Zhu
ArXiv 2023
Promp2Model: Generating Deployable Models from Natural Language Instructions
Vijay Viswanathan, Chenyang Zhao, Amanda Bertsch, Tongshuang Wu, Graham Neubig
EMNLP Demo Track 2023
Synergi: A Mixed-Initiative System for Scholarly Synthesis and Sensemaking
Hyeonsu Kang, Tongshuang Wu, Joseph Chee Chang, Aniket Kittur
UIST 2023
BiasX: "Thinking Slow" in Toxic Content Moderation with Explanations of Implied Social Biases
Yiming Zhang, Sravani Nanduri, Liwei Jiang, Tongshuang Wu, Maarten Sap
EMNLP 2023
Capabilities for Better ML Engineering
Chenyang Yang, Rachel Brower-Sinning, Grace A. Lewis, Christian Kästner, Tongshuang Wu
AAAI SafeAI 2023
Is AI the Better Programming Partner? Human-Human Pair Programming vs. Human-AI pAIr Programming
Qianou Christina Ma, Tongshuang Wu, Kenneth Koedinger
AIED2023 Empowering Education with LLMs 2023
Measuring Adversarial Datasets
Yuanchen Bai, Raoyi Huang, Vijay Viswanathan, Tzu-Sheng Kuo, Tongshuang Wu
AACL ART of Safety 2023

Current Members

Christina Ma (PhD) ,
co-advisor: Ken Koedinger
Preparing Students for Effective Human-LLM Partnerships
Chenyang Yang (PhD) ,
co-advisor: Christian Kästner
Human-Centered ML Engineering
Xinran Zhao (PhD)
Information Seeking and Retrieval for Complex Tasks
Vijay Viswanathan (PhD) ,
co-advisor: Graham Neubig
Democratization of NLP Development and Evaluation
Jessie Mindel (PhD)
Simulated Agents and Collective Sensemaking
Yilin Zhang (Master)
Code Retrieval with AST
Jushaan Kalra (Master)
Multi-domain Retrieval
Cassandra Shi (Undergrad)
Requirement-driven LLMs

Alumni

Cheng Qian (Visit)
LLM hullucination. Now PhD student at UIUC.
Shaan Lehal (Undergrad)
LLM sensemaking copilot
Alina Chen (Undergrad)
LLM sensemaking copilot
Samriddhi Bhardwaj (Undergrad)
LLM sensemaking copilot
Alex Cheung (Undergrad)
LLM sensemaking copilot
Yashika Batra (Undergrad)
LLM sensemaking copilot
Atharva Naik (Master)
LLM in CS education. Now PhD student at CMU.
Raoyi (Cathy) Huang (Master)
NLP dataset characterization. Now PhD student at Cornell.
Yuanchen (Sophie) Bai (Master)
NLP dataset characterization
Yiyang (Diana) Wang (Master)
End-User Prompt Disambiguation. Now PhD student at Georgia Tech.