Ameet Talwalkar

I am an associate professor in the Machine Learning Department at CMU and Chief Scientist at Datadog. My current research interests include ML for science, human-AI interaction, and developing specialized foundation models and agents. Here is my Google scholar page and formal bio.

Selected Recent Work

Copilot Arena: A Platform for Code LLM Evaluation in the Wild (pdf)
W. Chi, V. Chen, A. Angelopoulos, W. Chiang, A. Mittal, N. Jain, T. Zhang, I. Stoica, C. Donahue, A. Talwalkar

Specialized Foundation Models Struggle to Beat Supervised Baselines (pdf)
Z. Xu, R. Gupta, W. Cheng, A. Shen, J. Shen, A. Talwalkar, M. Khodak
International Conference on Learning Representations (ICLR), 2025

The RealHumanEval: Evaluating Large Language Models' Abilities to Support Programmers (pdf)
H. Mozannar, V. Chen, M. Alsobay, S. Das, S. Zhao, D. Wei, M. Nagireddy, P. Sattigeri, A. Talwalkar, D. Sontag
Transactions on Machine Learning Research (TMLR), 2025

ScribeAgent: Towards Specialized Web Agents Using Production-Scale Workflow Data (pdf, blog)
J. Shen, A. Jain, Z. Xiao, I. Amlekar, M. Hadji, A. Podolny, A. Talwalkar

UPS: Towards Foundation Models for PDE Solving via Cross-Modal Adaptation (pdf)
J. Shen, T. Marwah, A. Talwalkar
Transactions on Machine Learning Research (TMLR), 2024

L2G: Repurposing Language Models for Genomics Tasks (pdf)
W. Cheng, J. Shen, M. Khodak, J. Ma, A. Talwalkar

Cross-Modal Fine-Tuning: Align then Refine (pdf)
J. Shen, L. Li, L. Dery, C. Staten, M. Khodak, G. Neubig, A. Talwalkar
International Conference on Machine Learning (ICML), 2023

Zeno: An Interactive Framework for Behavioral Evaluation of Machine Learning (pdf, website, blog)
A. Cabrera, E. Fu, D. Bertucci, K. Hostein, A. Talwalkar, J. Hong, A. Perer
Conference on Human Factors in Computing Systems (CHI), 2023

Efficient Architecture Search for Diverse Tasks (pdf, blog)
J. Shen, M. Khodak, A. Talwalkar
Neural Information Processing Systems (NeurIPS), 2022

Gradient Descent on Neural Networks Typically Occurs at the Edge of Stability (pdf)
J. Cohen, S. Kaur, Y. Li, Z. Kolter, A. Talwalkar
International Conference on Learning Representations (ICLR), 2021