A Preliminary Study on Explaining Risk of Code Changes Using LLM-based Prediction Models
Y. Liu, K. Jabre, R. Abreu, Z. J. Carmichael, V. Murali, A. Patel, J. Ge, W. Sun
Senior Research Scientist · Meta
Model understanding for LLM agents, attribution, and large-scale ML debugging.
CS PhD · Boston Area
I work on opening the black box of generative AI: understanding what large models have learned, why they answered the way they did, and how to tell when their explanations are misleading. At Meta I build attribution and model-understanding workflows for LLMs and agentic systems — turning explanation into intervention by guiding prompt edits, compressing context, and large-scale model debugging. I maintain PyTorch Captum, the interpretability library used by the broader PyTorch community.
My PhD is from the University of Notre Dame's Computer Vision Research Lab (advisor Walter J. Scheirer); my dissertation, Explainable AI for High-Stakes Decision-Making, covers intrinsically interpretable models, the failure modes of post hoc explainers, and defenses against adversarial manipulation of explanations.
Y. Liu, K. Jabre, R. Abreu, Z. J. Carmichael, V. Murali, A. Patel, J. Ge, W. Sun
P. Chlenski, Z. J. Carmichael, A. Warikoo, J. Shao, Y. Ye, O. Yang, V. Miglani, N. Bandi
Z. J. Carmichael, T. Redgrave, D. G. Cedre, W. J. Scheirer
Z. J. Carmichael, S. Lohit, A. Cherian, M. J. Jones, W. J. Scheirer
Z. J. Carmichael, W. J. Scheirer
Z. J. Carmichael, W. J. Scheirer
The government's "dual use" definition has some surprises for the AI industry.
A deep dive into the tech behind the current AI boom.
Algorithmic audits can reveal harmful biases — but what happens when an auditee tries to hide its decision-making?