ASTRA-bench: Evaluating Tool-Use Agent Reasoning and Action Planning with Personal User Context

Published in arXiv, 2026

Recommended citation: Zidi Xiu, David Q. Sun, Kevin Cheng, Maitrik Patel, Josh Date, Yizhe Zhang, Jiarui Lu, Omar Attia, Raviteja Vemulapalli, Oncel Tuzel, Meng Cao, Samy Bengio https://arxiv.org/abs/2603.01357

Download paper here

Recommended citation:

@article{xiu2026astrabench,
  title={ASTRA-bench: Evaluating Tool-Use Agent Reasoning and Action Planning with Personal User Context},
  author={Xiu, Zidi and Sun, David Q. and Cheng, Kevin and Patel, Maitrik and Date, Josh and Zhang, Yizhe and Lu, Jiarui and Attia, Omar and Vemulapalli, Raviteja and Tuzel, Oncel and Cao, Meng and Bengio, Samy},
  journal={arXiv preprint arXiv:2603.01357},
  year={2026}
}