A framework for comprehensive diagnosis and optimization of agents using simulated, realistic synthetic interactions
Contributions:2 PRs, 42 pushes, 1 branch in 2 months
agentevaluationllmopssimulatorsynthetic-data
Contributions:4 commits, 3 pushes, 1 branch in 2 months