4Deploy model whose predictions most resemble the ensemble mean (opens in new tab)(github.com)1neehao4d ago1
7Dynamic E2E Agentic Simulation and Evaluation with Cypress (opens in new tab)(github.com)2neehao12d ago0
11The User Is Stochastic: Testing Agentic Systems with Simulation and Evaluation (opens in new tab)(gojiberries.io)1neehao18d ago1
12Slosizer: Right-size reserved LLM capacity Based on SLO (opens in new tab)(pypi.org)1neehao21d ago0
13Pass-Through of Tariffs: Evidence from European Wine Imports (opens in new tab)(nber.org)76neehao23d ago84