Why it matters
AI Engineer session on Turning Fails into Features: Zapier’s Hard-Won Eval Lessons, presented by Rafal Willinski, Vitor Balocco, Zapier. It adds practical context for how teams are building and operating AI systems in production.
My takeaway: Turning Fails into Features: Zapier’s Hard-Won Eval Lessons — Rafal Willinski, Vitor Balocco, Zapier is a model-evaluation signal. The practical read is to tie capability claims to evidence, launch criteria, and regression tests rather than relying on demos or benchmark headlines.