10:20 - 10:45
AI for PM & Designers

FR

Long Talk (25min)

FR

Evals: the foundation of reliable AI products

Description

When building AI features, you can’t anticipate everything upfront. Real user interactions expose unexpected inputs and failures that only appear in production, leaving teams guessing what to fix and whether their changes actually improve the product.

In this talk, we’ll explore why datasets and evals have become the foundation of AI product development. You’ll learn how leading teams use evals to identify failures, guide their roadmap, prevent regressions, and build fast feedback loops.This shift changes how products are built, from static specifications to continuous improvement. Because in the age of AI, the teams that win are those with the fastest learning loops.

En savoir plus