Hamel Husain
banner
hamel.bsky.social
Hamel Husain
@hamel.bsky.social
evals evals evals. https://evals.info
👀 Animals have been assigned.

Scheduled to print fall 2026!

We have iterated on this with over 3k students (and continue to do so). We give our students access to the full draft as part of our evals course (link in bio)
November 8, 2025 at 5:03 PM
"Can I just get an LLM to do my error analysis?"

We get this question constantly. The answer is no, and trying is the fastest way to miss critical bugs.

Full podcast: youtu.be/BsWxPI9UM4c?...
October 3, 2025 at 9:12 PM
I recently sat down with Lenny Rachitsky to discuss why AI Evals are becoming the most sought after skill for product builders.

As a bonus, we step through an end-to-end example of building an eval in a spreadsheet so everyone can understand. See reply for links.
October 3, 2025 at 6:07 PM
Last chance to signup for this free lesson with OpenAI on evals, Including a sneak peek of their new eval products!

Link: maven.com/p/d2dc30/how...
June 8, 2025 at 2:44 PM
Can non-data scientists write AI Evals? The answer is nuanced and not just "Yes". @eugeneyan.com and I discuss this in the context of the "analyze-measure-improve" cycle from our course.

Links to more resources in the reply
May 17, 2025 at 12:18 AM
If you are writing evals without error analysis, our course AI Evals for Engineers & PMs is for you. Begins monday next week. Full syllabus in this link: maven.com/parlance-lab...
May 12, 2025 at 5:17 AM
GitHub CoPilot is one of the first commercially successful LLM products (predating ChatGPT). What was the secret? A robust eval suite!

In this lightning lesson, John Berryman will reveal the eval techniques (and mistakes) from working on this product

maven.com/p/da8264/how...
April 29, 2025 at 5:27 AM
I keep hearing about the emerging role of AI PM. How is this any different than a normal PM? Is it hype? We are gonna find out in this free lightning lesson. I will ask difficult questions. With @schof.bsky.social and Aman Khan

maven.com/p/544677/wha...
April 27, 2025 at 6:17 PM
Last chance to sign up for this. Recording sent to everyone who signs up. maven.com/p/29a33a/hyb...
April 16, 2025 at 12:59 PM
If you are building RAG applications, you don't want to miss this. Doug Turnbull is going to show you his tricks he's learned from a decade of optimizing retrieval in search systems, and how that transfers to RAG.

Link: maven.com/p/29a33a/hyb...
April 14, 2025 at 4:44 PM
Are you frustrated by how GitHub renders Jupyter notebooks? I have public service that renders GitHub notebooks with Quarto

nbsanity.com

It now works with gists!
December 7, 2024 at 7:42 PM
Just Incase someone else doesn’t know 😂
June 3, 2023 at 2:06 AM