srichavali.bsky.social
@srichavali.bsky.social
Reposted
In our latest Evals Series webinar, we covered how to evaluate your evaluator.

AI development has two loops, and meta-evaluation lives in the inner loop.

We also walked through a live demo of this loop in practice, iteratively improving the judge and showing measurable gains at each step.
December 12, 2025 at 7:08 PM
Reposted
Check out this walkthrough on bringing observability and evals into LLM workflows, plus a Phoenix demo with helpful context for anyone building agents in TypeScript.

Watch the session below 👇
December 4, 2025 at 7:24 PM
Spoke at @arize.bsky.social’s AI Builder Meetup a few weeks back & the talk is now live!

Covered the basics of observability + evals, and showed via a Mastra agent how to set up tracing, run evals, & start your iteration cycle.

Check it out here 🚀
www.youtube.com/watch?v=qQGQ...
TypeScript Agents: How To Build and Evaluate
YouTube video by Arize AI
December 4, 2025 at 7:23 PM
Learn to prompt better
May 7, 2025 at 7:26 PM
🚀 Phoenix now supports MCP (Model Context Protocol)

This lets tools like Claude query prompts, datasets, and experiment results directly from a Phoenix instance (cloud or self-hosted).

Check out our docs to learn more about how to spin it up 👇

Here’s how it works:
www.youtube.com/watch?v=mHeZ...
Model Context Protocol & Arize Phoenix Integration
YouTube video by Arize AI
April 17, 2025 at 7:24 PM
Reposted
🤖 Building agents, but not sure how to measure their performance?

Our newest blog post on @hf.co has you covered!

This post shows you how to use @arize-phoenix.bsky.social to trace and evaluate your smolagents.

Credit to @srichavali.bsky.social and @aymeric-roucher.bsky.social
February 28, 2025 at 5:19 PM