Lightnews — Scholar-powered news

In our latest Evals Series webinar, we covered how to evaluate your evaluator.

AI development has two loops, Meta-evaluation lives in the inner loop.

We also walked through a live demo of this loop in practice, iteratively improving the judge and showing measurable gains at each step.

December 12, 2025 at 7:08 PM

Reposted

arize-phoenix

@arize-phoenix.bsky.social

Check out this walkthrough on bringing observability and evals into LLM workflows, plus a Phoenix demo with helpful context for anyone building agents in TypeScript.

Watch the session below 👇

srichavali.bsky.social @srichavali.bsky.social · 9d

Spoke at @arize.bsky.social’s AI Builder Meetup a few weeks back & the talk is now live!

Covered the basics of observability + evals, and showed via a Mastra agent how to set up tracing, run evals, & start your iteration cycle.

Check it out here 🚀
www.youtube.com/watch?v=qQGQ...

TypeScript Agents: How To Build and Evaluate

YouTube video by Arize AI

www.youtube.com

December 4, 2025 at 7:24 PM

srichavali.bsky.social

@srichavali.bsky.social

Spoke at @arize.bsky.social’s AI Builder Meetup a few weeks back & the talk is now live!

Covered the basics of observability + evals, and showed via a Mastra agent how to set up tracing, run evals, & start your iteration cycle.

Check it out here 🚀
www.youtube.com/watch?v=qQGQ...

TypeScript Agents: How To Build and Evaluate

YouTube video by Arize AI

www.youtube.com

December 4, 2025 at 7:23 PM

srichavali.bsky.social

@srichavali.bsky.social

Learn to prompt better

May 7, 2025 at 7:26 PM

srichavali.bsky.social

@srichavali.bsky.social

🚀 Phoenix now supports MCP (Model Context Protocol)

This lets tools like Claude query prompts, datasets, and experiment results directly from a Phoenix instance (cloud or self-hosted).

Check out our docs to learn more about how to spin it up 👇

Here’s how it works:
www.youtube.com/watch?v=mHeZ...

Model Context Protocol & Arize Phoenix Integration

YouTube video by Arize AI

www.youtube.com

April 17, 2025 at 7:24 PM

Reposted

John Gilhuly

@johngilhuly.bsky.social

🤖 Building agents, but not sure how to measure their performance?

Our newest blog post on @hf.co has you covered!

This post shows you how to use @arize-phoenix.bsky.social to trace and evaluate your smolagents.

Credit to @srichavali.bsky.social and @aymeric-roucher.bsky.social

February 28, 2025 at 5:19 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news