MLflow
banner
mlflow.org
MLflow
@mlflow.org
80 followers 6 following 200 posts
An open source machine learning platform for managing the complete ML lifecycle
Posts Media Videos Starter Packs
Two weeks from today — don’t miss it! 🙌

📣 Join the next 𝗠𝗟𝗳𝗹𝗼𝘄 𝗖𝗼𝗺𝗺𝘂𝗻𝗶𝘁𝘆 𝗠𝗲𝗲𝘁𝘂𝗽 on November 12 to explore the latest in AI observability and agent tracing.

Featuring two exciting talks:
🔹 𝗖𝗹𝗮𝘂𝗱𝗲 𝗖𝗼𝗱𝗲 & 𝗔𝗴𝗲𝗻𝘁 𝗦𝗗𝗞 𝗧𝗿𝗮𝗰𝗶𝗻𝗴
🔹 𝗢𝗽𝗲𝗻𝗧𝗲𝗹𝗲𝗺𝗲𝘁𝗿𝘆 (𝗢𝗧𝗘𝗟) 𝗦𝘂𝗽𝗽𝗼𝗿𝘁

🔗 RSVP: luma.com/mlflowmeetup...

#mlflow #oss
MLflow Community Meetup | November 2025 · Luma
November MLflow Community Meetup We'll dive into: 🔹 Claude Code & Claude Agent SDK Tracing Building agentic applications is complex, but it doesn't have to…
luma.com
Building agentic applications is complex, but it doesn’t have to be slow.

In this blog, Samraj Moorjani shows how to go from idea to measurable results in hours using the Claude Agent SDK with MLflow for observability and evaluation.

🔗 Dive in: mlflow.org/blog/mlflow-...

#MLflow #agentic
Rapidly Prototype and Evaluate Agents with Claude Agent SDK and MLflow | MLflow
How to quickly prototype an agent using the Claude Agent SDK then instrument and evaluate it with MLflow
mlflow.org
📣 Join the next 𝗠𝗟𝗳𝗹𝗼𝘄 𝗖𝗼𝗺𝗺𝘂𝗻𝗶𝘁𝘆 𝗠𝗲𝗲𝘁𝘂𝗽 on November 12 to explore the latest in AI observability and agent tracing — featuring two exciting talks:

🔹 𝗖𝗹𝗮𝘂𝗱𝗲 𝗖𝗼𝗱𝗲 & 𝗔𝗴𝗲𝗻𝘁 𝗦𝗗𝗞 𝗧𝗿𝗮𝗰𝗶𝗻𝗴
🔹 𝗢𝗽𝗲𝗻𝗧𝗲𝗹𝗲𝗺𝗲𝘁𝗿𝘆 (𝗢𝗧𝗘𝗟) 𝗦𝘂𝗽𝗽𝗼𝗿𝘁

🔗 RSVP: luma.com/mlflowmeetup...

#MLflow #opensource #oss #genai
MLflow Community Meetup | November 2025 · Luma
November MLflow Community Meetup We'll dive into: 🔹 Claude Code & Claude Agent SDK Tracing Building agentic applications is complex, but it doesn't have to…
luma.com
⏰ Don’t forget — MLflow Office Hours are this Wednesday, October 22 at 8AM PT on Zoom!

Come for:
🔹 Real-time MLflow troubleshooting and guidance
🔹 Best practices for managing LLM & GenAI experiments
🔹 A look ahead at new MLflow features

🔗 RSVP: luma.com/officehours1...

#opensource #oss #MLflow
MLflow Office Hours | October 22 · Zoom · Luma
luma.com
With that metadata—and MLflow’s MCP features released recently—the judge can make tool calls to MLflow to search spans and query different aspects of the trace.

Office Hours: Wed, Oct 22: luma.com/officehours1...

#mlflow #agenticjudges #llm #genai
MLflow Office Hours | October 22 · Zoom · Luma
luma.com
In this mode, the judge gets the MLflow trace info object: input to the call, output, and basically the root span ID for that trace.
Missed last week’s #MLflow Community Meetup? Check out Ben Wilson on agentic judges: “The judge no longer works as an LLM as a judge—it actually works as an agent as a judge.”

🎥 Full video: www.youtube.com/live/bkMabn8...

#opensource #oss #agenticjudges
⚡ In this lightning talk at MLOps World, Danny Chiao tackled a top agent challenge: ensuring high quality output.

Rather than labeling and analyzing traces by hand, MLflow makes it easy to log, evaluate, and iterate faster—using techniques leading companies rely on to deploy agents in production. ✅
🚨 Reminder: MLflow Community Meetup is tomorrow, Oct 8 at 4:00 PM PT!

We'll explore trace‑aware, feedback‑aligned judges and versioned eval datasets in MLflow. You don't wait to miss it!

🎥 LIVE on LinkedIn, YouTube & X
🔗 RSVP: luma.com/mlflow-1001

#opensource #oss #mlflow
Building better LLM evals? Ben Wilson highlights how frameworks like #DSPy boost judge prompts—and reliability—as models evolve.

Tips for judge reproducibility/reliability: use reproducible pipelines, re-tune logic as endpoints change, & standardize. ✅

🎥 Watch more: www.youtube.com/live/HTxpmnO...
🚀 Headed to MLOps World | GenAI Summit 2025 next week? Don’t miss an exciting lightning talk from Danny Chiao, Engineering Lead at Databricks!

🎤 𝗧𝗲𝗰𝗵𝗻𝗶𝗾𝘂𝗲𝘀 𝘁𝗼 𝗯𝘂𝗶𝗹𝗱 𝗵𝗶𝗴𝗵 𝗾𝘂𝗮𝗹𝗶𝘁𝘆 𝗮𝗴𝗲𝗻𝘁𝘀 𝗳𝗮𝘀𝘁𝗲𝗿 𝘄𝗶𝘁𝗵 𝗠𝗟𝗳𝗹𝗼𝘄

🗓️ October 9
📍 Austin, TX
🔗 Learn more: mlopsworld.com#agenda

#MLflow #GenAI #MLOps #LLM
🚨 RESCHEDULED: Wednesday, October 8

The next MLflow Community Meetup happens NEXT Wednesday, Oct 8 at 4PM PT—and you won’t want to miss it.

RSVP here: luma.com/mlflow-1001

#opensource #oss #mlflow
Our next MLflow Community Meetup is happening TODAY—Wednesday, October 1 at 4PM PT! 🙌

Don’t miss this chance to connect and learn:
🔹 Smarter Evaluations with Trace-Aware, Feedback-Aligned Judges
🔹 Keeping Eval Datasets Relevant as Your App Evolves

✅ RSVP: luma.com/mlflow-1001

#oss #mlflow #genai
MLflow Community Meetup · Luma
Join us for the next MLflow Community Meetup on October 1 at 4PM PT! Ben Wilson, MLflow Maintainer, will dive deep into: Building Smarter Evals with…
luma.com
MLflow @mlflow.org · Sep 30
🚀 The fifth “Invoice Extraction with OpenAI + MLflow” session is now available! #MLflow Ambassador Shrinath Suresh dives into designing a custom scorer to evaluate invoice extraction models beyond just labels or LLM-as-a-judge.

🎥 youtu.be/SmuhOmOYXSg?...
📖 medium.com/@shrinath.su...

#opensource
MLflow @mlflow.org · Sep 29
🚨 Just 2 days away!

The next #MLflow Community Meetup happens this Wednesday, Oct 1 at 4PM PT—and you won’t want to miss it.

We will cover:
🔹 𝗕𝘂𝗶𝗹𝗱𝗶𝗻𝗴 𝗦𝗺𝗮𝗿𝘁𝗲𝗿 𝗘𝘃𝗮𝗹𝘀 𝘄𝗶𝘁𝗵 𝗧𝗿𝗮𝗰𝗲-𝗔𝘄𝗮𝗿𝗲, 𝗙𝗲𝗲𝗱𝗯𝗮𝗰𝗸-𝗔𝗹𝗶𝗴𝗻𝗲𝗱 𝗝𝘂𝗱𝗴𝗲𝘀
🔹 𝗞𝗲𝗲𝗽𝗶𝗻𝗴 𝗘𝘃𝗮𝗹 𝗗𝗮𝘁𝗮𝘀𝗲𝘁𝘀 𝗥𝗲𝗹𝗲𝘃𝗮𝗻𝘁 𝗮𝘀 𝗬𝗼𝘂𝗿 𝗔𝗽𝗽 𝗖𝗵𝗮𝗻𝗴𝗲𝘀

RSVP 👉 luma.com/mlflow-1001

#oss
MLflow Community Meetup · Luma
Join us for the next MLflow Community Meetup on October 1 at 4PM PT! Ben Wilson, MLflow Maintainer, will dive deep into: Building Smarter Evals with…
luma.com
MLflow @mlflow.org · Sep 23
Episode 3 of Invoice Extraction is live! 🚀

#MLflow Ambassador Shrinath Suresh explores prompt versioning, comparing, automating workflows, and effective reuse with MLflow’s Prompt Registry.

🎥 Watch: youtube.com/watch?v=fau8...
📖 Read: medium.com/@shrinath.su...

#opensource #oss #genai
#3 Prompt Engineering & MLflow Prompt Management
YouTube video by TheAIGuy
youtube.com
MLflow @mlflow.org · Sep 18
Businesses run on invoices. Getting structured data from them—fast and accurate—is critical.

🧩 GPT-5 is powerful for extraction
⚙️ MLflow 3 makes it repeatable & traceable

Demo + blog from #MLflow Ambassador Shrinath Suresh! 👇

▶️ youtu.be/E5GSWLhI5uA
📝 medium.com/@shrinath.su...

#opensource #oss
Introduction to invoice extraction using OpenAI GPT5 and MLflow 3.3
YouTube video by TheAIGuy
youtu.be
MLflow @mlflow.org · Sep 17
🚀 Join the next MLflow Community Meetup on Oct 1 at 4PM PT!

🔹 𝗕𝘂𝗶𝗹𝗱𝗶𝗻𝗴 𝗦𝗺𝗮𝗿𝘁𝗲𝗿 𝗘𝘃𝗮𝗹𝘀 𝘄𝗶𝘁𝗵 𝗧𝗿𝗮𝗰𝗲-𝗔𝘄𝗮𝗿𝗲, 𝗙𝗲𝗲𝗱𝗯𝗮𝗰𝗸-𝗔𝗹𝗶𝗴𝗻𝗲𝗱 𝗝𝘂𝗱𝗴𝗲𝘀
🔹 𝗞𝗲𝗲𝗽𝗶𝗻𝗴 𝗘𝘃𝗮𝗹 𝗗𝗮𝘁𝗮𝘀𝗲𝘁𝘀 𝗥𝗲𝗹𝗲𝘃𝗮𝗻𝘁 𝗮𝘀 𝗬𝗼𝘂𝗿 𝗔𝗽𝗽 𝗖𝗵𝗮𝗻𝗴𝗲𝘀

Bring your questions about dataset management, evaluation workflows! ✅

RSVP 👉 luma.com/mlflow-1001

#opensource #oss
MLflow Community Meetup · Luma
Join us for the next MLflow Community Meetup on October 1 at 4PM PT! Ben Wilson, MLflow Maintainer, will dive deep into: Building Smarter Evals with…
luma.com
MLflow @mlflow.org · Sep 15
This blog highlights how MLflow’s #GenAI capabilities streamline development of an LLM-based Optical Character Recognition (OCR) tool. These capabilities reduce friction, accelerate workflows, and deliver value to both technical and non-technical contributors.

🚀 Dive in: mlflow.org/blog/mlflow-...
MLflow @mlflow.org · Sep 11
This blog looks at the “Coffee Machine” approach: global teams set up standardized #ML pipelines, & local teams adapt them using their own #data. ☕

#MLflow supports every step, making it possible to track changes, register model variants, & maintain reproducibility.

🔗 medium.com/dscier/brewi...
MLflow @mlflow.org · Sep 9
📣 Happening Tomorrow — MLflow Office Hours!

Join #MLflow maintainers for a live Q&A session! Whether you’re running MLflow in production or experimenting with LLMs & GenAI, this is your chance to bring real challenges and get direct feedback.

🕒 Sept 10 @ 3PM SGT
🎟 RSVP: lu.ma/mlflow-910

#oss
MLflow @mlflow.org · Sep 5
The real unlock → MLflow’s tracing integration. Every tool call + reasoning step gets captured and replayable. When an agent fails, you can see why—not guess. Critical for debugging multi-step chains + production bottlenecks.

🔗 Learn more: mlflow.org/docs/latest/...

#LLMOps #AI #MLflow #oss
Evaluating Agents | MLflow
AI Agents are an emerging pattern of GenAI applications that can use tools, make decisions, and execute multi-step workflows. However, evaluating the performance of those complex agents is challenging...
mlflow.org
MLflow @mlflow.org · Sep 5
MLflow lets you create custom scorers for agent behavior: did it use the right tool, in the right order, with proper reasoning? Datasets can encode patterns + decisions, not just input–output. You’re testing how the agent thinks—not just what it outputs.

#AgentEvaluation #MLflow #opensource