While GDPVal is exponentially better than an SAT score, you need to do your own evaluation on model-task fit along multiple metrics.
Giving your AI a Job Interview
open.substack.com/pub/oneusefu...
While GDPVal is exponentially better than an SAT score, you need to do your own evaluation on model-task fit along multiple metrics.
Giving your AI a Job Interview
open.substack.com/pub/oneusefu...
Coincidentally, I also was just digging into evals & found TruLens; recommend checking that out, too.
Coincidentally, I also was just digging into evals & found TruLens; recommend checking that out, too.
medium.com/google-cloud...
medium.com/google-cloud...
Neil Postman
I've had so many Orwell v Huxley debates*...never imagined we'd wind up with both at the same time.
Neil Postman
I've had so many Orwell v Huxley debates*...never imagined we'd wind up with both at the same time.
www.derekthompson.org/p/the-end-of...
www.derekthompson.org/p/the-end-of...
openai.com/index/how-pe...
and Anthropic:
www.anthropic.com/research/ant...
Growth acceleration coming from low-mid-income countries so will be fascinating to see impacts of AI dispersion.
openai.com/index/how-pe...
and Anthropic:
www.anthropic.com/research/ant...
Growth acceleration coming from low-mid-income countries so will be fascinating to see impacts of AI dispersion.
"Build with ADK (or any framework), equip with MCP (or any tool), and communicate with A2A to remote agents, local agents, and humans. "
a2a-protocol.org/latest/
"Build with ADK (or any framework), equip with MCP (or any tool), and communicate with A2A to remote agents, local agents, and humans. "
a2a-protocol.org/latest/