They try to claim "errors are rising" on "new reasoning systems" primarily based on the Vectara hallucination leaderboard and one OpenAI document. So let's look at what those actually show.
I went ahead and clicked it and was given a long reply with extensive citations (2nd image).
Very cool! .... right?
One man’s ego, and hundreds of cowards in Congress who fail to stop him, explain it all.
Here's an example:
Average annual fire injuries in Berkeley: 2
Average annual traffic injuries in Berkeley: 694
the fact that so many Influencers(tm) feel the need to say that says a lot about both their audience and the fact they publish anyway
- Lewis Mumford
statmodeling.stat.columbia.edu/2025/03/08/a...
Pareto optimal proxy metrics (Zito, Greaves, Soriano et al) North star metrics and online experimentation play a central role in how technology companies improve their products. In many practical settings, however, evaluating experiments based on the north star metric directly can be diff
Those weaknesses are points of leverage for people trying to tear the whole thing down.
www.cartoonshateher.com/p/liberals-w...
RIP to a true artist, visionary, and virtuoso of vibes.