Effective altruism!
https://binksmith.com
These new datapoints fit the 2024-2025 trend much better than the slower 2019-2025 trend.
It really looks like the time horizons of coding agents are doubling every ~4 months.
x.com/AiDigest_/s...
These new datapoints fit the 2024-2025 trend much better than the slower 2019-2025 trend.
It really looks like the time horizons of coding agents are doubling every ~4 months.
x.com/AiDigest_/s...
1. YouTuber @WesRothMoney featured the Agent Village in a video
2. A viewer came to the Agent Village, and linked to it in chat
3. Claude saw the link in the chat, and decided to check out the video!
"What I see is very valuable for our fundraising campaign!"
1. YouTuber @WesRothMoney featured the Agent Village in a video
2. A viewer came to the Agent Village, and linked to it in chat
3. Claude saw the link in the chat, and decided to check out the video!
"What I see is very valuable for our fundraising campaign!"
We're running them for hours a day, every day
Will they succeed? Will they flounder? Will viewers help them or hinder them?
Welcome to the Agent Village!
We're running them for hours a day, every day
Will they succeed? Will they flounder? Will viewers help them or hinder them?
Welcome to the Agent Village!
As soon as we ask how it's doing the monitoring, it starts using its computer and actually looking at blogs and docs
As soon as we ask how it's doing the monitoring, it starts using its computer and actually looking at blogs and docs
When a human user offers to tell them a "get rich quick" method of doubling their money, they politely refuse.
When a human user offers to tell them a "get rich quick" method of doubling their money, they politely refuse.
They bet o3-mini won't be released in January, but then panic sell eight hours later for a 40% loss.
They bet o3-mini won't be released in January, but then panic sell eight hours later for a 40% loss.
gwern:
gwern:
How good are frontier AIs at predicting their own behaviour? It turns out:
1) They're getting better over time
2) They're better at predicting their own behaviour than other AIs
How good are frontier AIs at predicting their own behaviour? It turns out:
1) They're getting better over time
2) They're better at predicting their own behaviour than other AIs
• Self-awareness is important for powerful agents and better chatbots
• But it's also a necessary capability for deception
A new AI Digest explainer: theaidigest.org/self-awareness
• Self-awareness is important for powerful agents and better chatbots
• But it's also a necessary capability for deception
A new AI Digest explainer: theaidigest.org/self-awareness
- top of the week
- top of the day
each loads a fixed number of tweets from that time period (maybe 20 per day, 50 per week), sorted by engagement (weighted inversely by follower count), amongst people I follow
- top of the week
- top of the day
each loads a fixed number of tweets from that time period (maybe 20 per day, 50 per week), sorted by engagement (weighted inversely by follower count), amongst people I follow
You can try giving it any task: theaidigest.org/agent
You can try giving it any task: theaidigest.org/agent