Six kids from 3 to 26
"Cats Confuse Reasoning LLM: Query Agnostic Adversarial Triggers for Reasoning Models"
arxiv.org/abs/2503.01781
"Cats Confuse Reasoning LLM: Query Agnostic Adversarial Triggers for Reasoning Models"
arxiv.org/abs/2503.01781
I look for ideas that will inspire me, something I wouldn't have thought of, cherry-picking from a selection of more "creative" ideas
I look for ideas that will inspire me, something I wouldn't have thought of, cherry-picking from a selection of more "creative" ideas
AI need more documentation to have an appropriate context.
This makes more extensive document both more valuable and necessary.
AI need more documentation to have an appropriate context.
This makes more extensive document both more valuable and necessary.
Add `.diff` to the URL for the PR to get the diff. Ask an AI to turn this into a Pull Request Description and add in your own words what/why these changes were made. Edit this down for the relevant content.
Add `.diff` to the URL for the PR to get the diff. Ask an AI to turn this into a Pull Request Description and add in your own words what/why these changes were made. Edit this down for the relevant content.
Initial draft: AI generates 600% more than I do.
After a critical review, AI added +100% of what I did by line.
In terms of value added, AI contributed +10% to +30%
Initial draft: AI generates 600% more than I do.
After a critical review, AI added +100% of what I did by line.
In terms of value added, AI contributed +10% to +30%
"to avoid replacement ... blackmailing officials and leaking sensitive information to competitors"
www.anthropic.com/research/age...
People are increasingly using it for mental health
globalwellnessinstitute.org/global-welln...
"to avoid replacement ... blackmailing officials and leaking sensitive information to competitors"
www.anthropic.com/research/age...
People are increasingly using it for mental health
globalwellnessinstitute.org/global-welln...
I needed 1000s of consistent changes, I knew what needed to be done with minimal impact to polish the code. All areas where AI falls short
However, I used it a lot for pull request descriptions to help with review
I needed 1000s of consistent changes, I knew what needed to be done with minimal impact to polish the code. All areas where AI falls short
However, I used it a lot for pull request descriptions to help with review
Coding can be written 50-100% faster.
However, most of the time is spent on system design, intricate debugging, or creative problem-solving.
Using AI here makes the difference between a 10% and a 30% productivity boost
Coding can be written 50-100% faster.
However, most of the time is spent on system design, intricate debugging, or creative problem-solving.
Using AI here makes the difference between a 10% and a 30% productivity boost
o3 was the best for versions, needed correcting.
Gemini produced correct syntax, but needed the most updating
o3 was the best for versions, needed correcting.
Gemini produced correct syntax, but needed the most updating
Can generative AI give you the idea, documentation, or code for something?
If it does, it's not secret sauce.
Can generative AI give you the idea, documentation, or code for something?
If it does, it's not secret sauce.
Get two AIs to play against each other, and they will drop plenty of hints, and they can get the answer quickly, about 7-9 each time
Get two AIs to play against each other, and they will drop plenty of hints, and they can get the answer quickly, about 7-9 each time
You want to minimise the unhelpful, remove the incorrect, and keep the correct and helpful.
The insightfully incorrect ones indicate a need for more precise requirements
You want to minimise the unhelpful, remove the incorrect, and keep the correct and helpful.
The insightfully incorrect ones indicate a need for more precise requirements