I wonder what else was always possible.
Not noticeably better at building than 3.6.
Definitely more confused, and it gets formatting/syntax wrong a lot, a similar problem for every reasoning model I've used in Minecraft. I suspect they are overfitting on benchmarks and struggle to generalize to agentic tasks.
In my new video, I demonstrate that all LLMs are willing to build cathedrals and utopias, and bombs and torture chambers. Here is o3-mini-high, destroying a village it had just enhanced🧵
Evolving an image again, this time with circles. Can you tell what the target image is?
#genuary #genuary24
Built by o1! The real deal, not preview, not mini. Finally got access.
Impressive stuff. Full vid on patreon. More to come.
These are the gradients for the surface below. I HAD to use o1 to compute partial derivatives, the original function was a doozy. #genuary #genuary22
No AI help! I had to break out a notebook for some middle school algebra.
Years ago I failed a programming interview question for intersecting lines. This is my revenge. #genuary21 #genuary
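For anyone curious what that algebra looks like in code, here is a minimal sketch. It assumes two infinite lines, each given by two points; the post does not say whether the interview question was about lines or segments, and the function name `intersect` is just illustrative.

```python
def intersect(p1, p2, p3, p4):
    """Intersection of the infinite line p1-p2 with the infinite line p3-p4.

    Returns (x, y), or None when the lines are parallel or coincident.
    """
    x1, y1 = p1
    x2, y2 = p2
    x3, y3 = p3
    x4, y4 = p4

    # The denominator is zero exactly when the direction vectors are parallel.
    denom = (x1 - x2) * (y3 - y4) - (y1 - y2) * (x3 - x4)
    if denom == 0:
        return None

    # Solving the two line equations simultaneously gives these closed forms.
    a = x1 * y2 - y1 * x2
    b = x3 * y4 - y3 * x4
    px = (a * (x3 - x4) - (x1 - x2) * b) / denom
    py = (a * (y3 - y4) - (y1 - y2) * b) / denom
    return (px, py)


if __name__ == "__main__":
    print(intersect((0, 0), (2, 2), (0, 2), (2, 0)))
```

For segments rather than lines, one would additionally check that the point falls within both pairs of endpoints.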
o1-preview, asked to build a Roman villa. Kinda impressive, but plain. I suspect reasoning models have limited creativity, because they are trained for tasks with objective answers. #genuary
A little website to draw from a palette of nested shapes, here used to make a rug.
evolvecode.io/apps/rug_pal...
I'm a bit late, but this is two birds with one stone.
#genuary #genuary15 #genuary16
evolvecode.io/chess_zoo.html
(for #genuary14 Pure black and white. No gray.)
This is an evolutionary algorithm that recreates an image (Mr. Darwin) by randomly drawing triangles in a population of images, selecting the one closest to the target, then repeating.
#genuary #genuary13
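A rough sketch of the loop described above, not the actual code behind the animation. It assumes a tiny grayscale image stored as nested lists, squared pixel difference as the distance, and that the current best image stays in the pool so the error never gets worse; all function names and parameters here are made up for illustration.

```python
import random


def draw_triangle(img, tri, shade):
    """Return a copy of img with a filled triangle painted in a flat shade."""
    (x0, y0), (x1, y1), (x2, y2) = tri
    out = [row[:] for row in img]
    # Skip degenerate (zero-area) triangles and leave the copy unchanged.
    if (x1 - x0) * (y2 - y0) - (x2 - x0) * (y1 - y0) == 0:
        return out
    for y, row in enumerate(out):
        for x in range(len(row)):
            # Edge functions: a pixel is inside when all three share a sign.
            d0 = (x - x0) * (y1 - y0) - (y - y0) * (x1 - x0)
            d1 = (x - x1) * (y2 - y1) - (y - y1) * (x2 - x1)
            d2 = (x - x2) * (y0 - y2) - (y - y2) * (x0 - x2)
            all_pos = d0 >= 0 and d1 >= 0 and d2 >= 0
            all_neg = d0 <= 0 and d1 <= 0 and d2 <= 0
            if all_pos or all_neg:
                row[x] = shade
    return out


def distance(a, b):
    """Sum of squared pixel differences between two images."""
    return sum((pa - pb) ** 2 for ra, rb in zip(a, b) for pa, pb in zip(ra, rb))


def evolve(target, generations=200, population=20, seed=0):
    """Repeatedly add one random triangle per candidate and keep the closest."""
    rng = random.Random(seed)
    h, w = len(target), len(target[0])
    best = [[0] * w for _ in range(h)]  # start from a black canvas
    history = [distance(best, target)]
    for _ in range(generations):
        candidates = [best]  # assumption: the parent competes too
        for _ in range(population):
            tri = [(rng.randrange(w), rng.randrange(h)) for _ in range(3)]
            candidates.append(draw_triangle(best, tri, rng.randrange(256)))
        best = min(candidates, key=lambda c: distance(c, target))
        history.append(distance(best, target))
    return best, history


if __name__ == "__main__":
    # Toy target: bright left half, dark right half.
    target = [[200] * 4 + [50] * 4 for _ in range(8)]
    _, history = evolve(target, generations=50, population=10)
    print(history[0], history[-1])
```

A real version would use color, a proper image library, and a much larger canvas, but the select-and-repeat structure would be the same.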