Simon Willison
@simon.fedi.simonwillison.net.ap.brid.gy
Open source developer building tools to help journalists, archivists, librarians and others analyze, explore and publish their data. https://datasette.io […]
[bridged from https://fedi.simonwillison.net/@simon on the fediverse by https://fed.brid.gy/ ]
[bridged from https://fedi.simonwillison.net/@simon on the fediverse by https://fed.brid.gy/ ]
I ran their "Write a POV-Ray file that shows a pelican riding on a bicycle" prompt against the new gpt-5-codex-mini and got the worst result yet! https://simonwillison.net/2025/Nov/9/pelican-on-a-bike-raytracer-edition/
Pelican on a Bike—Raytracer Edition
beetle_b ran this prompt against a bunch of recent LLMs: Write a POV-Ray file that shows a pelican riding on a bicycle. This turns out to be a harder challenge …
simonwillison.net
November 9, 2025 at 5:17 PM
I ran their "Write a POV-Ray file that shows a pelican riding on a bicycle" prompt against the new gpt-5-codex-mini and got the worst result yet! https://simonwillison.net/2025/Nov/9/pelican-on-a-bike-raytracer-edition/
For comparison, here are the pelicans riding bicycles drawn by GPT-5-Codex-Mini (the new model), GPT-5-Codex and full GPT-5 - all produced via the same hacked version of the Codex CLI tool
November 9, 2025 at 3:48 AM
For comparison, here are the pelicans riding bicycles drawn by GPT-5-Codex-Mini (the new model), GPT-5-Codex and full GPT-5 - all produced via the same hacked version of the Codex CLI tool
I also recorded a 7 minute YouTube video showing how I got Codex to reverse-engineer and then extend itself in order to draw me that pelican https://www.youtube.com/watch?v=9o1_DL9uNlM
November 9, 2025 at 3:40 AM
I also recorded a 7 minute YouTube video showing how I got Codex to reverse-engineer and then extend itself in order to draw me that pelican https://www.youtube.com/watch?v=9o1_DL9uNlM
Since there's no API access yet I got OpenAI's Codex coding agent to rewrite itself (in Rust) to add a new "codex prompt ..." command which I could use to run prompts against the private models that are only available within that tool - full details here […]
Original post on fedi.simonwillison.net
fedi.simonwillison.net
November 9, 2025 at 3:39 AM
Since there's no API access yet I got OpenAI's Codex coding agent to rewrite itself (in Rust) to add a new "codex prompt ..." command which I could use to run prompts against the private models that are only available within that tool - full details here […]
uv makes testing different projects against upgraded dependencies so much easier - no need to think about virtual environments, uv handles them almost invisibly
I wrote more about my uv testing tricks in this TIL https://til.simonwillison.net/python/uv-tests
I wrote more about my uv testing tricks in this TIL https://til.simonwillison.net/python/uv-tests
Testing different Python versions with uv with-editable and uv-test
A quick uv recipe I figured out today, for running the tests for a project against multiple Python versions.
til.simonwillison.net
November 6, 2025 at 6:44 PM
uv makes testing different projects against upgraded dependencies so much easier - no need to think about virtual environments, uv handles them almost invisibly
I wrote more about my uv testing tricks in this TIL https://til.simonwillison.net/python/uv-tests
I wrote more about my uv testing tricks in this TIL https://til.simonwillison.net/python/uv-tests
And here's an example of one of my code research prompts
November 6, 2025 at 4:06 PM
And here's an example of one of my code research prompts
Here's my research repo - each of the 13 folders is a different research project, and the README is automatically updated by an LLM to include summaries describing each one https://github.com/simonw/research?tab=readme-ov-file#research-projects-carried-out-by-ai-tools
November 6, 2025 at 4:03 PM
Here's my research repo - each of the 13 folders is a different research project, and the README is automatically updated by an LLM to include summaries describing each one https://github.com/simonw/research?tab=readme-ov-file#research-projects-carried-out-by-ai-tools
It's a great TikTok! https://www.tiktok.com/@belligerentbarbies/video/7568380008633257271
November 5, 2025 at 3:24 PM
It's a great TikTok! https://www.tiktok.com/@belligerentbarbies/video/7568380008633257271
And in case you don't make it as far as the "miscellaneous tips" section, here's a bunch of lessons I learned about working with coding agents that I picked up along the way https://simonwillison.net/2025/Nov/4/datasette-10a20/#miscellaneous-tips-i-picked-up-along-the-way
November 4, 2025 at 9:47 PM
And in case you don't make it as far as the "miscellaneous tips" section, here's a bunch of lessons I learned about working with coding agents that I picked up along the way https://simonwillison.net/2025/Nov/4/datasette-10a20/#miscellaneous-tips-i-picked-up-along-the-way
Here's the September edition, if you want to get a preview and catch up with some news that's over a month old now https://gist.github.com/simonw/d6d4d86afc0d76767c63f23fc5137030
september-2025.md
GitHub Gist: instantly share code, notes, and snippets.
gist.github.com
November 1, 2025 at 10:17 PM
Here's the September edition, if you want to get a preview and catch up with some news that's over a month old now https://gist.github.com/simonw/d6d4d86afc0d76767c63f23fc5137030
And it's not just Cursor... rival agentic coding IDE Windsurf announced their own custom RL-trained fast coding model today as well!
Here are notes and a pelican on Windsurf's new SWE-1.5 model https://simonwillison.net/2025/Oct/29/swe-15/
Here are notes and a pelican on Windsurf's new SWE-1.5 model https://simonwillison.net/2025/Oct/29/swe-15/
Introducing SWE-1.5: Our Fast Agent Model
Here's the second fast coding model released by a coding agent IDE in the same day - the first was Composer-1 by Cursor. This time it's Windsurf releasing SWE-1.5: Today …
simonwillison.net
October 30, 2025 at 12:06 AM
And it's not just Cursor... rival agentic coding IDE Windsurf announced their own custom RL-trained fast coding model today as well!
Here are notes and a pelican on Windsurf's new SWE-1.5 model https://simonwillison.net/2025/Oct/29/swe-15/
Here are notes and a pelican on Windsurf's new SWE-1.5 model https://simonwillison.net/2025/Oct/29/swe-15/
So that's DeepSeek, Qwen, Moonshot (Kimi), Z.ai (GLM) and now MiniMax all turning out frontier open weight models in China
The ai-in-china tag on my blog is pretty busy this year! https://simonwillison.net/tags/ai-in-china/
The ai-in-china tag on my blog is pretty busy this year! https://simonwillison.net/tags/ai-in-china/
Simon Willison on ai-in-china
77 posts tagged ‘ai-in-china’. See also my tags for Qwen and DeepSeek.
simonwillison.net
October 29, 2025 at 11:04 PM
So that's DeepSeek, Qwen, Moonshot (Kimi), Z.ai (GLM) and now MiniMax all turning out frontier open weight models in China
The ai-in-china tag on my blog is pretty busy this year! https://simonwillison.net/tags/ai-in-china/
The ai-in-china tag on my blog is pretty busy this year! https://simonwillison.net/tags/ai-in-china/