Simon Willison
@simonwillison.net
Independent AI researcher, creator of datasette.io and llm.datasette.io, building open source tools for data journalism, writing about a lot of stuff at https://simonwillison.net/
Fun new variant on my pelican riding a bicycle SVG benchmark by BeetleB, who instead used "Write a POV-Ray file that shows a pelican riding on a bicycle" to get ray-traced pelicans: blog.nawaz.org/posts/2025/O...
November 9, 2025 at 5:22 PM
Fun new variant on my pelican riding a bicycle SVG benchmark by BeetleB, who instead used "Write a POV-Ray file that shows a pelican riding on a bicycle" to get ray-traced pelicans: blog.nawaz.org/posts/2025/O...
For comparison, here are the pelicans riding bicycles drawn by GPT-5-Codex-Mini (the new model), GPT-5-Codex and full GPT-5 - all produced via the same hacked version of the Codex CLI tool
November 9, 2025 at 3:47 AM
For comparison, here are the pelicans riding bicycles drawn by GPT-5-Codex-Mini (the new model), GPT-5-Codex and full GPT-5 - all produced via the same hacked version of the Codex CLI tool
OpenAI partially released a new model yesterday called GPT-5-Codex-Mini
No API access yet, but I did some truly horrible things to their Codex CLI app to get it to spit out this SVG of a pelican riding a bicycle
No API access yet, but I did some truly horrible things to their Codex CLI app to get it to spit out this SVG of a pelican riding a bicycle
November 9, 2025 at 3:37 AM
OpenAI partially released a new model yesterday called GPT-5-Codex-Mini
No API access yet, but I did some truly horrible things to their Codex CLI app to get it to spit out this SVG of a pelican riding a bicycle
No API access yet, but I did some truly horrible things to their Codex CLI app to get it to spit out this SVG of a pelican riding a bicycle
And here's an example of one of my code research prompts
November 6, 2025 at 4:02 PM
And here's an example of one of my code research prompts
Here's my research repo - each of the 13 folders is a different research project, and the README is automatically updated by an LLM to include summaries describing each one github.com/simonw/resea...
November 6, 2025 at 3:59 PM
Here's my research repo - each of the 13 folders is a different research project, and the README is automatically updated by an LLM to include summaries describing each one github.com/simonw/resea...
And in case you don't make it as far as the "miscellaneous tips" section, here's a bunch of lessons I learned about working with coding agents that I picked up along the way simonwillison.net/2025/Nov/4/d...
November 4, 2025 at 11:37 PM
And in case you don't make it as far as the "miscellaneous tips" section, here's a bunch of lessons I learned about working with coding agents that I picked up along the way simonwillison.net/2025/Nov/4/d...
Just sent out the October edition of my sponsors-only monthly newsletter - you can pay me $10/month to send you less!
Here's the table of contents
simonwillison.net/2025/Nov/1/s...
Here's the table of contents
simonwillison.net/2025/Nov/1/s...
November 1, 2025 at 10:15 PM
Just sent out the October edition of my sponsors-only monthly newsletter - you can pay me $10/month to send you less!
Here's the table of contents
simonwillison.net/2025/Nov/1/s...
Here's the table of contents
simonwillison.net/2025/Nov/1/s...
When I asked it to figure out it how to run cmarkgfm in Pyodide it noticed that the repo already had notes on Pyodide and reviewed them before starting work on the new project gistpreview.github.io?6d778a8f9c4c...
October 23, 2025 at 9:54 PM
When I asked it to figure out it how to run cmarkgfm in Pyodide it noticed that the repo already had notes on Pyodide and reviewed them before starting work on the new project gistpreview.github.io?6d778a8f9c4c...
Prompt -> Result tools.simonwillison.net/terminal-to-...
October 23, 2025 at 4:40 AM
Prompt -> Result tools.simonwillison.net/terminal-to-...
Asynchronous coding agents are the fastest and safest route to running coding agents in a sandbox without constant supervision
October 22, 2025 at 12:41 PM
Asynchronous coding agents are the fastest and safest route to running coding agents in a sandbox without constant supervision
Just for fun, I had Claude Code figure out how to run the ~2001-era Perl and C SLOCCount program in WebAssembly in the browser, complete with a UI for counting source code lines from pasted text, a GitHub repository or a zip file
simonwillison.net/2025/Oct/22/...
Here's an animated demo
simonwillison.net/2025/Oct/22/...
Here's an animated demo
October 22, 2025 at 6:24 AM
Just for fun, I had Claude Code figure out how to run the ~2001-era Perl and C SLOCCount program in WebAssembly in the browser, complete with a UI for counting source code lines from pasted text, a GitHub repository or a zip file
simonwillison.net/2025/Oct/22/...
Here's an animated demo
simonwillison.net/2025/Oct/22/...
Here's an animated demo
It's neat to see them encourage developers to add ARIA tags to pages though, an "agent" can be thought of as effectively another form of assistive technology
October 21, 2025 at 6:49 PM
It's neat to see them encourage developers to add ARIA tags to pages though, an "agent" can be thought of as effectively another form of assistive technology
Here's my vibe-coded tool for displaying the Responses JSON returned from a deep research API call in a more readable way: tools.simonwillison.net/deep-researc... - built by Claude Code in this session: gist.github.com/simonw/d2251... - because life's too short to waste it digging through raw JSON
October 18, 2025 at 7:31 PM
Here's my vibe-coded tool for displaying the Responses JSON returned from a deep research API call in a more readable way: tools.simonwillison.net/deep-researc... - built by Claude Code in this session: gist.github.com/simonw/d2251... - because life's too short to waste it digging through raw JSON
I misquoted the llama.cpp performance numbers in my original post, here's the updated section which now distinguishes between token read speed and token generation speed
October 15, 2025 at 12:45 AM
I misquoted the llama.cpp performance numbers in my original post, here's the updated section which now distinguishes between token read speed and token generation speed
Claude Code tip: prompt "use sub-agents" and it will do exactly that, firing up multiple parallel sub-agents to achieve a task, each with a fresh context
"Write markdown documentation for the context passed to each of the templates in this project. Use sub-agents." simonwillison.net/2025/Oct/11/...
"Write markdown documentation for the context passed to each of the templates in this project. Use sub-agents." simonwillison.net/2025/Oct/11/...
October 11, 2025 at 9:32 PM
Claude Code tip: prompt "use sub-agents" and it will do exactly that, firing up multiple parallel sub-agents to achieve a task, each with a fresh context
"Write markdown documentation for the context passed to each of the templates in this project. Use sub-agents." simonwillison.net/2025/Oct/11/...
"Write markdown documentation for the context passed to each of the templates in this project. Use sub-agents." simonwillison.net/2025/Oct/11/...
I really enjoy being able to spin up custom visualization tools as part of writing up a short blog post like this, here's my new Graphviz URL-based DOT renderer, built as a Claude artifact in a couple of minutes tools.simonwillison.net/dot#digraph%... - transcript: claude.ai/share/2b78a9...
October 10, 2025 at 11:40 PM
I really enjoy being able to spin up custom visualization tools as part of writing up a short blog post like this, here's my new Graphviz URL-based DOT renderer, built as a Claude artifact in a couple of minutes tools.simonwillison.net/dot#digraph%... - transcript: claude.ai/share/2b78a9...
OpenAI released a new image gen model today - gpt-image-1-mini - which they describe as "80% less expensive than the large model"
I vibe-coded a Python CLI tool to try it out. Here's "racoon eating cheese wearing a top hat, realistic photo" which cost ~0.2 cents simonwillison.net/2025/Oct/6/g...
I vibe-coded a Python CLI tool to try it out. Here's "racoon eating cheese wearing a top hat, realistic photo" which cost ~0.2 cents simonwillison.net/2025/Oct/6/g...
October 6, 2025 at 10:57 PM
OpenAI released a new image gen model today - gpt-image-1-mini - which they describe as "80% less expensive than the large model"
I vibe-coded a Python CLI tool to try it out. Here's "racoon eating cheese wearing a top hat, realistic photo" which cost ~0.2 cents simonwillison.net/2025/Oct/6/g...
I vibe-coded a Python CLI tool to try it out. Here's "racoon eating cheese wearing a top hat, realistic photo" which cost ~0.2 cents simonwillison.net/2025/Oct/6/g...
Joshua Rogers, using AI tooling responsibly and professionally, reported 22+ genuine issues in curl that are now being addressed
Especially notable because curl had problems with floods of garbage slop AI "security issues" in the past that were nothing of the sort simonwillison.net/2025/Oct/2/c...
Especially notable because curl had problems with floods of garbage slop AI "security issues" in the past that were nothing of the sort simonwillison.net/2025/Oct/2/c...
October 2, 2025 at 3:16 PM
Joshua Rogers, using AI tooling responsibly and professionally, reported 22+ genuine issues in curl that are now being addressed
Especially notable because curl had problems with floods of garbage slop AI "security issues" in the past that were nothing of the sort simonwillison.net/2025/Oct/2/c...
Especially notable because curl had problems with floods of garbage slop AI "security issues" in the past that were nothing of the sort simonwillison.net/2025/Oct/2/c...
Where it really shines is in their new claude.ai Code Interpreter mode - I had it checkout my GitHub repo, install dependencies, run tests and experiment with a complex new feature, all prompted from the web browser on my iPhone simonwillison.net/2025/Sep/29/...
September 29, 2025 at 6:17 PM
Where it really shines is in their new claude.ai Code Interpreter mode - I had it checkout my GitHub repo, install dependencies, run tests and experiment with a complex new feature, all prompted from the web browser on my iPhone simonwillison.net/2025/Sep/29/...
I shared a few more notes here, including this hopefully reassuring note for anyone afraid of the impact this will have on their career as a programmer simonwillison.net/2025/Sep/29/...
September 29, 2025 at 4:29 PM
I shared a few more notes here, including this hopefully reassuring note for anyone afraid of the impact this will have on their career as a programmer simonwillison.net/2025/Sep/29/...
New on Niche Musems: my write-up
of a visit to the Musical Museum in Brentford, London... player pianos, self-playing violins, and orchestrions! www.niche-museums.com/115
of a visit to the Musical Museum in Brentford, London... player pianos, self-playing violins, and orchestrions! www.niche-museums.com/115
September 21, 2025 at 3:53 PM
New on Niche Musems: my write-up
of a visit to the Musical Museum in Brentford, London... player pianos, self-playing violins, and orchestrions! www.niche-museums.com/115
of a visit to the Musical Museum in Brentford, London... player pianos, self-playing violins, and orchestrions! www.niche-museums.com/115
Leaked memo from Amazon that warns existing H1B holders to avoid travel back into the USA after the September 21st deadline - their lawyers evaluated the new executive order as not just affecting new applications
September 20, 2025 at 4:14 PM
Leaked memo from Amazon that warns existing H1B holders to avoid travel back into the USA after the September 21st deadline - their lawyers evaluated the new executive order as not just affecting new applications
The worst offenders for constantly redefining agents with new, vague and inconsistent definitions are OpenAI themselves simonwillison.net/2025/Sep/18/...
September 18, 2025 at 7:32 PM
The worst offenders for constantly redefining agents with new, vague and inconsistent definitions are OpenAI themselves simonwillison.net/2025/Sep/18/...
Includes this note about why agents as human replacements is my least favorite definition - because unlike AI agents, humans have agency!
September 18, 2025 at 7:24 PM
Includes this note about why agents as human replacements is my least favorite definition - because unlike AI agents, humans have agency!
It's over! Nice to see a world championship where the USA and China didn't dominate the rankings
September 13, 2025 at 3:02 PM
It's over! Nice to see a world championship where the USA and China didn't dominate the rankings