Simon Willison
@simon.fedi.simonwillison.net.ap.brid.gy
Open source developer building tools to help journalists, archivists, librarians and others analyze, explore and publish their data. https://datasette.io […]
[bridged from https://fedi.simonwillison.net/@simon on the fediverse by https://fed.brid.gy/ ]
For comparison, here are the pelicans riding bicycles drawn by GPT-5-Codex-Mini (the new model), GPT-5-Codex and full GPT-5 - all produced via the same hacked version of the Codex CLI tool
November 9, 2025 at 3:48 AM
OpenAI partially released a new model yesterday called GPT-5-Codex-Mini
No API access yet, but I did some truly horrible things to their Codex CLI app to get it to spit out this SVG of a pelican riding a bicycle
November 9, 2025 at 3:38 AM
And here's an example of one of my code research prompts
November 6, 2025 at 4:06 PM
Here's my research repo - each of the 13 folders is a different research project, and the README is automatically updated by an LLM to include summaries describing each one https://github.com/simonw/research?tab=readme-ov-file#research-projects-carried-out-by-ai-tools
November 6, 2025 at 4:03 PM
And in case you don't make it as far as the "miscellaneous tips" section, here's a bunch of lessons I learned about working with coding agents that I picked up along the way https://simonwillison.net/2025/Nov/4/datasette-10a20/#miscellaneous-tips-i-picked-up-along-the-way
November 4, 2025 at 9:47 PM
Just sent out the October edition of my sponsors-only monthly newsletter - you can pay me $10/month to send you less!
Here's the table of contents
https://simonwillison.net/2025/Nov/1/sponsors-only-newsletter/
November 1, 2025 at 10:15 PM
Prompt -> Result https://tools.simonwillison.net/terminal-to-html
October 23, 2025 at 4:40 AM
Asynchronous coding agents are the fastest and safest route to running coding agents in a sandbox without constant supervision
October 22, 2025 at 12:41 PM
Just for fun, I had Claude Code figure out how to run the ~2001-era Perl and C SLOCCount program in WebAssembly in the browser, complete with a UI for counting source code lines from pasted text, a GitHub repository or a zip file […]
[Original post on fedi.simonwillison.net]
October 22, 2025 at 6:23 AM
It's neat to see them encourage developers to add ARIA tags to pages, though: an "agent" can be thought of as effectively another form of assistive technology
October 21, 2025 at 6:50 PM
Here's my vibe-coded tool for displaying the Responses JSON returned from a deep research API call in a more readable way: https://tools.simonwillison.net/deep-research-viewer#gist=3454a4ce40f8547a5c65c911de611ff4 - built by Claude Code in this session […]
[Original post on fedi.simonwillison.net]
October 18, 2025 at 7:32 PM
I misquoted the llama.cpp performance numbers in my original post, here's the updated section which now distinguishes between token read speed and token generation speed
October 15, 2025 at 12:46 AM
Where it really shines is in their new https://claude.ai/ Code Interpreter mode - I had it check out my GitHub repo, install dependencies, run tests and experiment with a complex new feature, all prompted from the web browser on my iPhone […]
[Original post on fedi.simonwillison.net]
September 29, 2025 at 6:17 PM
New on Niche Museums: my write-up of a visit to the Musical Museum in Brentford, London... player pianos, self-playing violins, and orchestrions! https://www.niche-museums.com/115
September 21, 2025 at 4:00 PM
The official White House "rapid response" account on Twitter has now denied that this affects current visa holders https://twitter.com/rapidresponse47/status/1969476188008575149
September 20, 2025 at 7:58 PM
Leaked memo from Amazon that warns existing H1B holders to avoid travel back into the USA after the September 21st deadline - their lawyers evaluated the new executive order as not just affecting new applications https://www.businessinsider.com/read-memos-sent-big-tech-trump-h-1b-changes-2025-9
September 20, 2025 at 3:58 PM
The worst offenders for constantly redefining agents with new, vague and inconsistent definitions are OpenAI themselves https://simonwillison.net/2025/Sep/18/agents/#openai-need-to-get-their-story-straight
September 18, 2025 at 7:32 PM
Includes this note about why agents as human replacements is my least favorite definition - because unlike AI agents, humans have agency!
September 18, 2025 at 7:25 PM
And an update, since it turns out Anthropic announced a new memory feature yesterday that's more similar to how OpenAI's works https://www.anthropic.com/news/memory
September 12, 2025 at 8:23 AM
Updated that post to add some notes on an important aspect I'd missed:
September 11, 2025 at 7:27 AM
As a bonus I had GPT-5 figure out how to render the resulting chart entirely in the browser using Pyodide to run Python and matplotlib using WebAssembly - here's the result https://tools.simonwillison.net/ai-adoption
September 9, 2025 at 7:00 AM
I got Codex CLI and GPT-5 to help me modify the Transformers.js Llama 3.2 chat demo to enable loading that 1.2GB model from a local folder instead of fetching it from a URL
Full details including the Codex transcript and prompts I used here: https://simonwillison.net/2025/Sep/8/webgpu-local-folder/
September 8, 2025 at 9:04 PM
Some notes on gpt-realtime - a slightly confusing release since it appears gpt-realtime replaces gpt-4o-realtime-preview but is still accompanied by the much cheaper gpt-4o-mini-realtime-preview https://simonwillison.net/2025/Sep/1/introducing-gpt-realtime/
September 1, 2025 at 5:38 PM
I left it running overnight for the full 50 inference steps - my 64GB M2 MacBook Pro took 2 hours 59 minutes to generate this image
August 20, 2025 at 3:35 PM
And the results from that eval, which runs 30 questions from the 2025 American Invitational Mathematics Examination 8 times each (240 prompts total) https://static.simonwillison.net/static/2025/gpt-oss-20b-aime25/gpt-oss-20b-low_temp1.0_20250816_094011.html
August 17, 2025 at 3:55 AM