Simon Willison
@simon.fedi.simonwillison.net.ap.brid.gy
Open source developer building tools to help journalists, archivists, librarians and others analyze, explore and publish their data. https://datasette.io […]

[bridged from https://fedi.simonwillison.net/@simon on the fediverse by https://fed.brid.gy/ ]
For comparison, here are the pelicans riding bicycles drawn by GPT-5-Codex-Mini (the new model), GPT-5-Codex and full GPT-5 - all produced via the same hacked version of the Codex CLI tool
November 9, 2025 at 3:48 AM
OpenAI partially released a new model yesterday called GPT-5-Codex-Mini

No API access yet, but I did some truly horrible things to their Codex CLI app to get it to spit out this SVG of a pelican riding a bicycle
November 9, 2025 at 3:38 AM
And here's an example of one of my code research prompts
November 6, 2025 at 4:06 PM
Here's my research repo - each of the 13 folders is a different research project, and the README is automatically updated by an LLM to include summaries describing each one https://github.com/simonw/research?tab=readme-ov-file#research-projects-carried-out-by-ai-tools
November 6, 2025 at 4:03 PM
And in case you don't make it as far as the "miscellaneous tips" section, here's a bunch of lessons I learned about working with coding agents that I picked up along the way https://simonwillison.net/2025/Nov/4/datasette-10a20/#miscellaneous-tips-i-picked-up-along-the-way
November 4, 2025 at 9:47 PM
Just sent out the October edition of my sponsors-only monthly newsletter - you can pay me $10/month to send you less!

Here's the table of contents
https://simonwillison.net/2025/Nov/1/sponsors-only-newsletter/
November 1, 2025 at 10:15 PM
October 23, 2025 at 4:40 AM
Asynchronous coding agents are the fastest and safest route to running coding agents in a sandbox without constant supervision
October 22, 2025 at 12:41 PM
Just for fun, I had Claude Code figure out how to run the ~2001-era Perl and C SLOCCount program in WebAssembly in the browser, complete with a UI for counting source code lines from pasted text, a GitHub repository or a zip file […]

[Original post on fedi.simonwillison.net]
October 22, 2025 at 6:23 AM
It's neat to see them encourage developers to add ARIA attributes to pages though - an "agent" can be thought of as effectively another form of assistive technology
October 21, 2025 at 6:50 PM
Here's my vibe-coded tool for displaying the Responses JSON returned from a deep research API call in a more readable way: https://tools.simonwillison.net/deep-research-viewer#gist=3454a4ce40f8547a5c65c911de611ff4 - built by Claude Code in this session […]

[Original post on fedi.simonwillison.net]
October 18, 2025 at 7:32 PM
I misquoted the llama.cpp performance numbers in my original post - here's the updated section, which now distinguishes between token read speed and token generation speed
October 15, 2025 at 12:46 AM
Where it really shines is in their new https://claude.ai/ Code Interpreter mode - I had it check out my GitHub repo, install dependencies, run tests and experiment with a complex new feature, all prompted from the web browser on my iPhone […]

[Original post on fedi.simonwillison.net]
September 29, 2025 at 6:17 PM
New on Niche Museums: my write-up of a visit to the Musical Museum in Brentford, London... player pianos, self-playing violins, and orchestrions! https://www.niche-museums.com/115
September 21, 2025 at 4:00 PM
The official White House "rapid response" account on Twitter has now denied that this affects current visa holders https://twitter.com/rapidresponse47/status/1969476188008575149
September 20, 2025 at 7:58 PM
Leaked memo from Amazon that warns existing H-1B holders to avoid travel back into the USA after the September 21st deadline - their lawyers evaluated the new executive order as not just affecting new applications https://www.businessinsider.com/read-memos-sent-big-tech-trump-h-1b-changes-2025-9
September 20, 2025 at 3:58 PM
The worst offenders for constantly redefining agents with new, vague and inconsistent definitions are OpenAI themselves https://simonwillison.net/2025/Sep/18/agents/#openai-need-to-get-their-story-straight
September 18, 2025 at 7:32 PM
Includes this note about why "agents as human replacements" is my least favorite definition - because unlike AI agents, humans have agency!
September 18, 2025 at 7:25 PM
And an update, since it turns out Anthropic announced a new memory feature yesterday that's more similar to how OpenAI's works https://www.anthropic.com/news/memory
September 12, 2025 at 8:23 AM
Updated that post to add some notes on an important aspect I'd missed:
September 11, 2025 at 7:27 AM
As a bonus I had GPT-5 figure out how to render the resulting chart entirely in the browser using Pyodide to run Python and matplotlib using WebAssembly - here's the result https://tools.simonwillison.net/ai-adoption
September 9, 2025 at 7:00 AM
I got Codex CLI and GPT-5 to help me modify the Transformers.js Llama 3.2 chat demo to enable loading that 1.2GB model from a local folder instead of fetching it from a URL

Full details including the Codex transcript and prompts I used here: https://simonwillison.net/2025/Sep/8/webgpu-local-folder/
September 8, 2025 at 9:04 PM
Some notes on gpt-realtime - a slightly confusing release since it appears gpt-realtime replaces gpt-4o-realtime-preview but is still accompanied by the much cheaper gpt-4o-mini-realtime-preview https://simonwillison.net/2025/Sep/1/introducing-gpt-realtime/
September 1, 2025 at 5:38 PM
I left it running overnight for the full 50 inference steps - my 64GB M2 MacBook Pro took 2 hours 59 minutes to generate this image
August 20, 2025 at 3:35 PM
And the results from that eval, which runs 30 questions from the 2025 American Invitational Mathematics Examination 8 times each (240 prompts total) https://static.simonwillison.net/static/2025/gpt-oss-20b-aime25/gpt-oss-20b-low_temp1.0_20250816_094011.html
August 17, 2025 at 3:55 AM