archtoad.bsky.social
@archtoad.bsky.social
Brandon Bird: "King of the Cage"
brandonbird.com
November 19, 2025 at 1:31 PM
I connected my laptop to my piano and typed into the terminal “connect to my piano and play a few notes with midi” and it worked first try. This is some Star Trek shit. If you told me 5 years ago this would be possible today I would not have believed you.
November 6, 2025 at 9:22 PM
“I don’t want to hear from Mitchell because I don’t think I would enjoy her content” - sure whatever (you’re misrepresenting her work but that’s your choice). “I don’t want to hear from Mitchell because she doesn’t know how NNs work” makes you sound like an uninformed asshole.
November 4, 2025 at 2:13 PM
The paper as a whole holds up! It’s about the risks/limitations of scaling language models - all very relevant today! How many NLP papers from 2020-2021 can you say that about?
November 4, 2025 at 1:39 PM
So to recap, you don’t want to ever hear from Mitchell because of one sentence in a paper that summarizes her co-authors position re: a linguistic theory about form vs meaning, which disqualifies her from ever knowing how these things work “in a relevant sense” ?
November 4, 2025 at 1:37 PM
The premise of the paper is “there are risks/downsides to larger models.” Nowhere in the paper does it claim anything like “language models can’t generalize to unseen prompts.” You’re just straw manning some thesis onto the paper based on the phrase “Stochastic Parrots.”
November 4, 2025 at 1:14 PM
I don’t think this is bad faith. Margaret Mitchell has a long CV with plenty of papers that go beyond the scope of the Stochastic Parrots paper that clearly demonstrate she knows how NNs work?
November 4, 2025 at 12:56 PM
I just put in my global AGENTS.md that every python project uses uv and briefly explain how to use “uv run” - haven’t had to remind it since
AGENTS.md
AGENTS.md is a simple, open format for guiding coding agents. Think of it as a README for agents.
AGENTS.md
November 2, 2025 at 10:29 PM
“Traditional NLP models like BERT…”
October 31, 2025 at 9:47 AM
My takeaway is deberta baseline is the winner here? Way easier to train/deploy. Also what if you scaled the encoder-classifier up to a comparable size?
October 30, 2025 at 12:04 PM
Right but we have users who are like “I can’t find the [microsoft] copilot button” - getting them to install/figure out Claude code is just not practical.
October 25, 2025 at 11:57 AM
Good stuff. Does this thinking extend to more general things like Microsoft copilot and ChatGPT? Or are you saying normies should start using coding agents
October 25, 2025 at 1:58 AM
There was an interesting paper earlier this year about a “recurrent depth” technique that allowed the model to reuse layers … this what you mean? arxiv.org/abs/2502.05171
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach
We study a novel language model architecture that is capable of scaling test-time computation by implicitly reasoning in latent space. Our model works by iterating a recurrent block, thereby unrolling...
arxiv.org
October 15, 2025 at 8:35 AM
Yeah plenty of examples of code golf / people trying to put like 5 lines of code in a single line to show that They Can and it just makes unreadable garbage
October 14, 2025 at 4:38 PM
Not sure what you mean by “traditional UX” but I’d agree that having creative UX people who can think outside the box is more important than ever
October 7, 2025 at 4:55 PM
I’ve had many meetings where people are arguing over how the prototype should be built and by the end of the meeting I’m like “here it is”
September 21, 2025 at 4:08 PM
I just heard about rapids.ai which is a concrete effort to do all the data science, etc. things on GPUs
RAPIDS | GPU Accelerated Data Science
Open source GPU accelerated data science libraries
rapids.ai
August 25, 2025 at 9:24 PM
Was thinking about this re: “wow I should really get better and writing clear and consistent documentation for my repos so my agents know how to use it”
July 15, 2025 at 4:48 PM
Check out lucumr.pocoo.org/2025/7/3/too... from @mitsuhiko.at if you haven’t… basically saying that CLIs >>> MCP (e.g., gh vs GitHub MCP)
Tools: Code Is All You Need
The solution to agentic flows was code all along.
lucumr.pocoo.org
July 15, 2025 at 6:51 AM
I love your concept about building bespoke dev tools (like ways to search logs) for the agents - would love to hear about more of these and how you approach building them!
July 10, 2025 at 11:49 PM
moondream.ai is another
Moondream
Moondream AI - Vision language model for everyone
moondream.ai
July 9, 2025 at 4:49 PM