Lightnews — Scholar-powered news

archtoad.bsky.social

@archtoad.bsky.social

@brandonbird.bsky.social already thought this one through brandonbird.com/kingofcage.h...

Brandon Bird: "King of the Cage"

brandonbird.com

November 19, 2025 at 1:31 PM

archtoad.bsky.social

@archtoad.bsky.social

I connected my laptop to my piano and typed into the terminal “connect to my piano and play a few notes with midi” and it worked first try. This is some Star Trek shit. If you told me 5 years ago this would be possible today I would not have believed you.

November 6, 2025 at 9:22 PM

archtoad.bsky.social

@archtoad.bsky.social

“I don’t want to hear from Mitchell because I don’t think I would enjoy her content” - sure whatever (you’re misrepresenting her work but that’s your choice). “I don’t want to hear from Mitchell because she doesn’t know how NNs work” makes you sound like an uninformed asshole.

November 4, 2025 at 2:13 PM

archtoad.bsky.social

@archtoad.bsky.social

The paper as a whole holds up! It’s about the risks/limitations of scaling language models - all very relevant today! How many NLP papers from 2020-2021 can you say that about?

November 4, 2025 at 1:39 PM

archtoad.bsky.social

@archtoad.bsky.social

So to recap, you don’t want to ever hear from Mitchell because of one sentence in a paper that summarizes her co-authors position re: a linguistic theory about form vs meaning, which disqualifies her from ever knowing how these things work “in a relevant sense” ?

November 4, 2025 at 1:37 PM

archtoad.bsky.social

@archtoad.bsky.social

The premise of the paper is “there are risks/downsides to larger models.” Nowhere in the paper does it claim anything like “language models can’t generalize to unseen prompts.” You’re just straw manning some thesis onto the paper based on the phrase “Stochastic Parrots.”

November 4, 2025 at 1:14 PM

archtoad.bsky.social

@archtoad.bsky.social

I don’t think this is bad faith. Margaret Mitchell has a long CV with plenty of papers that go beyond the scope of the Stochastic Parrots paper that clearly demonstrate she knows how NNs work?

November 4, 2025 at 12:56 PM

archtoad.bsky.social

@archtoad.bsky.social

I just put in my global AGENTS.md that every python project uses uv and briefly explain how to use “uv run” - haven’t had to remind it since

AGENTS.md

AGENTS.md is a simple, open format for guiding coding agents. Think of it as a README for agents.

AGENTS.md

November 2, 2025 at 10:29 PM

archtoad.bsky.social

@archtoad.bsky.social

“Traditional NLP models like BERT…”

October 31, 2025 at 9:47 AM

archtoad.bsky.social

@archtoad.bsky.social

My takeaway is deberta baseline is the winner here? Way easier to train/deploy. Also what if you scaled the encoder-classifier up to a comparable size?

October 30, 2025 at 12:04 PM

archtoad.bsky.social

@archtoad.bsky.social

Right but we have users who are like “I can’t find the [microsoft] copilot button” - getting them to install/figure out Claude code is just not practical.

October 25, 2025 at 11:57 AM

archtoad.bsky.social

@archtoad.bsky.social

Good stuff. Does this thinking extend to more general things like Microsoft copilot and ChatGPT? Or are you saying normies should start using coding agents

October 25, 2025 at 1:58 AM

archtoad.bsky.social

@archtoad.bsky.social

There was an interesting paper earlier this year about a “recurrent depth” technique that allowed the model to reuse layers … this what you mean? arxiv.org/abs/2502.05171

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

We study a novel language model architecture that is capable of scaling test-time computation by implicitly reasoning in latent space. Our model works by iterating a recurrent block, thereby unrolling...

arxiv.org

October 15, 2025 at 8:35 AM

archtoad.bsky.social

@archtoad.bsky.social

Yeah plenty of examples of code golf / people trying to put like 5 lines of code in a single line to show that They Can and it just makes unreadable garbage

October 14, 2025 at 4:38 PM

archtoad.bsky.social

@archtoad.bsky.social

Not sure what you mean by “traditional UX” but I’d agree that having creative UX people who can think outside the box is more important than ever

October 7, 2025 at 4:55 PM

archtoad.bsky.social

@archtoad.bsky.social

I’ve had many meetings where people are arguing over how the prototype should be built and by the end of the meeting I’m like “here it is”

September 21, 2025 at 4:08 PM

archtoad.bsky.social

@archtoad.bsky.social

I just heard about rapids.ai which is a concrete effort to do all the data science, etc. things on GPUs

RAPIDS | GPU Accelerated Data Science

Open source GPU accelerated data science libraries

rapids.ai

August 25, 2025 at 9:24 PM

archtoad.bsky.social

@archtoad.bsky.social

a green witch singing into a microphone with the words in the year 2000

ALT: a green witch singing into a microphone with the words in the year 2000

media.tenor.com

August 14, 2025 at 10:18 AM

archtoad.bsky.social

@archtoad.bsky.social

Something like github.com/AnswerDotAI/... ?

GitHub - AnswerDotAI/llms-txt: The /llms.txt file, helping language models use your website

The /llms.txt file, helping language models use your website - AnswerDotAI/llms-txt

github.com

July 18, 2025 at 10:03 AM

archtoad.bsky.social

@archtoad.bsky.social

Was thinking about this re: “wow I should really get better and writing clear and consistent documentation for my repos so my agents know how to use it”

July 15, 2025 at 4:48 PM

archtoad.bsky.social

@archtoad.bsky.social

Check out lucumr.pocoo.org/2025/7/3/too... from @mitsuhiko.at if you haven’t… basically saying that CLIs >>> MCP (e.g., gh vs GitHub MCP)

Tools: Code Is All You Need

The solution to agentic flows was code all along.

lucumr.pocoo.org

July 15, 2025 at 6:51 AM

archtoad.bsky.social

@archtoad.bsky.social

I love your concept about building bespoke dev tools (like ways to search logs) for the agents - would love to hear about more of these and how you approach building them!

July 10, 2025 at 11:49 PM