WARNING: I talk about kids sometimes
ChatGPT has a “guardian_tool” where it can fetch policies
here’s what mine has, the only policy is around elections. This smells like some politician made a big stink and needed to be calmed
gist.github.com/tkellogg/200...
ChatGPT has a “guardian_tool” where it can fetch policies
here’s what mine has, the only policy is around elections. This smells like some politician made a big stink and needed to be calmed
gist.github.com/tkellogg/200...
especially around Thanksgiving, you go home and show people what AI is capable of, in a way that’s very easy to grok
it changes things
especially around Thanksgiving, you go home and show people what AI is capable of, in a way that’s very easy to grok
it changes things
Wired: decentralized neural network
Wired: decentralized neural network
independent researchers working out in the open on a wildly different concept
they exposed it to 1B tokens of the SYNTH dataset, probably can train longer
independent researchers working out in the open on a wildly different concept
they exposed it to 1B tokens of the SYNTH dataset, probably can train longer
they exposed it to 1B tokens of the SYNTH dataset, probably can train longer
As far as I can tell it's between 5.1 and 10.2 seconds, depending on which end of the 2019 IEA Netflix energy usage estimate you use
simonwillison.net/2025/Nov/29/...
2. pretrain scaling is practically his idea, so ofc he’s not against it. He just thinks there’s more pieces to the puzzle
2. pretrain scaling is practically his idea, so ofc he’s not against it. He just thinks there’s more pieces to the puzzle
otoh my notifications are now clogged with likes, so maybe she knows what people want
otoh my notifications are now clogged with likes, so maybe she knows what people want
their day job is quant trading. AGI is a side hustle
Fascinating paper that explores how to RL but focused on process over outcome
It’s sort of similar to a GAN, but with loops for each the generator & verifier as well as an outer loop
github.com/deepseek-ai/...
their day job is quant trading. AGI is a side hustle
Fascinating article. They argue that the reason for NVIDIA’s circular investment deals is to intertwine their own fate with that of the big labs, to keep themselves on top
OpenAI saved 30% on their NVIDIA GPUs merely by buying TPUs
open.substack.com/pub/semianal...
Fascinating article. They argue that the reason for NVIDIA’s circular investment deals is to intertwine their own fate with that of the big labs, to keep themselves on top
OpenAI saved 30% on their NVIDIA GPUs merely by buying TPUs
open.substack.com/pub/semianal...
Thomas is building extremely cool stuff. Not just today, it’s like a compulsion for him. He can’t NOT build cool stuff. Ever. So my feed is full of him talking about stuff I never would’ve thought to try
go follow him. I’m waiting.
Thomas is building extremely cool stuff. Not just today, it’s like a compulsion for him. He can’t NOT build cool stuff. Ever. So my feed is full of him talking about stuff I never would’ve thought to try
go follow him. I’m waiting.
i showed my daughter a funny video, she laughed, but she was awestruck when i told her it wasn’t AI generated
art isn’t going away, and there’s something distinct about authentic art
i showed my daughter a funny video, she laughed, but she was awestruck when i told her it wasn’t AI generated
art isn’t going away, and there’s something distinct about authentic art
Modern LLMs (GPT-5.1, Claude 4.5, Gemini 3) produce excellent code and can be a significant productivity boost to software engineers who take the time to learn how to effectively apply them - especially if used with coding agent tools
Modern LLMs (GPT-5.1, Claude 4.5, Gemini 3) produce excellent code and can be a significant productivity boost to software engineers who take the time to learn how to effectively apply them - especially if used with coding agent tools
but the model is an advertisement for their infrastructure (that you should use!), and so peek at that too! You should be able to replicate this for your domain
but the model is an advertisement for their infrastructure (that you should use!), and so peek at that too! You should be able to replicate this for your domain
Fascinating paper that explores how to RL but focused on process over outcome
It’s sort of similar to a GAN, but with loops for each the generator & verifier as well as an outer loop
github.com/deepseek-ai/...
Fascinating paper that explores how to RL but focused on process over outcome
It’s sort of similar to a GAN, but with loops for each the generator & verifier as well as an outer loop
github.com/deepseek-ai/...
that was Obama’s schtick. Social, tech, economic, any kind of progress will do
now it feels like the left and right are fighting over which kind of *regress* is better
seems like someone will probably win
that was Obama’s schtick. Social, tech, economic, any kind of progress will do
now it feels like the left and right are fighting over which kind of *regress* is better
seems like someone will probably win