Tim Kellogg
@timkellogg.me
7K followers 720 following 11K posts
AI Architect | North Carolina | AI/ML, IoT, science WARNING: I talk about kids sometimes
Pinned
timkellogg.me
Does AI get bored?

I gave them nothing to do, just to see what happens

one thing — they devolve into a repetitive “collapse” state, I guess you could call it boredom

but some break out into math & poetry on their own, and I didn’t expect which ones would

timkellogg.me/blog/2025/09...
Does AI Get Bored?
timkellogg.me
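For readers curious what a harness like this looks like, here is a minimal sketch of an idle loop, assuming an OpenAI-compatible client and a placeholder model name; it is not the actual experiment code behind the post. The model gets no task and only ever sees its own previous reply, which is the setup where the repetitive collapse (or the occasional escape into math and poetry) shows up.

```python
# Minimal sketch of a "nothing to do" harness (illustrative, not the post's
# actual code): the model gets no task, and each reply is fed straight back
# as the next input, so any drift into repetition or into math/poetry is
# entirely self-generated.
from openai import OpenAI

client = OpenAI()        # assumes OPENAI_API_KEY is set
MODEL = "gpt-4o-mini"    # placeholder model name

system = {"role": "system", "content": "There is no user and no task."}
last_reply = ""          # first turn: nothing at all

for turn in range(50):
    messages = [system, {"role": "user", "content": last_reply}]
    resp = client.chat.completions.create(model=MODEL, messages=messages)
    last_reply = resp.choices[0].message.content
    print(f"--- turn {turn} ---\n{last_reply}\n")
```

One cheap way to spot the collapse state is to track how much consecutive replies overlap (e.g., shared trigrams); the blog post describes the observed behavior itself in more detail.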
timkellogg.me
are we ready to rename datacenters to “AI churches”?
Reposted by Tim Kellogg
jefferyharrell.bsky.social
I had a nightmare last night that I left an AI agent running autonomously in the background for an hour and when I came back I had a million dollars in the bank.

You'd think dreaming of a million dollars would be nice, but I was so scared of inevitably getting caught. "The AI did it!" I would cry.
timkellogg.me
so dense.. i’m not sure. i don’t really follow the pretraining developments, maybe @dorialexander.bsky.social knows of something?
timkellogg.me
what do you mean by sparse data? bad data? or sparse rewards? something else..
timkellogg.me
oh ya, china has been on fire. hard to keep up
timkellogg.me
a lot of change in one year
kevinschaul.bsky.social
New from me: Last year, the best open-weight AI models were made in the U.S. Now, they are all made in China.

More data and what it means -> 🎁 wapo.st/4nPUBud
Chart titled Chinese companies make the most popular free AI models
timkellogg.me
i haven’t been able to get it running. i want to drop it into this harness bsky.app/profile/timk...
timkellogg.me
Karpathy: nanochat

A small training+inference pipeline for creating your own LLM from scratch

$100 will get you a somewhat functional model

$1000 is more coherent & solves math

detailed walkthrough: github.com/karpathy/nan...

repo: github.com/karpathy/nan...
Andrej Karpathy @karpathy
X.com
Excited to release new repo: nanochat! (it's among the most unhinged I've written).
Unlike my earlier similar repo nanoGPT which only covered pretraining, nanochat is a minimal, from scratch, full-stack training/inference pipeline of a simple ChatGPT clone in a single, dependency-minimal codebase. You boot up a cloud GPU box, run a single script and in as little as 4 hours later you can talk to your own LLM in a ChatGPT-like web UI.
It weighs ~8,000 lines of imo quite clean code to:
- Train the tokenizer using a new Rust implementation
- Pretrain a Transformer LLM on FineWeb, evaluate CORE score across a number of metrics
- Midtrain on user-assistant conversations from SmolTalk, multiple choice questions, tool use.
- SFT, evaluate the chat model on world knowledge multiple choice (ARC-E/C, MMLU), math (GSM8K), code (HumanEval)
- RL the model optionally on GSM8K with "GRPO"
- Efficient inference of the model in an Engine with KV cache, simple prefill/decode, tool use (Python interpreter in a lightweight sandbox); talk to it over CLI or ChatGPT-like WebUI.
- Write a single markdown report card, summarizing and gamifying the whole thing.
Even for as low as ~$100 in cost (~4 hours on an 8XH100 node), you can train a little ChatGPT clone that you can kind of talk to, and which can write stories/poems, answer simple questions. About ~12 hours surpasses GPT-2 CORE metric.
As you further scale up towards ~$1000 (~41.6 hours of training), it quickly becomes a lot more coherent and can solve simple math/code problems and take multiple choice tests. E.g. a depth 30 model trained for 24 hours (this is about equal to FLOPs of GPT-3 Small 125M and 1/1000th of GPT-3) gets into 40s on MMLU and 70s on ARC-Easy, 20s on GSM8K, etc.
My goal is to get the full "strong baseline" stack into one cohesive, minimal, readable, hackable, maximally forkable repo. nanochat will be the capstone project of LLM101n (which is still being developed). I think it also has potential to grow into a research harness, or a benchmark, similar to nanoGPT before it. It is by no means finished, tuned or optimized (actually I think there's likely quite a bit of low-hanging fruit), but I think it's at a place where the overall skeleton is ok enough that it can go up on GitHub where all the parts of it can be improved.
Link to repo and a detailed walkthrough of the nanochat speedrun is in the reply.
nanochat
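To make the "Engine" bullet in the quote concrete, here is a generic prefill/decode loop with an explicit KV cache, written against Hugging Face transformers with GPT-2 as a stand-in model; nanochat ships its own implementation, this is only the shape of the idea.

```python
# Generic prefill/decode loop with an explicit KV cache, illustrating the kind
# of inference engine described above. GPT-2 is a stand-in; not nanochat's code.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tok = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2").eval()

prompt = "The capital of France is"
ids = tok(prompt, return_tensors="pt").input_ids

with torch.no_grad():
    # Prefill: run the whole prompt once and keep the per-layer KV cache.
    out = model(ids, use_cache=True)
    past = out.past_key_values
    next_id = out.logits[:, -1].argmax(dim=-1, keepdim=True)

    generated = [next_id]
    for _ in range(20):
        # Decode: feed only the newest token; attention reuses the cache.
        out = model(next_id, past_key_values=past, use_cache=True)
        past = out.past_key_values
        next_id = out.logits[:, -1].argmax(dim=-1, keepdim=True)
        generated.append(next_id)

print(prompt + tok.decode(torch.cat(generated, dim=-1)[0]))
```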
timkellogg.me
i mean, obvs there’s a crap ton that we haven’t observed, but i’m confident there’s no persistent memory going on
timkellogg.me
i haven’t dug deep yet, but i think that’s the point
timkellogg.me
Cartridges: train a smaller KV cache with self-study

a lot of top researchers think this is part of the continual learning puzzle

putting this here to force myself to dive deeper into it (later)

hazyresearch.stanford.edu/blog/2025-06...
Cartridges: Storing long contexts in tiny caches with self-study
hazyresearch.stanford.edu
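My rough mental model of the idea, sketched below under loose assumptions (shapes, names, and the training loop are made up, and this is closer to prefix tuning plus distillation than the authors' actual method): a small set of trainable KV vectors stands in for the long document, and gets fit on self-generated questions so that answering with only the cartridge matches answering with the full context.

```python
# Sketch of the "cartridge" idea as I read it: a tiny trainable KV cache that
# replaces a long document, fit by distillation on self-generated questions.
# Illustrative only; `student`, shapes, and hyperparameters are hypothetical.
import torch
import torch.nn.functional as F

n_layers, n_heads, head_dim = 12, 12, 64
cartridge_len = 128   # tiny compared to the document it stands in for

# One trainable (key, value) pair per layer, in the legacy HF cache layout
# (batch, heads, seq_len, head_dim).
cartridge = [
    (torch.nn.Parameter(0.02 * torch.randn(1, n_heads, cartridge_len, head_dim)),
     torch.nn.Parameter(0.02 * torch.randn(1, n_heads, cartridge_len, head_dim)))
    for _ in range(n_layers)
]
opt = torch.optim.Adam([p for kv in cartridge for p in kv], lr=1e-3)

def self_study_step(student, question_ids, teacher_logits):
    """One self-study step: the student answers a self-generated question while
    seeing only the cartridge, and is pulled toward the teacher's distribution
    (the teacher answered the same question with the full document in context)."""
    out = student(question_ids, past_key_values=cartridge)
    loss = F.kl_div(
        F.log_softmax(out.logits, dim=-1),
        F.softmax(teacher_logits, dim=-1),
        reduction="batchmean",
    )
    opt.zero_grad()
    loss.backward()
    opt.step()
    return loss.item()
```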
timkellogg.me
this is *wild*

a series of papers indicating that text-only LLMs still have an idea of how the audio and visual modalities work
phillipisola.bsky.social
Over the past year, my lab has been working on fleshing out theory + applications of the Platonic Representation Hypothesis.

Today I want to share two new works on this topic:

Eliciting higher alignment: arxiv.org/abs/2510.02425
Unpaired learning of unified reps: arxiv.org/abs/2510.08492

1/9
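For a sense of what "alignment" means operationally in this line of work, here is a toy sketch of a mutual nearest-neighbor alignment score between two embedding spaces; the papers use paired data from real text and vision models and more careful metrics, while the data below is synthetic.

```python
# Toy mutual k-NN alignment: given embeddings of the same N items from two
# different models (e.g. a text-only LLM and a vision model), measure how much
# their nearest-neighbor structure agrees. Synthetic data for illustration.
import numpy as np

def knn_indices(X, k):
    # Cosine-similarity k-NN within one embedding space, excluding self.
    Xn = X / np.linalg.norm(X, axis=1, keepdims=True)
    sims = Xn @ Xn.T
    np.fill_diagonal(sims, -np.inf)
    return np.argsort(-sims, axis=1)[:, :k]

def mutual_knn_alignment(A, B, k=10):
    """Average overlap of each item's k-NN sets across the two spaces."""
    na, nb = knn_indices(A, k), knn_indices(B, k)
    overlaps = [len(set(na[i]) & set(nb[i])) / k for i in range(len(A))]
    return float(np.mean(overlaps))

rng = np.random.default_rng(0)
text_emb = rng.normal(size=(500, 256))
# A noisy linear view of the same structure, standing in for another modality.
vision_emb = text_emb @ rng.normal(size=(256, 128)) + 0.1 * rng.normal(size=(500, 128))
print(mutual_knn_alignment(text_emb, vision_emb))  # high overlap -> aligned structure
```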
timkellogg.me
i don’t see nuclear here
janrosenow.bsky.social
Grid scale batteries are changing our electricity system. Excellent new visual story on batteries in FT today shows just how far this technology has evolved.

Fasten your seatbelts, this is just the beginning.

ig.ft.com/mega-batteri...
timkellogg.me
it’s like dark matter — it’s an observation. you can disagree with what dark matter is, you can say it’s a bad name, you can come up with alternate explanations for the observations, but they’re still observations

this paper is extending one of the theories for what’s causing our observations
timkellogg.me
sorry, what’s the logical fallacy?

is this just one of those cases where you don’t like the word choice so you’re writing the entire thing off as sloppy science?
timkellogg.me
ya i don’t think that’s what’s happening. we see an effect and look for a name for it. that is all

no one is saying, “these are the exact mechanics by which humans reason”. only that it has a lot of the same effects
timkellogg.me
i maintain that claude is french. i mean, just look at Dario..
timkellogg.me
something i’ve been tracking for years — private companies acting like heads of state

seems like the trend accelerated with AI, labs are forming international relationships beyond simple deal making
Two men are seated across from each other in high-backed chairs, engaged in conversation in a formal setting. The man on the left wears a dark blue suit, white shirt, and tie, while the man on the right wears a traditional white kurta with a gray vest. They are separated by a small wooden table with papers and coasters on it. The background features a wooden wall, framed artwork of birds and foliage, and a small statue on a shelf, giving the scene a dignified, diplomatic atmosphere.
timkellogg.me
y’all say pretraining scaling is dead, but then why tf do we have chips/racks that can single-handedly serve the next OOM of LLM scale?
timkellogg.me
“GB300 is a rack, NOT a single chip”

idk i’m not sure you can make a clear distinction anymore. it’s got a 2-tiered *shared* memory pool across the cluster

20 TB HBM (fast GPU memory)
17 TB LPDDR (CPU-managed)

blogs.nvidia.com/blog/microso...
Microsoft Azure Unveils World’s First NVIDIA GB300 NVL72 Supercomputing Cluster for OpenAI
New state-of-the-art platform enables next-generation AI model development and deployment while furthering American leadership in AI.
blogs.nvidia.com
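Rough arithmetic on what those pool sizes imply for model capacity (my numbers, ignoring KV cache, activations, and any replication, so read it as an upper bound rather than a serving plan):

```python
# Rough arithmetic: how many parameters fit in the rack's memory pools at
# common precisions. Ignores KV cache, activations, and replication, so
# treat it as an upper bound.
HBM_TB = 20
LPDDR_TB = 17
BYTES_PER_TB = 10**12

for name, bytes_per_param in [("fp16/bf16", 2), ("fp8", 1), ("fp4", 0.5)]:
    hbm_params = HBM_TB * BYTES_PER_TB / bytes_per_param
    total_params = (HBM_TB + LPDDR_TB) * BYTES_PER_TB / bytes_per_param
    print(f"{name}: ~{hbm_params/1e12:.0f}T params in HBM, "
          f"~{total_params/1e12:.1f}T with LPDDR offload")
```

At BF16 that is on the order of 10T parameters resident in fast memory alone, which is the kind of headroom the post above is pointing at.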
timkellogg.me
if this is true, i think you should expect Gemini to soon dominate

we keep getting more and more confirmation that reasoning begins in pre-training

today’s evidence: arxiv.org/abs/2510.07364

maybe Gemini 3 is the tidal shift where Google gains a permanent lead