Lightnews — Scholar-powered news

Lee Sharkey

@leesharkey.bsky.social

630 followers 110 following 12 posts

Scruting matrices @ Apollo Research

Posts Replies Media Videos

Lee Sharkey

@leesharkey.bsky.social

New interpretability paper from Apollo Research!

🟢Attribution-based Parameter Decomposition 🟢

It's a new way to decompose neural network parameters directly into mechanistic components.

It overcomes many of the issues with SAEs! 🧵

January 27, 2025 at 7:29 PM

Reposted by Lee Sharkey

Laura

@lauraruis.bsky.social

To my surprise, we find the opposite of what I thought when we started this project:

The approach to reasoning LLMs use looks unlike retrieval, and more like a generalisable strategy synthesising procedural knowledge from many documents doing a similar form of reasoning.

November 20, 2024 at 4:35 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news