Lightnews — Scholar-powered news

Emanuele Marconato

@ema-ridopoco.bsky.social

280 followers 270 following 23 posts

Post-doc @ University of Trento. I did my PhD @ University of Trento and the University of Pisa. I like #concepts, #symbols, and #representations, but I still don't know what they are.

📍 Trento, Italy
🧵 #identifiability, #shortcuts, #interpretability

Posts Replies Media Videos

Emanuele Marconato

@ema-ridopoco.bsky.social

Joint work with Sebastian Weichwald, Sebastien Lachapelle, and Luigi Gresele 🙏

For more info, check the full paper 👇

arxiv.org/abs/2410.235...

https://arxiv.org/abs/2410.23501

t.co

June 17, 2025 at 3:12 PM

Emanuele Marconato

@ema-ridopoco.bsky.social

🧵Summary

A mathematical proof that, under suitable conditions, linear properties hold for either all or none of the equivalent models with same next-token distribution 😎

Exciting open questions on empirical findings remain🤔 - check Section 6 (Discussion) in the paper!

8/9

June 17, 2025 at 3:12 PM

Emanuele Marconato

@ema-ridopoco.bsky.social

3⃣ We demonstrate what linear properties are shared by all or none LLMs.

🔥 Under mild assumptions, relational linear properties are shared!

⚠️ Parallel vectors may not be shared (they are under diversity)!

7/9

June 17, 2025 at 3:12 PM

Emanuele Marconato

@ema-ridopoco.bsky.social

We also describe other linear properties: linear subspaces, probing, steering, based on relational strings (Paccanaro and Hinton, 2001).

💡They arise when the LLM can predict next-tokens for textual queries like: "What is the written language?" for many context strings!

6/9

June 17, 2025 at 3:12 PM

Emanuele Marconato

@ema-ridopoco.bsky.social

2⃣ We reformulate linear properties of LLMs based on textual strings, depending on how LLMs predict next tokens

💡Parallel vectors arise from same log-ratios of next-token probs

E.g. same ratio for "easy"/"easiest" and "strong"/"strongest" in all contexts => parallel vecs

5/9

June 17, 2025 at 3:12 PM

Emanuele Marconato

@ema-ridopoco.bsky.social

💡The extended linear equivalence underlies that two models' representations are linearly related, but in a subspace

‼️Outside that subspace, representations can differ a lot!

4/9

June 17, 2025 at 3:12 PM

Emanuele Marconato

@ema-ridopoco.bsky.social

1⃣We extend the results by Khemakem et al. (2020), Roeder et al. (2021), removing a diversity assumption.

For the first time, we relate models with different repr. dimensions & find that repr.s of LLMs with same distribution are related by an “extended linear equivalence”!

3/9

June 17, 2025 at 3:12 PM

Emanuele Marconato

@ema-ridopoco.bsky.social

Contributions:

1⃣ An identifiability result for LLMs
2⃣A 𝙧𝙚𝙡𝙖𝙩𝙞𝙤𝙣𝙖𝙡 reformulation of linear properties
3⃣ A proof of what properties are 𝙘𝙤𝙫𝙖𝙧𝙞𝙖𝙣𝙩 (~to Physics, cf. Villar et al. (2023)): hold for all or none of the LLMs with same next-token distribution

2/9

June 17, 2025 at 3:12 PM

Emanuele Marconato

@ema-ridopoco.bsky.social

@yanai.bsky.social this is very interesting!! FYI, we studied the ubiquity, rather than emergence, of linear relational properties here:

openreview.net/forum?id=XCm...