Research in information retrieval and conversational search: towards language models that know what they know 🧠
Homepage : victormorand.github.io
Building upon these findings, we've managed to externalize this internal mechanism, creating a general-purpose mention detector with promising results. Stay tuned! 🔜
Our method enables reconstruction of entity mentions from any representation within LLMs, allowing us to ask: “What entity is the model thinking about right now?”
💡 When reading ‘the City of Lights’ iconic monument’, the model internally “thinks” of Paris and the Eiffel Tower!
By successfully learning “Task Vectors” that steer the model to reconstruct the mention, we uncover new evidence that LLMs form dedicated internal circuits to represent and manipulate multi-token entities.
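The steering idea can be sketched in a few lines: a learned vector is added to the hidden states at one layer, nudging the model toward the mention-reconstruction behavior. This is a minimal toy illustration in PyTorch, not the paper's actual setup; the model, layer choice, and vector here are all placeholders.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Toy stand-in for a stack of transformer layers (a real LLM in the paper).
hidden_dim = 16
model = nn.Sequential(*[nn.Linear(hidden_dim, hidden_dim) for _ in range(4)])

# Illustrative "task vector": in the paper it is learned so that adding it
# to the residual stream steers the model to reconstruct the entity mention.
task_vector = torch.randn(hidden_dim)

def add_task_vector(module, inputs, output):
    # Returning a value from a forward hook replaces the layer's output.
    return output + task_vector

# Inject the vector at a middle layer.
handle = model[2].register_forward_hook(add_task_vector)
x = torch.randn(1, hidden_dim)
steered = model(x)
handle.remove()
unsteered = model(x)

# The steering vector changes the downstream computation.
print(torch.allclose(steered, unsteered))
```

The forward-hook pattern is the standard way to intervene on intermediate activations without modifying model code.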
We show that common multi-token mentions (e.g. "Eiffel Tower") can be recovered from the middle-layer hidden state of their last token alone!
Uncommon mentions aren't fully encoded this way; they are instead retrieved from the context when needed.
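Reading out that single state is mechanically simple: capture the middle-layer activation at the last token position with a hook. Below is a hedged toy sketch in PyTorch; the layer stack, dimensions, and layer index are illustrative, and the actual mention decoding in the paper happens on top of this captured vector.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

hidden_dim = 16
layers = nn.ModuleList([nn.Linear(hidden_dim, hidden_dim) for _ in range(6)])

captured = {}

def capture_last_token(module, inputs, output):
    # Keep only the final position's vector: per the claim above, this single
    # middle-layer state suffices to recover a common multi-token mention.
    captured["state"] = output[:, -1, :].detach()

middle = len(layers) // 2
handle = layers[middle].register_forward_hook(capture_last_token)

x = torch.randn(1, 5, hidden_dim)  # batch of 1, a 5-token "mention" context
h = x
for layer in layers:
    h = layer(h)
handle.remove()

print(captured["state"].shape)  # one vector per batch element
```

With a real LLM, the equivalent is requesting `output_hidden_states=True` and indexing the chosen layer at the last token.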