Checkout our paper for more details:
📜 arxiv.org/pdf/2510.26784
💻 github.com/arnab-api/fi...
🌐 filter.baulab.info
Checkout our paper for more details:
📜 arxiv.org/pdf/2510.26784
💻 github.com/arnab-api/fi...
🌐 filter.baulab.info
Check Henderson & Morris Jr (1976): dl.acm.org/doi/abs/10....
Check Henderson & Morris Jr (1976): dl.acm.org/doi/abs/10....
We investigate this further and find that when the question is presented first, the LM can can *eagerly* evaluate each option as it sees them, and store a "flag" directly in the latents.
We investigate this further and find that when the question is presented first, the LM can can *eagerly* evaluate each option as it sees them, and store a "flag" directly in the latents.
Also checkout @jackmerullo.bsky.social's work on LLM's reusing sub-circuits in different tasks.
x.com/jack_merull...
Also checkout @jackmerullo.bsky.social's work on LLM's reusing sub-circuits in different tasks.
x.com/jack_merull...
bsky.app/profile/sfe...
bsky.app/profile/sfe...
To test them, we transport their query states from one context to another. We find that will trigger the execution of the same filtering operation, even if the new context has a new list of items and format!
To test them, we transport their query states from one context to another. We find that will trigger the execution of the same filtering operation, even if the new context has a new list of items and format!