Siva Reddy
@sivareddyg.bsky.social
1.2K followers
250 following
19 posts
Assistant Professor @Mila-Quebec.bsky.social
Co-Director @McGill-NLP.bsky.social
Researcher @ServiceNow.bsky.social
Alumni: @StanfordNLP.bsky.social, EdinburghNLP
Natural Language Processor #NLProc
Posts
Media
Videos
Starter Packs
Reposted by Siva Reddy
Siva Reddy
@sivareddyg.bsky.social
· Jul 29
I am delighted to share our new #PNAS paper, with @grvkamath.bsky.social @msonderegger.bsky.social and @sivareddyg.bsky.social, on whether age matters for the adoption of new meanings. That is, as words change meaning, does the rate of adoption vary across generations? www.pnas.org/doi/epdf/10....
Reposted by Siva Reddy
Siva Reddy
@sivareddyg.bsky.social
· May 1
Language Models Largely Exhibit Human-like Constituent Ordering Preferences
Though English sentences are typically inflexible vis-à-vis word order, constituents often show far more variability in ordering. One prominent theory presents the notion that constituent ordering is ...
arxiv.org
Siva Reddy
@sivareddyg.bsky.social
· May 1
Language Models Largely Exhibit Human-like Constituent Ordering Preferences
Though English sentences are typically inflexible vis-à-vis word order, constituents often show far more variability in ordering. One prominent theory presents the notion that constituent ordering is ...
arxiv.org
Siva Reddy
@sivareddyg.bsky.social
· May 1
Siva Reddy
@sivareddyg.bsky.social
· May 1
Reposted by Siva Reddy
Benno Krojer
@bennokrojer.bsky.social
· May 1
Congratulations to Mila members @adadtur.bsky.social , Gaurav Kamath and @sivareddyg.bsky.social for their SAC award at NAACL! Check out Ada's talk in Session I: Oral/Poster 6. Paper: arxiv.org/abs/2502.05670
Reposted by Siva Reddy
Siva Reddy
@sivareddyg.bsky.social
· Apr 3
Reposted by Siva Reddy
Benno Krojer
@bennokrojer.bsky.social
· Apr 1
Models like DeepSeek-R1 🐋 mark a fundamental shift in how LLMs approach complex problems. In our preprint on R1 Thoughtology, we study R1’s reasoning chains across a variety of tasks; investigating its capabilities, limitations, and behaviour.
🔗: mcgill-nlp.github.io/thoughtology/
🔗: mcgill-nlp.github.io/thoughtology/
Siva Reddy
@sivareddyg.bsky.social
· Apr 1
Siva Reddy
@sivareddyg.bsky.social
· Apr 1
Models like DeepSeek-R1 🐋 mark a fundamental shift in how LLMs approach complex problems. In our preprint on R1 Thoughtology, we study R1’s reasoning chains across a variety of tasks; investigating its capabilities, limitations, and behaviour.
🔗: mcgill-nlp.github.io/thoughtology/
🔗: mcgill-nlp.github.io/thoughtology/
Reposted by Siva Reddy
Reposted by Siva Reddy
Siva Reddy
@sivareddyg.bsky.social
· Mar 4
📢New Paper Alert!🚀
Human alignment balances social expectations, economic incentives, and legal frameworks. What if LLM alignment worked the same way?🤔
Our latest work explores how social, economic, and contractual alignment can address incomplete contracts in LLM alignment🧵
Human alignment balances social expectations, economic incentives, and legal frameworks. What if LLM alignment worked the same way?🤔
Our latest work explores how social, economic, and contractual alignment can address incomplete contracts in LLM alignment🧵
Siva Reddy
@sivareddyg.bsky.social
· Feb 21
Presenting ✨ 𝐂𝐇𝐀𝐒𝐄: 𝐆𝐞𝐧𝐞𝐫𝐚𝐭𝐢𝐧𝐠 𝐜𝐡𝐚𝐥𝐥𝐞𝐧𝐠𝐢𝐧𝐠 𝐬𝐲𝐧𝐭𝐡𝐞𝐭𝐢𝐜 𝐝𝐚𝐭𝐚 𝐟𝐨𝐫 𝐞𝐯𝐚𝐥𝐮𝐚𝐭𝐢𝐨𝐧 ✨
Work w/ fantastic advisors Dima Bahdanau and @sivareddyg.bsky.social
Thread 🧵:
Work w/ fantastic advisors Dima Bahdanau and @sivareddyg.bsky.social
Thread 🧵:
Reposted by Siva Reddy
Benno Krojer
@bennokrojer.bsky.social
· Dec 8
AURORA 🌌 is now accepted as a Spotlight at NeurIPS 🥂
We wondered if a model can do *controlled* video generation but in a *single* step?
So we built a dataset+model for “taking actions” on images via editing, or what you could call single-step controlled video gen
We wondered if a model can do *controlled* video generation but in a *single* step?
So we built a dataset+model for “taking actions” on images via editing, or what you could call single-step controlled video gen
Did you miss the recent Auroras? No problem! ✨🎆
Super excited to share AURORA, a *general* image editing model + high-quality data that improves where prev work fails the most:
Performing *action or movement* edits, i.e. a kind of world model setup
Insights/Details ⬇️
Super excited to share AURORA, a *general* image editing model + high-quality data that improves where prev work fails the most:
Performing *action or movement* edits, i.e. a kind of world model setup
Insights/Details ⬇️
Siva Reddy
@sivareddyg.bsky.social
· Nov 29
I’m thrilled to share that I’ve finished my Ph.D. at Mila and Polytechnique Montreal. For the last 4.5 years, I have worked on creating new faithfulness-centric paradigms for NLP Interpretability. Read my vision for the future of interpretability in our new position paper: arxiv.org/abs/2405.05386
Interpretability Needs a New Paradigm
Interpretability is the study of explaining models in understandable terms to humans. At present, interpretability is divided into two paradigms: the intrinsic paradigm, which believes that only model...
arxiv.org
Siva Reddy
@sivareddyg.bsky.social
· Nov 26
Reposted by Siva Reddy
Niclas Overby Ⓝ
@overby.me
· Nov 24