Ruochen
banner
ruochenzhang.bsky.social
Ruochen
@ruochenzhang.bsky.social
PhD@browncs doing multilingual things <= Undergrad@SUTD
Reposted by Ruochen
📣 New paper!

We observe that reasoning language models finetuned only on English data are capable of zero-shot cross-lingual reasoning through a "quote-and-think" pattern.

However, this does not mean they reason the same way across all languages or in new domains.

[1/N]
May 9, 2025 at 7:53 PM
Reposted by Ruochen
📢 Calling all SEA-passionate individuals!

SEACrowd is excited to launch our contributor call for SEA-VL Phase 2: Building Visual Language Models for Southeast Asia! 🌏

After the success of Phase 1, we're now taking on a bigger mission (see thread)👇
May 8, 2025 at 9:41 AM
Reposted by Ruochen
SEA-VL: Building AI for Southeast Asian Research 🌏

We release SEA-VL, the largest vision-language dataset tailored for SEA’s diverse culture.

📜 arXiv: arxiv.org/abs/2503.07920
🤗 Data: huggingface.co/collections/...

Check the thread 🧵
March 13, 2025 at 11:36 AM
Reposted by Ruochen
LMs need linguistics! New paper, with @futrell.bsky.social, on LMs and linguistics that conveys our excitement about what the present moment means for linguistics and what linguistics can do for LMs. Paper: arxiv.org/abs/2501.17047. 🧵below.
January 29, 2025 at 4:07 PM
📣 Calling #NeurIPS2024 participants!

While everyone enjoys the last day of the beautiful Vancouver 🏔️🇨🇦, consider join our initiative and contribute to building models with more inclusivity and diversity, and mitigating implicit and explicit bias.

‼️Cuz we are in the post-training era now‼️
⭐️ We're going to launch Grassroots Science, a year-long ambitious, massive-scale, fully open-source initiative aimed at developing multilingual LLMs aligned to diverse and inclusive human preferences in Feb 2025.

🌐 Check our website: grassroots.science.

#NLProc #GrassrootsScience
Grassroots Science
A global initiative focused on developing state-of-the-art multilingual language models through grassroots efforts.
grassroots.science
December 16, 2024 at 2:25 AM
Reposted by Ruochen
⭐️ We're going to launch Grassroots Science, a year-long ambitious, massive-scale, fully open-source initiative aimed at developing multilingual LLMs aligned to diverse and inclusive human preferences in Feb 2025.

🌐 Check our website: grassroots.science.

#NLProc #GrassrootsScience
Grassroots Science
A global initiative focused on developing state-of-the-art multilingual language models through grassroots efforts.
grassroots.science
December 9, 2024 at 5:02 AM
Reposted by Ruochen
First bsky post about tinlab at #EMNLP2024! A few highlights:

* Presentations from Aditya Yedetore and Hayley Ross on neural network generalizations!
* I'm giving a keynote at GenBench & organizing BlackboxNLP
* Ask me about our faculty hiring & PhD/postdoc positions at Boston University!
November 8, 2024 at 4:59 PM