Multilingual Representation Workshop @ EMNLP 2025
banner
mrl-workshop.bsky.social
Multilingual Representation Workshop @ EMNLP 2025
@mrl-workshop.bsky.social
The 5th edition of our workshop will be co-located with EMNLP in Suzhou, China!
https://sigtyp.github.io/ws2025-mrl.html
Finally, we recognize the Hawaiian submission to the shared task. Thank you for your contributions!
November 9, 2025 at 8:14 AM
The second is 7PERFECTION, a dataset for seven Nigerian languages!
November 9, 2025 at 8:13 AM
The first of our best contribution awards goes to AraPIQA!
November 9, 2025 at 8:12 AM
We would like to recognize three honorable mention submissions. Great work!
November 9, 2025 at 8:11 AM
@aliceatkaist.bsky.social joins us to give the final keynote of the day about code switching in multilingual language models!
November 9, 2025 at 6:04 AM
Join us in hall C3 at posters 137-168 for our in-person poster session. Join us online on Zoom via Underline for our virtual poster session!
November 9, 2025 at 3:00 AM
Now Pontus Stenetorp shares an oral history of UK-LLM!
November 9, 2025 at 1:54 AM
@kellymarchisio.bsky.social from Cohere presents “Building Multilingual LLMs in Industry”, sharing insights on training multilinguality at scale!
November 9, 2025 at 1:26 AM
It’s not too late to get involved! Until early 2026, we will be accepting submissions for languages not already represented in Global PIQA. If you’re interested, please fill out this form and we will contact you with details!
docs.google.com/forms/d/e/1F...
Global PIQA Contributor Interest Form
Thanks for your interest in contributing to Global PIQA! Please fill out the form and we will contact you with details about how to get involved!
docs.google.com
October 29, 2025 at 3:50 PM
There are seven languages where even the best proprietary LLM scores less than 80% (chance: 50%). Sub-Saharan African languages lag behind Western European languages by ~15%. Thus Global PIQA highlights languages which are very poorly served by large, proprietary models.
October 29, 2025 at 3:50 PM
The top proprietary models achieve ~90% accuracy, which falls short of human accuracy (~95%). The best open models perform significantly worse, with the best open model performance from Gemma 3 (27B) at 82.4%.
October 29, 2025 at 3:50 PM
This dataset is created and owned by the contributors, all of whom were offered authorship. We believe this is more fair to annotators and is likely to result in a higher-quality dataset, as it is constructed by the NLP researchers who will use it.
October 29, 2025 at 3:50 PM
Global PIQA includes subsets for 116 unique language varieties. These cover five continents, 14 language families, and 23 writing systems. Over 50% of examples reference local foods, customs, traditions, or other culturally-specific elements.
October 29, 2025 at 3:50 PM
Correct, you can submit ARR papers that already have their reviews by Sep 23. More instructions soon!

TBD on whether the workshop will be hybrid.
August 25, 2025 at 3:56 PM