Jekaterina Novikova
@j-novikova-nlp.bsky.social
Principal AI research scientist @Vanguard_Group | Research in NLP, multimodal AI, LLMs, evaluation | own opinions only 🇨🇦🇪🇺🏳️🌈
The pleasure was mine! Thanks, for such an interesting conversation
October 30, 2025 at 12:43 AM
The pleasure was mine! Thanks, for such an interesting conversation
Reposted by Jekaterina Novikova
🎧 Hear Dr. Hupkes discuss her work on GenBench and how consistency, generalization, and reasoning shape our understanding of LLMs.
🎬 YouTube: www.youtube.com/watch?v=CuTW...
🎙️ Apple Podcasts: podcasts.apple.com/ca/podcast/w...
🎧 Spotify: open.spotify.com/show/51RJNlZ...
#WiAIR #NLP #WomenInAI
🎬 YouTube: www.youtube.com/watch?v=CuTW...
🎙️ Apple Podcasts: podcasts.apple.com/ca/podcast/w...
🎧 Spotify: open.spotify.com/show/51RJNlZ...
#WiAIR #NLP #WomenInAI
Generalization in AI, with Dr. Dieuwke Hupkes
YouTube video by Women in AI Research WiAIR
www.youtube.com
July 18, 2025 at 4:12 PM
🎧 Hear Dr. Hupkes discuss her work on GenBench and how consistency, generalization, and reasoning shape our understanding of LLMs.
🎬 YouTube: www.youtube.com/watch?v=CuTW...
🎙️ Apple Podcasts: podcasts.apple.com/ca/podcast/w...
🎧 Spotify: open.spotify.com/show/51RJNlZ...
#WiAIR #NLP #WomenInAI
🎬 YouTube: www.youtube.com/watch?v=CuTW...
🎙️ Apple Podcasts: podcasts.apple.com/ca/podcast/w...
🎧 Spotify: open.spotify.com/show/51RJNlZ...
#WiAIR #NLP #WomenInAI
Sounds like the way back to the closet..
February 26, 2025 at 3:10 AM
Sounds like the way back to the closet..
Results from evaluating 15 models on INCLUDE reveal stark performance variations among languages, emphasizing the need for equitable AI tools.
Public release of INCLUDE encourages further research on fair and inclusive AI.
Dataset: huggingface.co/datasets/Coh...
/4
Public release of INCLUDE encourages further research on fair and inclusive AI.
Dataset: huggingface.co/datasets/Coh...
/4
CohereForAI/include-base-44 · Datasets at Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
huggingface.co
January 23, 2025 at 4:07 PM
Results from evaluating 15 models on INCLUDE reveal stark performance variations among languages, emphasizing the need for equitable AI tools.
Public release of INCLUDE encourages further research on fair and inclusive AI.
Dataset: huggingface.co/datasets/Coh...
/4
Public release of INCLUDE encourages further research on fair and inclusive AI.
Dataset: huggingface.co/datasets/Coh...
/4
INCLUDE is the largest multilingual benchmark of its kind, containing 197,243 MCQA pairs from 1,926 examinations across 44 languages and 15 scripts coming from 52 countries.
/3
/3
January 23, 2025 at 4:07 PM
INCLUDE is the largest multilingual benchmark of its kind, containing 197,243 MCQA pairs from 1,926 examinations across 44 languages and 15 scripts coming from 52 countries.
/3
/3
LLMs hold immense potential, but performance disparities across languages limit their global impact. INCLUDE is a large multilingual language understanding
benchmark that includes regional educational, professional, and practical tests collected by native speakers.
/2
benchmark that includes regional educational, professional, and practical tests collected by native speakers.
/2
January 23, 2025 at 4:07 PM
LLMs hold immense potential, but performance disparities across languages limit their global impact. INCLUDE is a large multilingual language understanding
benchmark that includes regional educational, professional, and practical tests collected by native speakers.
/2
benchmark that includes regional educational, professional, and practical tests collected by native speakers.
/2
⏳Submission deadline: Feb 17, 2025
🗓️Workshop date: April 26-May 1, 2025 (TBD)
📍 Join us in Yokohama, Japan (also hybrid)
Submit your work and help shape the future of LLMs!
🗓️Workshop date: April 26-May 1, 2025 (TBD)
📍 Join us in Yokohama, Japan (also hybrid)
Submit your work and help shape the future of LLMs!
January 3, 2025 at 2:07 AM
⏳Submission deadline: Feb 17, 2025
🗓️Workshop date: April 26-May 1, 2025 (TBD)
📍 Join us in Yokohama, Japan (also hybrid)
Submit your work and help shape the future of LLMs!
🗓️Workshop date: April 26-May 1, 2025 (TBD)
📍 Join us in Yokohama, Japan (also hybrid)
Submit your work and help shape the future of LLMs!