dokato.bsky.social
@dokato.bsky.social
data science, ml, ai, llms
Reposted
🚀 We are excited to introduce Kaleidoscope, the largest culturally-authentic exam benchmark.

📌 Most VLM benchmarks are English-centric or rely on translations—missing linguistic & cultural nuance. Kaleidoscope expands in-language multilingual 🌎 & multimodal 👀 VLMs evaluation
April 10, 2025 at 8:24 PM