The dataset is public on 🤗 Hugging Face
Wondering how much VLMs know about ancient Chinese fashion over time? 👘👘👘 Check it out!!
arxiv.org/abs/2506.01565
huggingface.co/datasets/lizho…
Can Multimodal Retrieval Enhance Cultural Awareness in Vision-Language Models?
Excited to introduce RAVENEA, a new benchmark aimed at evaluating cultural understanding in VLMs through RAG.
arxiv.org/abs/2505.14462
More details:👇