Yaxin L
yaxinliu.bsky.social
Yaxin L
@yaxinliu.bsky.social
Postdoc at Georgetown University. PhD in cognitive & developmental sciences.
Hybrid-intelligence| creativity| spatial cognition | gender differences
Reposted by Yaxin L
Here's another example from our lab, just accepted at ICLR, young kids can do these easily but Multi Modal Models fail.
arxiv.org/abs/2407.177...
KiVA: Kid-inspired Visual Analogies for Testing Large Multimodal Models
This paper investigates visual analogical reasoning in large multimodal models (LMMs) compared to human adults and children. A "visual analogy" is an abstract rule inferred from one image and applied ...
arxiv.org
January 23, 2025 at 6:31 PM