Lightnews — Scholar-powered news

Agneet Chatterjee

@agneet.bsky.social

150 followers 37 following 2 posts

Image and Video generation.

https://agneetchatterjee.com/

Posts Replies Media Videos

Agneet Chatterjee

@agneet.bsky.social

We also develop a benchmark to evaluate spatial understanding of VLM's. The core idea is to use synthetic images which avoids any possibility of test time leakage: arxiv.org/abs/2408.02231

REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models

Text-to-Image (T2I) and multimodal large language models (MLLMs) have been adopted in solutions for several computer vision and multimodal learning tasks. However, it has been found that such vision-l...

arxiv.org

November 26, 2024 at 3:26 PM

Agneet Chatterjee

@agneet.bsky.social

@csprofkgd.bsky.social could you add me too? Thank you!

November 24, 2024 at 9:11 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news