Rishabh Kabra
rkabra.bsky.social
Research Engineer at Google DeepMind
Reposted by Rishabh Kabra
Scaling 4D Representations

Self-supervised learning from video does scale! In our latest work, we scaled masked auto-encoding models to 22B params, boosting performance on pose estimation, tracking & more.

Paper: arxiv.org/abs/2412.15212
Code & models: github.com/google-deepmind/representations4d
July 10, 2025 at 11:52 AM
Veo 3: Celebrating festival season
YouTube video by Google UK
June 30, 2025 at 10:51 AM
Reposted by Rishabh Kabra
Generative Video Diffusion: does a model trained with this objective learn better features than one trained for image generation?

We investigated this question and more in our latest work, please check it out!

*From Image to Video: An Empirical Study of Diffusion Representations*
arxiv.org/abs/2502.07001
February 13, 2025 at 4:11 PM
I’m hanging out at NeurIPS this week. Come check out my co-authors’ presentations of the following Spotlight papers!
December 11, 2024 at 6:38 PM