Rishabh Kabra
rkabra.bsky.social
Rishabh Kabra
@rkabra.bsky.social
Research Engineer at Google DeepMind
I had a score disappear even when the reviewer said they will maintain their score. So it has likely nothing to do with whether the score changed.
August 4, 2025 at 5:17 PM
A self-supervised video representation model that allows visual tokens to move “off-the-grid” to represent scene elements consistently as they move across the image plane. We evaluate on downstream tasks including point tracking, monocular depth estimation, and object tracking.

moog-paper.github.io
Moving Off-the-Grid: Scene-Grounded Video Representations
Moving Off-the-Grid: Scene-Grounded Video Representations
moog-paper.github.io
December 11, 2024 at 6:58 PM
*Moving Off-the-Grid: Scene-Grounded Video Representations*.

Thursday afternoon poster.
December 11, 2024 at 6:57 PM
We learn per-object tokens (Neural Assets) that disentangle appearance and 3D pose from multi-object scenes. A sequence-of-tokens format allows us to reuse the text-to-image architecture of existing generative models.

neural-assets-paper.github.io
Neural Assets
Neural Assets
neural-assets-paper.github.io
December 11, 2024 at 6:54 PM
*Neural Assets: 3D-Aware Multi-Object Scene Synthesis with Image Diffusion Models*.

Thursday morning poster.
December 11, 2024 at 6:50 PM