Tobias Weyand
tobw.net
Tobias Weyand
@tobw.net
Researcher at Google DeepMind working towards human-level video understanding

🔗 tobw.net
We're excited to release Minerva 🕵️‍♀️, a benchmark to evaluate if AI can truly reason about videos, from spotting game-changing moments in sports 🏀 to understanding character motivations in short films 🍿. We provide the "why" behind the answers! Pointers below 👇
May 13, 2025 at 12:06 AM
6yo daughter: Papa, are you the boss of Google?
Me: No
6yo daughter: Why?
January 8, 2025 at 4:05 AM
Excited to share Long-Video Masked Autoencoder (LVMAE) our team just published at NeurIPS'24! We boost the context length of video models using an adaptive decoder and a dual-masking strategy and achieve SotA on several video benchmarks.

Paper: arxiv.org/abs/2411.13683
The blogpost is out about our recent work on training masked autoencoders on long(-er) videos. The paper was accepted to NeurIPS`24.
More at: goo.gle/4fW5aIc
Extending video masked autoencoders to 128 frames
goo.gle
December 5, 2024 at 10:56 PM
Is there a better way to find the publication venue of an ArXiv paper than searching for the title on Google / Google Scholar / OpenReview and checking authors' websites?
November 24, 2024 at 11:16 PM
Tap, tap. Is this thing on?
November 24, 2024 at 11:13 PM