I am a Computer Vision scientist building a company to help manufacturing supervisors optimize assembly line operations.
I did my PhD from ETH Zurich in 2013, lifting object detections in single images to 3D scene representations.
Models today can label pixels and detect objects with high accuracy. But does that mean they truly understand scenes?
Super excited to share our new paper and a new task in computer vision: Visual Jenga!
📄 arxiv.org/abs/2503.21770
🔗 visualjenga.github.io
Models today can label pixels and detect objects with high accuracy. But does that mean they truly understand scenes?
Super excited to share our new paper and a new task in computer vision: Visual Jenga!
📄 arxiv.org/abs/2503.21770
🔗 visualjenga.github.io
(Chain of thought/think step-by-step was the first powerful prompting technique that was discovered, now Reasoners do it automatically
(Chain of thought/think step-by-step was the first powerful prompting technique that was discovered, now Reasoners do it automatically
"Virtual employees" formulation is being pushed by sales teams as it establishes a high price point for the AI tool and is easy to fit in existing enterprise budgets. Just replace the headcount with the AI employee.
But it's a terrible UX, and that's why it's going to fail.
"Virtual employees" formulation is being pushed by sales teams as it establishes a high price point for the AI tool and is easy to fit in existing enterprise budgets. Just replace the headcount with the AI employee.
But it's a terrible UX, and that's why it's going to fail.
I am a Computer Vision scientist building a company to help manufacturing supervisors optimize assembly line operations.
I did my PhD from ETH Zurich in 2013, lifting object detections in single images to 3D scene representations.
I am a Computer Vision scientist building a company to help manufacturing supervisors optimize assembly line operations.
I did my PhD from ETH Zurich in 2013, lifting object detections in single images to 3D scene representations.
Our latest work addresses this problem!
YT: youtu.be/lEUluMdNHcc
arXiv: arxiv.org/abs/2412.02930
Our latest work addresses this problem!
YT: youtu.be/lEUluMdNHcc
arXiv: arxiv.org/abs/2412.02930