Chengzu
chengzu-li.bsky.social
Chengzu
@chengzu-li.bsky.social
PhD student at Language Technology Lab, University of Cambridge
Forget just thinking in words.

🔔Our New Preprint:
🚀 New Era of Multimodal Reasoning🚨
🔍 Imagine While Reasoning in Space with MVoT

Multimodal Visualization-of-Thought (MVoT) revolutionizes reasoning by generating visual "thoughts" that transform how AI thinks, reasons, and explains itself.
January 14, 2025 at 2:50 PM