And if you happen to be in Singapore, I'll be teaching a for-credit version of this class again next semester at NUS, under CS6208! Reach out if you're interested in auditing.
cosilab.notion.site/cs6101-raci-...
And if you happen to be in Singapore, I'll be teaching a for-credit version of this class again next semester at NUS, under CS6208! Reach out if you're interested in auditing.
cosilab.notion.site/cs6101-raci-...
bsky.app/profile/xuan...
twitter.com/xuanalogue/s...
bsky.app/profile/xuan...
@lanceying.bsky.social (paper lead), @heyodogo.bsky.social, Katie M Colins, Megan Wei, Ced Zhang, @tbrookewilson.bsky.social, and my co-senior authors Lio Wong + @joshtenenbaum.bsky.social.
@lanceying.bsky.social (paper lead), @heyodogo.bsky.social, Katie M Colins, Megan Wei, Ced Zhang, @tbrookewilson.bsky.social, and my co-senior authors Lio Wong + @joshtenenbaum.bsky.social.
Or find the paper here:
aclanthology.org/2025.finding...
Or find the paper here:
aclanthology.org/2025.finding...
In contrast, large reasoning models like OpenAI o3 show a much weaker correlation, in line with other work indicating that LRMs struggle at ToM reasoning.
In contrast, large reasoning models like OpenAI o3 show a much weaker correlation, in line with other work indicating that LRMs struggle at ToM reasoning.
- From language, LLMs synthesize code representing agent + env. models that support coherent inference
- VLMs parse images to env. states
- From language, LLMs synthesize code representing agent + env. models that support coherent inference
- VLMs parse images to env. states