Lightnews — Scholar-powered news

Oscar Mañas

@oscmansan.bsky.social

170 followers 170 following 9 posts

Research scientist at Meta, PhD candidate at Mila and Université de Montréal. Working on multimodal vision+language generation. Català a Zúric.

Posts Replies Media Videos

Oscar Mañas

@oscmansan.bsky.social

Headed to @cvprconference.bsky.social in Nashville! I'll be presenting our work on Multimodal Reward-guided Decoding. Let's connect if you're around!

June 10, 2025 at 6:21 PM

Oscar Mañas

@oscmansan.bsky.social

TFW you find a memory leak in your code two days before the rebuttal's deadline

May 15, 2025 at 2:18 PM

Oscar Mañas

@oscmansan.bsky.social

Heading to Singapore for the next 1.5 weeks for @iclr-conf.bsky.social. If you're around and want to meet up, hit me up!

April 21, 2025 at 11:49 PM

Reposted by Oscar Mañas

Simons Institute for the Theory of Computing

@simonsinstitute.bsky.social

"Tokenize Everything!" Luke Zettlemoyer of
@uofwa.bsky.social on using GPT-like autoregressive techniques for training multimodal models (text, images, audio etc.) at the Simons Institute workshop on The Future of Language Models and Transformers simons.berkeley.edu/workshops/fu...

April 1, 2025 at 8:55 PM

Oscar Mañas

@oscmansan.bsky.social

I quite like this analogy by Oriol Vinyals:
* LLM ~= core electric brain
* Agent ~= LLM with a digital body

youtu.be/78mEYaztGaw

Gemini 2.0 and the evolution of agentic AI with Oriol Vinyals

YouTube video by Google DeepMind

youtu.be

December 22, 2024 at 1:32 AM

Oscar Mañas

@oscmansan.bsky.social

Curious about how to effectively steer the behavior of multimodal LLMs during inference to improve their visual grounding?

Join me today at 4:30pm at the AFM workshop at @NeurIPSConf, where I'll be presenting a poster on my work. Come by to learn more!

openreview.net/forum?id=VWJ...

December 14, 2024 at 6:18 PM

Oscar Mañas

@oscmansan.bsky.social

Tomorrow at 3:15pm I'll be presenting my work at @mila-quebec.bsky.social's booth (#104) at @neuripsconf.bsky.social. Come to learn more about controlling multimodal LLMs via reward-guided decoding!

🔗 openreview.net/forum?id=VWJ...

Controlling Multimodal LLMs via Reward-guided Decoding

As Multimodal Large Language Models (MLLMs) gain widespread applicability, it is becoming increasingly desirable to adapt them for diverse user needs. In this paper, we study the adaptation of...

openreview.net

December 10, 2024 at 3:04 AM

Reposted by Oscar Mañas

François Fleuret

@francois.fleuret.org

All this being said, Meta/FAIR remains the only place where you can do open AI research with a group of stellar colleagues ten times larger than any university + big-tech computational capabilities level.

December 5, 2024 at 5:48 PM

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news