Nicolas Espinosa Dice
nico-espinosa-dice.bsky.social
cs phd student @cornelluniversity.bsky.social. previously @harveymuddcollege.bsky.social. working on reinforcement learning & generative models. https://nico-espinosadice.github.io/
Reposted by Nicolas Espinosa Dice
Shortcut models enable scaling offline RL, both at train-time and at test-time! We beat so many other algorithms on so many tasks we had to stick most of the results in the appendix 😅. Very proud of @nico-espinosa-dice.bsky.social for spearheading this project, check out his thread!
by incorporating self-consistency during offline RL training, we unlock three orthogonal directions of scaling:

1. efficient training (i.e. limit backprop through time)
2. expressive model classes (e.g. flow matching)
3. inference-time scaling (sequential and parallel)
June 12, 2025 at 11:14 PM
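The self-consistency idea behind shortcut models can be sketched in a toy form: a model predicts a velocity conditioned on the step size d, and training penalizes disagreement between one jump of size 2d and two chained jumps of size d. This is a hedged illustration, not the paper's code; the linear `velocity` model and `self_consistency_loss` helper are hypothetical stand-ins (real shortcut models use a neural network and combine this term with a flow-matching objective).

```python
import numpy as np

def velocity(params, x, t, d):
    # Toy linear "model" of the step-size-conditioned velocity field.
    # Real shortcut models use a neural net here.
    feats = np.concatenate([x, [t, d]])
    return params @ feats

def self_consistency_loss(params, x, t, d):
    # Two small steps of size d...
    v1 = velocity(params, x, t, d)
    x_mid = x + d * v1
    v2 = velocity(params, x_mid, t + d, d)
    two_step = x_mid + d * v2
    # ...should land where one large step of size 2d lands.
    one_step = x + 2 * d * velocity(params, x, t, 2 * d)
    return float(np.mean((one_step - two_step) ** 2))

rng = np.random.default_rng(0)
params = rng.normal(size=(4, 6)) * 0.1   # hypothetical weights
x = rng.normal(size=4)                   # hypothetical state/noise sample
loss = self_consistency_loss(params, x, t=0.0, d=0.25)
```

Because the model is consistent across step sizes, inference can trade steps for quality: fewer, larger jumps for speed, or more, smaller jumps (plus parallel sampling) when scaling test-time compute.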
Excited to share our new paper, "Efficient Imitation Under Misspecification"

@gokul.dev's thread ↓
I think of misspecification (e.g. embodiment / sensory gaps) as the fundamental reason behavioral cloning isn't "all you need" for imitation: matching actions ≠ matching outcomes. Introducing @nico-espinosa-dice.bsky.social's #ICLR2025 paper proving that "local search" *is* all you need! [1/n]
April 7, 2025 at 10:08 PM