Lightnews — Scholar-powered news

Mattie Fellows

@mattieml.bsky.social

1.7K followers 110 following 19 posts

Reinforcement Learning Postdoc at FLAIR, University of Oxford @universityofoxford.bsky.social

All opinions are my own.

Posts Replies Media Videos

Mattie Fellows

@mattieml.bsky.social

1/2 Offline RL has always bothered me. It promises that by exploiting offline data, an agent can learn to behave near-optimally once deployed. In real life, it breaks this promise, requiring large amount of online samples for tuning and has no guarantees of behaving safely to achieve desired goals.

May 30, 2025 at 8:39 AM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news