Lightnews — Scholar-powered news

Anastasiia Pedan

@pedanana.bsky.social

60 followers 9 following 7 posts

Posts Replies Media Videos

Anastasiia Pedan

@pedanana.bsky.social

my main takeaway from a talk on reward design in rl: ai only beat humans when they were asked not to collaborate 👀👀

August 25, 2025 at 7:07 PM

Anastasiia Pedan

@pedanana.bsky.social

Would you be surprised to learn that many empirical implementations of value-aware model learning (VAML) algos, including MuZero, lead to incorrect model & value functions when training stochastic models 🤕? In our new @icmlconf.bsky.social 2025 paper, we show why this happens and how to fix it 🦾!

June 19, 2025 at 2:40 AM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news