Lightnews — Scholar-powered news

I just read "Welcome to the Era of Experience", advocating for agents grounded from interacting with the real world and getting their rewards from an "open-world" instead of simulation+simple reward.

I open bluesky

You propose agents working out for you. Someone is preparing for a grant 🤔

April 24, 2025 at 8:25 PM

Jean-Marc

@jeanmarcalkazzi.bsky.social

An extension could be for multi-agent systems, although way more complex to include the safety constraints (no collision)

April 4, 2025 at 8:16 PM

Jean-Marc

@jeanmarcalkazzi.bsky.social

arxiv.org/abs/2503.19173

Graph neural networks extrapolate out-of-distribution for shortest paths

Neural networks (NNs), despite their success and wide adoption, still struggle to extrapolate out-of-distribution (OOD), i.e., to inputs that are not well-represented by their training dataset. Addres...

arxiv.org

April 4, 2025 at 8:16 PM

Jean-Marc

@jeanmarcalkazzi.bsky.social

And a great blog post David wrote here: www.davidsilver.uk/wp-content/u...

www.davidsilver.uk

April 4, 2025 at 6:09 AM

Jean-Marc

@jeanmarcalkazzi.bsky.social

You can find the paper here: ojs.aaai.org/index.php/AI...

Cooperative Pathfinding | Proceedings of the AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment

ojs.aaai.org

April 4, 2025 at 6:09 AM

Jean-Marc

@jeanmarcalkazzi.bsky.social

...For every step Agent 1 makes, you reserve 2 cells in the shared table. For every step Agent 2 makes you reserve at least 1 cell.

April 4, 2025 at 6:09 AM

Jean-Marc

@jeanmarcalkazzi.bsky.social

What if some agents are slower/faster, how would the reservation table look like?
You discretize time based on their relative speed.
Imagine agent 1 4x as fast as agent 2. The shared table has cells of unit 1. Agent 1 table has cells of unit 2 and Agent 2 table cells of unit 0.5...

April 4, 2025 at 6:09 AM

Jean-Marc

@jeanmarcalkazzi.bsky.social

The idea is explained in this figure.
Agents reserve their shortest path in sequence, and a new Space-Time A* approach is used to find the shortest non-reserved path.

April 4, 2025 at 6:09 AM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news