Lightnews — Scholar-powered news

Amir-massoud Farahmand

@sologen.bsky.social

650 followers 200 following 160 posts

Research Goal: Understanding the computational and statistical principles required to design AI/RL agents.
Associate Professor at Polytechnique Montréal and Mila. 🇨🇦
academic.sologen.net

Posts Replies Media Videos

Amir-massoud Farahmand

@sologen.bsky.social

Thank you!
GAIL and DAGGER were on my radar, and both are good papers (I am a DAGGER fan!). I'd read your DQfD paper years ago. I like the idea of combining both expert data and RL. This is something we developed back in 2013 (you cited it as Kim et al. 2013).
www.sologen.net/papers/APID%...

December 16, 2025 at 3:55 AM

Amir-massoud Farahmand

@sologen.bsky.social

Thank you!
Bagnell's paper is nice intro.
I just found this "Is Behaviour Cloning All You Need" earlier today. Looks very interesting.

December 16, 2025 at 3:48 AM

Amir-massoud Farahmand

@sologen.bsky.social

Thank you! GAIL was under my radar, and is a good paper.
Thanks for bringing the DICE family to my attention. That's exactly what I wanted to find here.
(Haven't read the Diffusion Policies paper closely.)

December 16, 2025 at 1:33 AM

Amir-massoud Farahmand

@sologen.bsky.social

I am asking for a course I am designing. I have some papers in mind, but I want to make sure I am not missing a good paper.

December 16, 2025 at 12:33 AM

Amir-massoud Farahmand

@sologen.bsky.social

This is from Michael Littman's Markov games as a framework for multi-agent reinforcement (1994). Simple, yet powerful!
courses.cs.duke.edu/cps296.3/spr...

courses.cs.duke.edu

December 12, 2025 at 3:18 AM

Amir-massoud Farahmand

@sologen.bsky.social

The theory of Markov Decision Processes (MDP’s), which underlies much of the recent work on reinforcement learning, assumes that the agent’s environment is stationary and as such contains no other adaptive agents."

December 12, 2025 at 3:18 AM

Amir-massoud Farahmand

@sologen.bsky.social

cf. Wittgenstein's last statement in Tractatus: "What we cannot speak about we must pass over in silence".

November 29, 2025 at 8:56 PM

Amir-massoud Farahmand

@sologen.bsky.social

No, it is not virtue signalling! :P

November 29, 2025 at 7:33 AM

Amir-massoud Farahmand

@sologen.bsky.social

How many bits of "creative" information, injected in the form of nudging prompts, are needed in order to get to a point in the space of ideas that is truely novel?

November 16, 2025 at 4:13 AM

Amir-massoud Farahmand

@sologen.bsky.social

A related though: Some people may say LLMs can just generate ideas similar to what they have seen in their data. We can argue about that, but even if they do, one can ask a question similar to Aha! Distance:

November 16, 2025 at 4:13 AM

Amir-massoud Farahmand

@sologen.bsky.social

How many of these questions are needed in order to successfully transition from TO to TN? This is what I call Aha! Distance.

November 16, 2025 at 4:13 AM

Amir-massoud Farahmand

@sologen.bsky.social

The path from TO to TN is paved by some major questions in the form: "... but TO doesn't predict A" or "what if you do X instead of Y?".
A and X might very well be already known at the time of TO being the prevalant theory, but they are not examined and compared with TO.

November 16, 2025 at 4:13 AM

Amir-massoud Farahmand

@sologen.bsky.social

We know who are on the faculty hiring committees ... !

November 15, 2025 at 1:08 AM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news