Amir-massoud Farahmand
sologen.bsky.social
Amir-massoud Farahmand
@sologen.bsky.social
Research Goal: Understanding the computational and statistical principles required to design AI/RL agents.
Associate Professor at Polytechnique Montréal and Mila. 🇨🇦
academic.sologen.net
Thank you!
GAIL and DAGGER were on my radar, and both are good papers (I am a DAGGER fan!). I'd read your DQfD paper years ago. I like the idea of combining both expert data and RL. This is something we developed back in 2013 (you cited it as Kim et al. 2013).
www.sologen.net/papers/APID%...
December 16, 2025 at 3:55 AM
Thank you!
Bagnell's paper is nice intro.
I just found this "Is Behaviour Cloning All You Need" earlier today. Looks very interesting.
December 16, 2025 at 3:48 AM
Thank you! GAIL was under my radar, and is a good paper.
Thanks for bringing the DICE family to my attention. That's exactly what I wanted to find here.
(Haven't read the Diffusion Policies paper closely.)
December 16, 2025 at 1:33 AM
I am asking for a course I am designing. I have some papers in mind, but I want to make sure I am not missing a good paper.
December 16, 2025 at 12:33 AM
This is from Michael Littman's Markov games as a framework for multi-agent reinforcement (1994). Simple, yet powerful!
courses.cs.duke.edu/cps296.3/spr...
courses.cs.duke.edu
December 12, 2025 at 3:18 AM
The theory of Markov Decision Processes (MDP’s), which underlies much of the recent work on reinforcement learning, assumes that the agent’s environment is stationary and as such contains no other adaptive agents."
December 12, 2025 at 3:18 AM
cf. Wittgenstein's last statement in Tractatus: "What we cannot speak about we must pass over in silence".
November 29, 2025 at 8:56 PM
No, it is not virtue signalling! :P
November 29, 2025 at 7:33 AM
How many bits of "creative" information, injected in the form of nudging prompts, are needed in order to get to a point in the space of ideas that is truely novel?
November 16, 2025 at 4:13 AM
A related though: Some people may say LLMs can just generate ideas similar to what they have seen in their data. We can argue about that, but even if they do, one can ask a question similar to Aha! Distance:
November 16, 2025 at 4:13 AM
How many of these questions are needed in order to successfully transition from TO to TN? This is what I call Aha! Distance.
November 16, 2025 at 4:13 AM
The path from TO to TN is paved by some major questions in the form: "... but TO doesn't predict A" or "what if you do X instead of Y?".
A and X might very well be already known at the time of TO being the prevalant theory, but they are not examined and compared with TO.
November 16, 2025 at 4:13 AM
We know who are on the faculty hiring committees ... !
November 15, 2025 at 1:08 AM