Associate Professor at Polytechnique Montréal and Mila. 🇨🇦
academic.sologen.net
GAIL and DAGGER were on my radar, and both are good papers (I am a DAGGER fan!). I'd read your DQfD paper years ago. I like the idea of combining both expert data and RL. This is something we developed back in 2013 (you cited it as Kim et al. 2013).
www.sologen.net/papers/APID%...
Bagnell's paper is a nice intro.
I just found this "Is Behaviour Cloning All You Need" earlier today. Looks very interesting.
Thanks for bringing the DICE family to my attention. That's exactly what I wanted to find here.
(Haven't read the Diffusion Policies paper closely.)
courses.cs.duke.edu/cps296.3/spr...
A and X might very well have been known at the time TO was the prevalent theory, but they are not examined and compared with TO.