Paper: arxiv.org/abs/2412.01770
Website: casher-robot-learning.github.io/CASHER/
USD assets: github.com/casher-robot...
Collaborators:
@marcelto.bsky.social
Carrie Yuan
Vidyaraanya Macha
@ankile.bsky.social
Anthony S
Pulkit Agrawal
@abhishekunique7.bsky.social
We realize that even 10 demos become a burden when you scale up to 100, 500, 1000... environments. After an initial training phase, we leverage the generalist policy to provide its own demos, which bootstrap RL so it can master new environments! (4/N)
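For the curious, here is a toy sketch of the self-demo idea, not the actual CASHER code. Everything in it is a hypothetical stand-in: the 1-D "environment", the noisy `generalist` policy, and names like `collect_self_demos` are made up for illustration. It also assumes that only successful rollouts are kept as demos and that they seed a replay buffer for RL; the post above does not confirm either detail.

```python
import random
from typing import Callable, List, Tuple

# Toy setup (all hypothetical): states are integers on a line,
# actions are +1 or -1, and an episode succeeds on reaching the goal.
Step = Tuple[int, int]            # (state, action)
Trajectory = List[Step]
Policy = Callable[[int], int]


def rollout(policy: Policy, start: int, goal: int, horizon: int) -> Tuple[Trajectory, bool]:
    """Run the policy for at most `horizon` steps; report whether it reached the goal."""
    state = start
    traj: Trajectory = []
    for _ in range(horizon):
        action = policy(state)
        traj.append((state, action))
        state += action
        if state == goal:
            return traj, True
    return traj, False


def collect_self_demos(policy: Policy, start: int, goal: int, horizon: int,
                       attempts: int, max_demos: int) -> List[Trajectory]:
    """Keep only successful rollouts of the pretrained policy as demos (an assumption)."""
    demos: List[Trajectory] = []
    for _ in range(attempts):
        traj, success = rollout(policy, start, goal, horizon)
        if success:
            demos.append(traj)
            if len(demos) == max_demos:
                break
    return demos


def make_noisy_generalist(rng: random.Random, p_correct: float = 0.7) -> Policy:
    """Stand-in for a pretrained generalist: usually steps toward the goal, sometimes not."""
    def policy(state: int) -> int:
        return 1 if rng.random() < p_correct else -1
    return policy


rng = random.Random(0)
generalist = make_noisy_generalist(rng)

# The generalist supplies demos in a "new" environment, replacing human demos.
demos = collect_self_demos(generalist, start=0, goal=5, horizon=20,
                           attempts=50, max_demos=10)

# Seed the RL replay buffer with those self-generated demos before fine-tuning.
replay_buffer: List[Step] = [step for traj in demos for step in traj]
```

From here an RL fine-tuning loop would sample from `replay_buffer` alongside fresh experience; that part is left out because it depends on the RL algorithm used.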