Listen "Teaching robot policies without new demonstrations: interview with Jiahui Zhang and Jesse Zhang"
Episode Synopsis
The ReWiND method, which consists of three phases: learning a reward function, pre-training, and using the reward function and pre-trained policy to learn a new language-specified task online. In their paper ReWiND: Language-Guided Rewards Teach Robot Policies without New Demonstrations, which was presented at CoRL 2025, Jiahui Zhang, Yusen Luo, Abrar Anwar, Sumedh A. Sontakke, […]
ZARZA We are Zarza, the prestigious firm behind major projects in information technology.