Skip to yearly menu bar Skip to main content


(4 events)   Timezone:  
Show all
Toggle Poster Visibility
Oral
Thu Jul 12 05:30 AM -- 05:50 AM (PDT) @ A1
The Mirage of Action-Dependent Baselines in Reinforcement Learning
George Tucker · Surya Bhupatiraju · Shixiang Gu · Richard E Turner · Zoubin Ghahramani · Sergey Levine
[ PDF [ Video
Oral
Thu Jul 12 05:50 AM -- 06:10 AM (PDT) @ A1
Smoothed Action Value Functions for Learning Gaussian Policies
Ofir Nachum · Mohammad Norouzi · George Tucker · Dale Schuurmans
[ PDF [ Video
Oral
Thu Jul 12 06:10 AM -- 06:20 AM (PDT) @ A1
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja · Aurick Zhou · Pieter Abbeel · Sergey Levine
[ PDF
Oral
Thu Jul 12 06:20 AM -- 06:30 AM (PDT) @ A1
Addressing Function Approximation Error in Actor-Critic Methods
Scott Fujimoto · Herke van Hoof · David Meger
[ PDF [ Video