Tim G. J. Rudner, Vitchyr H. Pong, Rowan McAllister, Yarin Gal, Sergey Levine · Outcome-Driven Reinforcement Learning via Variational Inference · SlidesLive

Categories

EN

Log in Talk to sales

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: Outcome-Driven Reinforcement Learning via Variational Inference

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v3-stream-016-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v3-stream-016-alpha.b-cdn.net
sl-yoda-v3-stream-016-beta.b-cdn.net
1504562137.rsc.cdn77.org
1896834465.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Outcome-Driven Reinforcement Learning via Variational Inference

Outcome-Driven Reinforcement Learning via Variational Inference

Dec 6, 2021

Speakers

Tim G. J. Rudner

Speaker · 2 followers

Vitchyr H. Pong

Speaker · 0 followers

Rowan McAllister

Speaker · 0 followers

About

While reinforcement learning algorithms provide automated acquisition of optimal policies, practical application of such methods requires a number of design decisions, such as manually designing reward functions that not only define the task, but also provide sufficient shaping to accomplish it. In this paper, we view reinforcement learning as inferring policies that achieve desired outcomes, rather than as a problem of maximizing rewards. To solve this inference problem, we establish a novel va…

Organizer

NeurIPS 2021

Account · 1.9k followers

About NeurIPS 2021

Neural Information Processing Systems (NeurIPS) is a multi-track machine learning and computational neuroscience conference that includes invited talks, demonstrations, symposia and oral and poster presentations of refereed papers. Following the conference, there are workshops which provide a less formal setting.

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

End-To-End Online sPHENIX Trigger Detection Pipeline

05:00

End-To-End Online sPHENIX Trigger Detection Pipeline

Watch later

Favorite

Tingting Xuan, …

NeurIPS 2021 3 years ago

Bandit Quickest Changepoint Detection

15:01

Bandit Quickest Changepoint Detection

Watch later

Favorite

Aditya Gopalan, …

NeurIPS 2021 3 years ago

Equinox: neural networks in JAX via callable PyTrees and filtered transformations

07:32

Equinox: neural networks in JAX via callable PyTrees and filtered transformations

Watch later

Favorite

NeurIPS 2021 3 years ago

Heavy Ball Momentum for Conditional Gradient

11:41

Heavy Ball Momentum for Conditional Gradient

Watch later

Favorite

Biangcong Li, …

NeurIPS 2021 3 years ago

Tractable Density Estimation on Learned Manifolds with Conformal Embedding Flows

14:45

Tractable Density Estimation on Learned Manifolds with Conformal Embedding Flows

Watch later

Favorite

Brendan Leigh Ross, …

NeurIPS 2021 3 years ago

To The Point: Correspondence-driven monocular 3D category reconstruction

05:27

To The Point: Correspondence-driven monocular 3D category reconstruction

Watch later

Favorite

Filippos Kokkinos

NeurIPS 2021 3 years ago