            Outcome-Driven Reinforcement Learning via Variational Inference

            Dec 6, 2021

            Speakers

            Tim G. J. Rudner

            Speaker

            Vitchyr H. Pong

            Speaker

            Rowan McAllister

            Speaker

            About

            While reinforcement learning algorithms provide automated acquisition of optimal policies, practical application of such methods requires a number of design decisions, such as manually designing reward functions that not only define the task, but also provide sufficient shaping to accomplish it. In this paper, we view reinforcement learning as inferring policies that achieve desired outcomes, rather than as a problem of maximizing rewards. To solve this inference problem, we establish a novel va…

            Organizer

            NeurIPS 2021

            About NeurIPS 2021

            Neural Information Processing Systems (NeurIPS) is a multi-track machine learning and computational neuroscience conference that includes invited talks, demonstrations, symposia, and oral and poster presentations of refereed papers. Workshops following the conference provide a less formal setting.
