Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Outcome-Driven Reinforcement Learning via Variational Inference
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v3-stream-016-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v3-stream-016-alpha.b-cdn.net
      • sl-yoda-v3-stream-016-beta.b-cdn.net
      • 1504562137.rsc.cdn77.org
      • 1896834465.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Outcome-Driven Reinforcement Learning via Variational Inference
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Outcome-Driven Reinforcement Learning via Variational Inference

            Dec 6, 2021

            Speakers

            TGJR
            TGJR

            Tim G. J. Rudner

            Speaker · 2 followers

            VHP

            Vitchyr H. Pong

            Speaker · 0 followers

            RM

            Rowan McAllister

            Speaker · 0 followers

            About

            While reinforcement learning algorithms provide automated acquisition of optimal policies, practical application of such methods requires a number of design decisions, such as manually designing reward functions that not only define the task, but also provide sufficient shaping to accomplish it. In this paper, we view reinforcement learning as inferring policies that achieve desired outcomes, rather than as a problem of maximizing rewards. To solve this inference problem, we establish a novel va…

            Organizer

            N2
            N2

            NeurIPS 2021

            Account · 1.9k followers

            About NeurIPS 2021

            Neural Information Processing Systems (NeurIPS) is a multi-track machine learning and computational neuroscience conference that includes invited talks, demonstrations, symposia and oral and poster presentations of refereed papers. Following the conference, there are workshops which provide a less formal setting.

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            End-To-End Online sPHENIX Trigger Detection Pipeline
            05:00

            End-To-End Online sPHENIX Trigger Detection Pipeline

            Tingting Xuan, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Bandit Quickest Changepoint Detection
            15:01

            Bandit Quickest Changepoint Detection

            Aditya Gopalan, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Equinox: neural networks in JAX via callable PyTrees and filtered transformations
            07:32

            Equinox: neural networks in JAX via callable PyTrees and filtered transformations

            Patrick Kidger

            N2
            N2
            NeurIPS 2021 3 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Heavy Ball Momentum for Conditional Gradient
            11:41

            Heavy Ball Momentum for Conditional Gradient

            Biangcong Li, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Tractable Density Estimation on Learned Manifolds with Conformal Embedding Flows
            14:45

            Tractable Density Estimation on Learned Manifolds with Conformal Embedding Flows

            Brendan Leigh Ross, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            To The Point: Correspondence-driven monocular 3D category reconstruction
            05:27

            To The Point: Correspondence-driven monocular 3D category reconstruction

            Filippos Kokkinos

            N2
            N2
            NeurIPS 2021 3 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow NeurIPS 2021