Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Contrastive Value Learning: Implicit Models for Simple Offline RL
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-007-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-007-alpha.b-cdn.net
      • sl-yoda-v2-stream-007-beta.b-cdn.net
      • 1678031076.rsc.cdn77.org
      • 1932936657.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Contrastive Value Learning: Implicit Models for Simple Offline RL
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Contrastive Value Learning: Implicit Models for Simple Offline RL

            Dez 2, 2022

            Sprecher:innen

            BM

            Bogdan Mazoure

            Speaker · 0 followers

            BE

            Benjamin Eysenbach

            Speaker · 0 followers

            ON

            Ofir Nachum

            Speaker · 2 followers

            Über

            Model-based reinforcement learning (RL) methods are appealing in the offline setting because they allow an agent to reason about the consequences of actions without interacting with the environment. Prior methods learn a 1-step dynamics model, which predicts the next state given the current state and action. These models do not immediately tell the agent which actions to take, but must be integrated into a larger RL framework. Can we model the environment dynamics in a different way, such that t…

            Organisator

            N2
            N2

            NeurIPS 2022

            Account · 962 followers

            Gefällt euch das Format? Vertraut auf SlidesLive, um euer nächstes Event festzuhalten!

            Professionelle Aufzeichnung und Livestreaming – weltweit.

            Freigeben

            Empfohlene Videos

            Präsentationen, deren Thema, Kategorie oder Sprecher:in ähnlich sind

            Online Training Through Time for Spiking Neural Networks
            01:01

            Online Training Through Time for Spiking Neural Networks

            Mingqing Xiao, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            [Re] An Implementation of Fair Robust Learning
            04:30

            [Re] An Implementation of Fair Robust Learning

            Ian Hardy

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            EGSDE: Unpaired Image-to-Image Translation via Energy-Guided Stochastic Differential Equations
            03:57

            EGSDE: Unpaired Image-to-Image Translation via Energy-Guided Stochastic Differential Equations

            Min Zhao, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Statistical Downscaling of Sea Surface Temperature Projections with a Multivariate Gaussian Process Model
            03:52

            Statistical Downscaling of Sea Surface Temperature Projections with a Multivariate Gaussian Process Model

            Ayesha Ekanayaka, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            ML for protein structure prediction - a biology perspective
            22:38

            ML for protein structure prediction - a biology perspective

            Kathryn Tunyasuvunakool

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Lifting the Information Ratio: An Information-Theoretic Analysis of Thompson Sampling
            04:44

            Lifting the Information Ratio: An Information-Theoretic Analysis of Thompson Sampling

            Ludovic Schwartz, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interessiert an Vorträgen wie diesem? NeurIPS 2022 folgen