Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-005-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-005-alpha.b-cdn.net
      • sl-yoda-v2-stream-005-beta.b-cdn.net
      • 1034628162.rsc.cdn77.org
      • 1409346856.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble

            Dec 6, 2021

            Speakers

            GA

            Gaon An

            Speaker · 0 followers

            SM

            Seungyong Moon

            Speaker · 0 followers

            JK

            Jang-Hyun Kim

            Speaker · 0 followers

            About

            Offline reinforcement learning (offline RL), which aims to find an optimal policy from a previously collected static dataset, bears algorithmic difficulties due to function approximation errors from out-of-distribution (OOD) data points. To this end, offline RL algorithms adopt either a constraint or a penalty term that explicitly guides the policy to stay close to the given dataset. However, prior methods typically require accurate estimation of the behavior policy or sampling from OOD data poi…

            Organizer

            N2
            N2

            NeurIPS 2021

            Account · 1.9k followers

            About NeurIPS 2021

            Neural Information Processing Systems (NeurIPS) is a multi-track machine learning and computational neuroscience conference that includes invited talks, demonstrations, symposia and oral and poster presentations of refereed papers. Following the conference, there are workshops which provide a less formal setting.

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Next-Generation Datasets for Safe Self-driving
            27:33

            Next-Generation Datasets for Safe Self-driving

            Raquel Urtasun

            N2
            N2
            NeurIPS 2021 3 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Long-Short Transformer: Efficient Transformers for Language and Vision
            11:44

            Long-Short Transformer: Efficient Transformers for Language and Vision

            Chen Zhu, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Automatic Symmetry Discovery with Lie Algebra Convolutional Network
            14:42

            Automatic Symmetry Discovery with Lie Algebra Convolutional Network

            Nima Dehmamy, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Total of 1 viewers voted for saving the presentation to eternal vault which is 0.1%

            AutoDC: Automated data-centric processing
            01:54

            AutoDC: Automated data-centric processing

            Zac Yung-Chun Liu, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Directed Spectral Measures Improve Latent Network Models Of Neural Populations
            11:43

            Directed Spectral Measures Improve Latent Network Models Of Neural Populations

            Neil Gallagher, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Continuous Mean-Covariance Bandits
            11:33

            Continuous Mean-Covariance Bandits

            Yihan Du, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow NeurIPS 2021