            Counterexample Guided RL Policy Refinement Using Bayesian Optimization

            Dec 6, 2021

            Speakers

            Briti Gangopadhyay

            Pallab Dasgupta

            About

            Constructing Reinforcement Learning (RL) policies that adhere to safety requirements is an emerging field of study. RL agents learn via trial and error with the objective of optimizing a reward signal. Policies designed to accumulate rewards often fail to satisfy safety specifications. We present a methodology for counterexample guided refinement of a trained RL policy against a given safety specification. Our approach has two main components. The first component is an approach to discover…
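To make the idea of a counterexample to a safety specification concrete, here is a minimal sketch and not the speakers' method. Everything in it is an assumption made for illustration: the one-dimensional dynamics, the `policy` function standing in for a trained RL policy, the bound `BOUND`, and the names `robustness` and `find_counterexample`. Plain random search is used in place of Bayesian optimization to keep the example dependency-free.

```python
import random

GOAL = 1.0    # hypothetical target position
BOUND = 1.5   # hypothetical safety specification: |x| must never exceed BOUND


def policy(x):
    """Hypothetical stand-in for a trained policy: an aggressive proportional controller."""
    return 1.8 * (GOAL - x)


def robustness(x0, steps=20):
    """Smallest safety margin along a rollout from x0 (negative means a violation)."""
    x = x0
    margin = BOUND - abs(x)
    for _ in range(steps):
        x = x + policy(x)  # toy dynamics: the action is added to the position
        margin = min(margin, BOUND - abs(x))
    return margin


def find_counterexample(low=-1.0, high=1.4, samples=200, seed=0):
    """Random search over initial states for the rollout with the lowest margin.

    Random search is a simple substitute here; a Bayesian optimizer would
    choose the next x0 from a surrogate model instead of sampling uniformly.
    """
    rng = random.Random(seed)
    best_x0, best_margin = None, float("inf")
    for _ in range(samples):
        x0 = rng.uniform(low, high)
        margin = robustness(x0)
        if margin < best_margin:
            best_x0, best_margin = x0, margin
    return best_x0, best_margin


if __name__ == "__main__":
    x0, margin = find_counterexample()
    if margin < 0:
        print(f"counterexample: x0={x0:.3f}, margin={margin:.3f}")
    else:
        print("no violation found in the sampled initial states")
```

In this toy setting the policy overshoots the goal when it starts far from it, so some initial states produce a negative margin. Those states are the counterexamples that a refinement step would then need to address.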

            Organizer

            NeurIPS 2021

            About NeurIPS 2021

            Neural Information Processing Systems (NeurIPS) is a multi-track machine learning and computational neuroscience conference that includes invited talks, demonstrations, symposia and oral and poster presentations of refereed papers. Following the conference, there are workshops which provide a less formal setting.
