Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: On The Fragility of Learned Reward Functions
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-008-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-008-alpha.b-cdn.net
      • sl-yoda-v2-stream-008-beta.b-cdn.net
      • 1159783934.rsc.cdn77.org
      • 1511376917.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            On The Fragility of Learned Reward Functions
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            On The Fragility of Learned Reward Functions

            Dec 2, 2022

            Speakers

            LM

            Lev McKinney

            Speaker · 0 followers

            YD

            Yawen Duan

            Speaker · 0 followers

            AG

            Adam Gleave

            Speaker · 2 followers

            About

            Reward functions are notoriously difficult to specify, especially for tasks with complex goals. Reward learning approaches attempt to infer reward functions from human feedback and preferences. Prior works on reward learning mainly focus on achieving high final performance for agents trained alongside the reward function. However, many of these works fail to investigate whether the resulting learned reward model accurately captures the intended behavior. In this work, we focus on the relearning…

            Organizer

            N2
            N2

            NeurIPS 2022

            Account · 961 followers

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Sketching based Representations for Robust Image Classification with Provable Guarantees
            05:04

            Sketching based Representations for Robust Image Classification with Provable Guarantees

            Nishanth Dikkala, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            On Optimal Learning Under Targeted Data Poisoning
            04:51

            On Optimal Learning Under Targeted Data Poisoning

            Steve Hanneke, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Simulations for Open Science Token Communities: Designing the Knowledge Commons
            02:46

            Simulations for Open Science Token Communities: Designing the Knowledge Commons

            Jakub Smékal, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Object Scene Representation Transformer
            04:45

            Object Scene Representation Transformer

            Mehdi S. M. Sajjadi, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Capacity of Group-invariant Linear Readouts from Equivariant Representations: How Many Objects can be Linearly Classified Under All Possible Views?
            05:07

            Capacity of Group-invariant Linear Readouts from Equivariant Representations: How Many Objects can be Linearly Classified Under All Possible Views?

            Matthew Farrell, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            A Theory of Learning with Competing Objectives with User Feedback
            10:53

            A Theory of Learning with Competing Objectives with User Feedback

            Pranjal Awasthi, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow NeurIPS 2022