On The Fragility of Learned Reward Functions

NeurIPS 2022 · Dec 2, 2022

About

Reward functions are notoriously difficult to specify, especially for tasks with complex goals. Reward learning approaches attempt to infer reward functions from human feedback and preferences. Prior work on reward learning has focused mainly on achieving high final performance for agents trained alongside the reward function, but much of it does not investigate whether the resulting learned reward model accurately captures the intended behavior. In this work, we focus on the relearning failures of learned reward models: we demonstrate that learned reward models can fail to produce good behavior when they are reused to train randomly initialized policies, through experiments on both tabular and continuous control environments. We find that the severity of relearning failure can be sensitive to changes in reward model design and in the trajectory dataset. Finally, we discuss the potential limitations of our methods and emphasize the need for more retraining-based evaluations in the literature.
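The evaluation the abstract describes — retraining a fresh, randomly initialized policy against a frozen learned reward model and scoring it under the ground-truth reward — can be illustrated with a toy tabular setup. The sketch below is a minimal illustration of that protocol, not the authors' code: the chain MDP, the Q-learning hyperparameters, and the hand-constructed `learned_reward` table (which in practice would come from preference-based reward learning) are all assumptions for illustration.

```python
# Minimal sketch of a retraining-based evaluation for a learned reward model,
# using a toy 5-state chain MDP and tabular Q-learning. The environment,
# hyperparameters, and the learned reward table are illustrative assumptions.
import numpy as np

N_STATES, N_ACTIONS = 5, 2          # chain MDP: action 0 = left, action 1 = right
TRUE_REWARD = np.zeros((N_STATES, N_ACTIONS))
TRUE_REWARD[N_STATES - 1, 1] = 1.0  # ground-truth reward only at the right end

def step(s, a):
    """Deterministic chain dynamics: action 1 moves right, action 0 moves left."""
    return min(s + 1, N_STATES - 1) if a == 1 else max(s - 1, 0)

def q_learning(reward, episodes=500, alpha=0.1, gamma=0.9, eps=0.1, seed=0):
    """Train a tabular Q-learning policy from scratch against `reward`."""
    rng = np.random.default_rng(seed)
    Q = np.zeros((N_STATES, N_ACTIONS))
    for _ in range(episodes):
        s = 0
        for _ in range(20):  # fixed-horizon episodes
            a = rng.integers(N_ACTIONS) if rng.random() < eps else int(Q[s].argmax())
            s2 = step(s, a)
            Q[s, a] += alpha * (reward[s, a] + gamma * Q[s2].max() - Q[s, a])
            s = s2
    return Q

def true_return(Q, gamma=0.9, horizon=20):
    """Evaluate the greedy policy of Q under the ground-truth reward."""
    s, ret = 0, 0.0
    for t in range(horizon):
        a = int(Q[s].argmax())
        ret += gamma**t * TRUE_REWARD[s, a]
        s = step(s, a)
    return ret

# A hypothetical learned reward model: correct on transitions the original
# training policy visited, but mis-specified on a rarely visited self-loop.
learned_reward = TRUE_REWARD.copy()
learned_reward[0, 0] = 1.2  # spurious reward on the (state 0, left) self-loop

# Retraining-based evaluation: train several randomly initialized policies
# against the frozen learned reward and check their ground-truth returns.
for seed in range(3):
    Q = q_learning(learned_reward, seed=seed)
    print(f"seed {seed}: true return of retrained policy = {true_return(Q):.3f}")
```

In this toy setting, the retrained policies latch onto the mis-specified self-loop at state 0 and earn zero ground-truth return: a reward model that would score the original training trajectories well can still carry spurious structure off-distribution, which is the kind of relearning failure the talk examines.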
