Briti Gangopadhyay, Pallab Dasgupta · Counterexample Guided RL Policy Refinement Using Bayesian Optimization · SlidesLive

Dec 6, 2021

Speakers

Briti Gangopadhyay

Pallab Dasgupta

About

Constructing Reinforcement Learning (RL) policies that adhere to safety requirements is an emerging field of study. RL agents learn via trial and error with the objective of optimizing a reward signal. Policies designed to accumulate reward often fail to satisfy safety specifications. We present a methodology for counterexample guided refinement of a trained RL policy against a given safety specification. Our approach has two main components. The first component is an approach to discover…

Organizer

NeurIPS 2021

About NeurIPS 2021

Neural Information Processing Systems (NeurIPS) is a multi-track machine learning and computational neuroscience conference that includes invited talks, demonstrations, symposia and oral and poster presentations of refereed papers. Following the conference, there are workshops which provide a less formal setting.
