Núria Armengol Urpí, Sebastian Curi, Andreas Krause · Risk-Averse Offline Reinforcement Learning · SlidesLive

Categories

EN

Log in Talk to sales

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: Risk-Averse Offline Reinforcement Learning

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v3-stream-016-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v3-stream-016-alpha.b-cdn.net
sl-yoda-v3-stream-016-beta.b-cdn.net
1504562137.rsc.cdn77.org
1896834465.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Risk-Averse Offline Reinforcement Learning

Risk-Averse Offline Reinforcement Learning

May 3, 2021

Speakers

Núria Armengol Urpí

Speaker · 0 followers

Sebastian Curi

Speaker · 0 followers

Andreas Krause

Speaker · 6 followers

About

Training Reinforcement Learning (RL) agents in high-stakes applications might be too prohibitive due to the risk associated to exploration. Thus, the agent can only use data previously collected by safe policies. While previous work considers optimizing the average performance using offline data, we focus on optimizing a risk-averse criteria, namely the CVaR. In particular, we present the Offline Risk-Averse Actor-Critic (O-RAAC), a model-free RL algorithm that is able to learn risk-averse polic…

Organizer

ICLR 2021

Account · 909 followers

About ICLR 2021

The International Conference on Learning Representations (ICLR) is the premier gathering of professionals dedicated to the advancement of the branch of artificial intelligence called representation learning, but generally referred to as deep learning. ICLR is globally renowned for presenting and publishing cutting-edge research on all aspects of deep learning used in the fields of artificial intelligence, statistics and data science, as well as important application areas such as machine vision, computational biology, speech recognition, text understanding, gaming, and robotics.

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

Expo Talk - Qualcomm

1:22:26

Expo Talk - Qualcomm

Watch later

Favorite

ICLR 2021 4 years ago

Oral Session 12 - QA 3

11:48

Oral Session 12 - QA 3

Watch later

Favorite

Ainesh Bakshi, …

ICLR 2021 4 years ago

Adaptive Universal Generalized PageRank Graph Neural Network

05:13

Adaptive Universal Generalized PageRank Graph Neural Network

Watch later

Favorite

ICLR 2021 4 years ago

Fair Mixup: Fairness via Interpolation

04:48

Fair Mixup: Fairness via Interpolation

Watch later

Favorite

Ching-Yao Chuang, …

ICLR 2021 4 years ago

Categorical Normalizing Flows via Continuous Transformations

04:40

Categorical Normalizing Flows via Continuous Transformations

Watch later

Favorite

Phillip Lippe, …

ICLR 2021 4 years ago

Discovering Non-Monotonic Autoregressive Orderings with Variational Inference

04:56

Discovering Non-Monotonic Autoregressive Orderings with Variational Inference

Watch later

Favorite

Xuanlin Li, …

ICLR 2021 4 years ago