Xuefeng Liu, Takuma Yoneda, Chaoqi Wang, Matthew R. Walter, Yuxin Chen · Active Policy Improvement from Multiple Black-box Oracles · SlidesLive

Kategorien

DE

Anmelden Vertrieb kontaktieren

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: Active Policy Improvement from Multiple Black-box Oracles

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v2-stream-002-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v2-stream-002-alpha.b-cdn.net
sl-yoda-v2-stream-002-beta.b-cdn.net
1001562353.rsc.cdn77.org
1075090661.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Active Policy Improvement from Multiple Black-box Oracles

Active Policy Improvement from Multiple Black-box Oracles

Jul 24, 2023

Sprecher:innen

Xuefeng Liu

Sprecher:in · 0 Follower:innen

Takuma Yoneda

Sprecher:in · 0 Follower:innen

Chaoqi Wang

Sprecher:in · 0 Follower:innen

Über

Reinforcement learning (RL) has made significant strides in various complex domains. However, identifying an effective policy via RL often necessitates extensive exploration. Imitation learning aims to mitigate this issue by using expert demonstrations to guide exploration. In real-world scenarios, one often has access to multiple suboptimal black-box experts, rather than a single optimal oracle. These experts do not universally outperform each other across all states, presenting a challenge in…

Organisator

ICML 2023

Konto · 657 Follower:innen

Gefällt euch das Format? Vertraut auf SlidesLive, um euer nächstes Event festzuhalten!

Professionelle Aufzeichnung und Livestreaming – weltweit.

Freigeben

Empfohlene Videos

Präsentationen, deren Thema, Kategorie oder Sprecher:in ähnlich sind

Data Structures for Density Estimation

05:16

Data Structures for Density Estimation

Später ansehen

Favorit

Anders Aamand, …

ICML 2023 2 years ago

Generalized Implicit Follow-The-Regularized-Leader

05:14

Generalized Implicit Follow-The-Regularized-Leader

Später ansehen

Favorit

ICML 2023 2 years ago

Instant Soup: Cheap Pruning Ensembles in A Single Pass Can Draw Lottery Tickets from Large Models

09:30

Instant Soup: Cheap Pruning Ensembles in A Single Pass Can Draw Lottery Tickets from Large Models

Später ansehen

Favorit

Ajay Jaiswal, …

ICML 2023 2 years ago

Equivariant Architectures for Learning in Deep Weight Spaces

08:25

Equivariant Architectures for Learning in Deep Weight Spaces

Später ansehen

Favorit

Aviv Navon, …

ICML 2023 2 years ago

WiML President's Remarks

13:45

WiML President's Remarks

Später ansehen

Favorit

ICML 2023 2 years ago

Best of Both Worlds Policy Optimization

06:28

Best of Both Worlds Policy Optimization

Später ansehen

Favorit

Christoph Dann, …

ICML 2023 2 years ago