Haoyang Xu, Jimmy Ba, Silviu Pitis, Harris Chan · Temporary Goals for Exploration · SlidesLive

Categories

EN

Log in Talk to sales

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: Temporary Goals for Exploration

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v2-stream-010-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v2-stream-010-alpha.b-cdn.net
sl-yoda-v2-stream-010-beta.b-cdn.net
1759419103.rsc.cdn77.org
1016618226.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Temporary Goals for Exploration

Temporary Goals for Exploration

Dec 2, 2022

Speakers

Haoyang Xu

Řečník · 0 sledujících

Jimmy Ba

Řečník · 2 sledující

Silviu Pitis

Řečník · 0 sledujících

About

Exploration has always been a crucial aspect of reinforcement learning. When facing long horizon sparse reward environments modern methods still struggle with effective exploration and generalize poorly. In the multi-goal reinforcement learning setting, out-of-distribution goals might appear similar to the achieved ones, but the agent can never accurately assess its ability to achieve them without attempting them. To enable faster exploration and improve generalization, we propose an exploration…

Organizer

NeurIPS 2022

Účet · 961 sledujících

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

Understanding Cross-Domain Few-Shot Learning Based on Domain Similarity and Few-Shot Difficulty

04:51

Understanding Cross-Domain Few-Shot Learning Based on Domain Similarity and Few-Shot Difficulty

Zhlédnout později

Oblíbené

Jaehoon Oh, …

NeurIPS 2022 2 years ago

Leveraging Inter-Layer Dependency for Post -Training Quantization

03:58

Leveraging Inter-Layer Dependency for Post -Training Quantization

Zhlédnout později

Oblíbené

Changbao Wang, …

NeurIPS 2022 2 years ago

Welcome and Opening Remarks

06:01

Welcome and Opening Remarks

Zhlédnout později

Oblíbené

NeurIPS 2022 2 years ago

On Reinforcement Learning and Distribution Matching for Fine-Tuning Language Models with no Catastrophic Forgetting

05:02

On Reinforcement Learning and Distribution Matching for Fine-Tuning Language Models with no Catastrophic Forgetting

Zhlédnout později

Oblíbené

Tomasz Korbak, …

NeurIPS 2022 2 years ago

Contrastive Language-Image Pre-Training with Knowledge Graphs

05:04

Contrastive Language-Image Pre-Training with Knowledge Graphs

Zhlédnout později

Oblíbené

NeurIPS 2022 2 years ago

Antigen-Specific Antibody Design and Optimization with Diffusion-Based Generative Models for Protein Structures

02:32

Antigen-Specific Antibody Design and Optimization with Diffusion-Based Generative Models for Protein Structures

Zhlédnout později

Oblíbené

Shitong Luo, …

NeurIPS 2022 2 years ago