Stefan Wagner, Peter Arndt, Jan Robine, Stefan Harmeling · Cyclophobic Reinforcement Learning · SlidesLive

Categories

EN

Log in Talk to sales

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: Cyclophobic Reinforcement Learning

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v2-stream-005-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v2-stream-005-alpha.b-cdn.net
sl-yoda-v2-stream-005-beta.b-cdn.net
1034628162.rsc.cdn77.org
1409346856.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Cyclophobic Reinforcement Learning

Cyclophobic Reinforcement Learning

Dec 2, 2022

Speakers

Stefan Wagner

Speaker · 0 followers

Peter Arndt

Speaker · 0 followers

Jan Robine

Speaker · 0 followers

About

In environments with sparse rewards finding a good inductive bias for exploration is crucial to the agent’s success. However, there are two competing goals: novelty search and systematic exploration. While existing approaches such as curiousity-driven exploration find novelty, they sometimes do not systematically explore the whole state space, akin to depth-first-search vs breadth-first-search. In this paper, we propose a new intrinsic reward that is cyclophobic, i.e. it does not reward novelty,…

Organizer

NeurIPS 2022

Account · 961 followers

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

Lemma: Bootstrapping High-Level Mathematical Reasoning with Learned Symbolic Abstractions

06:40

Lemma: Bootstrapping High-Level Mathematical Reasoning with Learned Symbolic Abstractions

Watch later

Favorite

Zhening Li, …

NeurIPS 2022 2 years ago

Metal3D: Accurate prediction of transition metal ion location via deep learning

15:23

Metal3D: Accurate prediction of transition metal ion location via deep learning

Watch later

Favorite

NeurIPS 2022 2 years ago

Decision Trees with Short Explainable Rules

05:18

Decision Trees with Short Explainable Rules

Watch later

Favorite

Ferdinando Cicalese, …

NeurIPS 2022 2 years ago

VeriDark: A Large-Scale Benchmark for Authorship Verification on the Dark Web

04:57

VeriDark: A Large-Scale Benchmark for Authorship Verification on the Dark Web

Watch later

Favorite

Andrei Manolache, …

NeurIPS 2022 2 years ago

MaskPlace: Fast Chip Placement via Reinforced Visual Representation Learning

04:18

MaskPlace: Fast Chip Placement via Reinforced Visual Representation Learning

Watch later

Favorite

NeurIPS 2022 2 years ago

HSurf-Net: Normal Estimation for 3D Point Clouds by Learning Hyper Surfaces

03:43

HSurf-Net: Normal Estimation for 3D Point Clouds by Learning Hyper Surfaces

Watch later

Favorite

NeurIPS 2022 2 years ago