Jeongyeol Kwon, Yonathan Efroni, Constantine Caramanis, Shie Mannor · Reinforcement Learning in Reward-Mixing MDPs · SlidesLive

Categories

EN

Log in Talk to sales

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: Reinforcement Learning in Reward-Mixing MDPs

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v3-stream-015-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v3-stream-015-alpha.b-cdn.net
sl-yoda-v3-stream-015-beta.b-cdn.net
1963568160.rsc.cdn77.org
1940033649.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Reinforcement Learning in Reward-Mixing MDPs

Reinforcement Learning in Reward-Mixing MDPs

Dec 6, 2021

Speakers

Jeongyeol Kwon

Řečník · 0 sledujících

Yonathan Efroni

Řečník · 0 sledujících

Constantine Caramanis

Řečník · 0 sledujících

About

Learning a near optimal policy in a partially observable system remains an elusive challenge in contemporary reinforcement learning. In this work, we consider episodic reinforcement learning in a reward-mixing Markov decision process (MDP). There, a reward function is drawn from one of M possible reward models at the beginning of every episode, but the identity of the chosen reward model is not revealed to the agent. Hence, the latent state space, for which the dynamics are Markovian, is not giv…

Organizer

NeurIPS 2021

Účet · 1,9k sledujících

About NeurIPS 2021

Neural Information Processing Systems (NeurIPS) is a multi-track machine learning and computational neuroscience conference that includes invited talks, demonstrations, symposia and oral and poster presentations of refereed papers. Following the conference, there are workshops which provide a less formal setting.

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

CLDA: Contrastive Learning for Semi-Supervised Domain Adaptation

06:24

CLDA: Contrastive Learning for Semi-Supervised Domain Adaptation

Zhlédnout později

Oblíbené

NeurIPS 2021 3 years ago

Reinforcement Learning in Real-World Control Systems

19:47

Reinforcement Learning in Real-World Control Systems

Zhlédnout později

Oblíbené

Martin Riedmiller

NeurIPS 2021 3 years ago

Representation Costs of Linear Neural Networks: Analysis and Design

12:50

Representation Costs of Linear Neural Networks: Analysis and Design

Zhlédnout později

Oblíbené

NeurIPS 2021 3 years ago

Intriguing Properties of Vision Transformers

12:32

Intriguing Properties of Vision Transformers

Zhlédnout později

Oblíbené

Muzammal Naseer, …

NeurIPS 2021 3 years ago

The impact of weather information on machine-learning probabilistic electricity demand predictions

05:51

The impact of weather information on machine-learning probabilistic electricity demand predictions

Zhlédnout později

Oblíbené

NeurIPS 2021 3 years ago

Simple Stochastic and Online Gradient Descent Algorithms for Pairiwise Learning

14:40

Simple Stochastic and Online Gradient Descent Algorithms for Pairiwise Learning

Zhlédnout později

Oblíbené

Zhenhuan Yang, …

NeurIPS 2021 3 years ago