Mu Yao, Yuzheng Zhuang, Fei Ni, Bin Wang, Jianyu Chen, Jianye Hao, Ping Luo · DOMINO: Decomposed Mutual Information Optimization for Generalized Context in Meta-Reinforcement Learning · SlidesLive

Categories

EN

Log in Talk to sales

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: DOMINO: Decomposed Mutual Information Optimization for Generalized Context in Meta-Reinforcement Learning

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v2-stream-008-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v2-stream-008-alpha.b-cdn.net
sl-yoda-v2-stream-008-beta.b-cdn.net
1159783934.rsc.cdn77.org
1511376917.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

DOMINO: Decomposed Mutual Information Optimization for Generalized Context in Meta-Reinforcement Learning

DOMINO: Decomposed Mutual Information Optimization for Generalized Context in Meta-Reinforcement Learning

Nov 28, 2022

Speakers

Mu Yao

Sprecher:in · 0 Follower:innen

Yuzheng Zhuang

Sprecher:in · 0 Follower:innen

Fei Ni

Sprecher:in · 0 Follower:innen

About

Adapting to the changes in transition dynamics is essential in robotic applications. By learning a conditional policy with a compact context, context-aware meta-reinforcement learning provides a flexible way to adjust behavior according to dynamics changes. However, in real-world applications, the agent may encounter complex dynamics changes. Multiple confounders can influence the transition dynamics, making it challenging to infer accurate context for decision-making. This paper addresses such…

Organizer

NeurIPS 2022

Konto · 962 Follower:innen

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

Ground(less) Truth: The Problem with Proxy Labels in Human-AI Decision-Making

04:49

Ground(less) Truth: The Problem with Proxy Labels in Human-AI Decision-Making

Später ansehen

Favorit

Luke Guerdan, …

NeurIPS 2022 2 years ago

ExpressUrself: A spatial model for predicting recombinant expression from mRNA sequence

01:58

ExpressUrself: A spatial model for predicting recombinant expression from mRNA sequence

Später ansehen

Favorit

Michael P. Dunne, …

NeurIPS 2022 2 years ago

Trials of developing OPT-175B

31:18

Trials of developing OPT-175B

Später ansehen

Favorit

NeurIPS 2022 2 years ago

Locally Hierarchical Auto-Regressive Modeling for Image Generation

04:03

Locally Hierarchical Auto-Regressive Modeling for Image Generation

Später ansehen

Favorit

Tackgeun You, …

NeurIPS 2022 2 years ago

Learning from Stochastically Revealed Preference

05:23

Learning from Stochastically Revealed Preference

Später ansehen

Favorit

Chunlin Sun, …

NeurIPS 2022 2 years ago

So3krates - Self-attention for interactions on arbitrary length-scales in molecular systems

05:25

So3krates - Self-attention for interactions on arbitrary length-scales in molecular systems

Später ansehen

Favorit

Thorben Frank, …

NeurIPS 2022 2 years ago