DOMINO: Decomposed Mutual Information Optimization for Generalized Context in Meta-Reinforcement Learning

            Nov 28, 2022

            Speakers

            Mu Yao

            Yuzheng Zhuang

            Fei Ni

            About

            Adapting to changes in transition dynamics is essential in robotic applications. By learning a conditional policy with a compact context, context-aware meta-reinforcement learning provides a flexible way to adjust behavior according to dynamics changes. However, in real-world applications, the agent may encounter complex dynamics changes: multiple confounders can influence the transition dynamics, making it challenging to infer an accurate context for decision-making. This paper addresses such…

            Organizer

            NeurIPS 2022
