            A Consciousness-Inspired Planning Agent for Model-Based RL

            Dec 6, 2021

            Speakers

            Mingde Zhao

            Zhen Liu

            Sitao Luan

            About

            We present an end-to-end, model-based deep reinforcement learning agent which dynamically attends to relevant parts of its state, in order to plan and to generalize better out-of-distribution. The agent's architecture uses a set representation and a bottleneck mechanism, forcing the number of entities to which the agent attends at each planning step to be small. In experiments with customized MiniGrid environments with different dynamics, we observe that the design allows agents to learn to plan…
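            The bottleneck idea in the abstract can be illustrated with a toy example. This is only a minimal sketch, assuming a hard top-k selection over dot-product scores followed by a softmax over the selected entities; the function names, the scoring rule, and the example vectors are hypothetical and are not taken from the paper, whose actual mechanism may differ.

```python
import math


def select_bottleneck(entities, query, k):
    """Return the indices of the k entities scoring highest against the query.

    Hypothetical helper: the dot-product score is an assumption of this sketch.
    """
    scores = [sum(e * q for e, q in zip(entity, query)) for entity in entities]
    ranked = sorted(range(len(entities)), key=lambda i: scores[i], reverse=True)
    # The state is a set, so the order of the selected indices carries no meaning.
    return sorted(ranked[:k])


def attend(entities, query, k):
    """Softmax-weighted summary computed over the selected entities only."""
    idx = select_bottleneck(entities, query, k)
    scores = [sum(e * q for e, q in zip(entities[i], query)) for i in idx]
    top = max(scores)
    weights = [math.exp(s - top) for s in scores]  # shift for numerical stability
    total = sum(weights)
    weights = [w / total for w in weights]
    summary = [
        sum(w * entities[i][d] for w, i in zip(weights, idx))
        for d in range(len(query))
    ]
    return idx, weights, summary


# Four toy entities; only two may pass through the bottleneck.
entities = [[1.0, 0.0], [0.0, 1.0], [0.9, 0.1], [-1.0, 0.0]]
query = [1.0, 0.0]
idx, weights, summary = attend(entities, query, k=2)
print(idx, weights, summary)
```

            Keeping k small means each planning step reasons over only a few entities, which is the constraint the abstract describes.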

            Organizer

            NeurIPS 2021

            Categories

            AI & Data Science

            About NeurIPS 2021

            Neural Information Processing Systems (NeurIPS) is a multi-track machine learning and computational neuroscience conference that includes invited talks, demonstrations, symposia, and oral and poster presentations of refereed papers. Workshops follow the conference and provide a less formal setting.
