            Lazy Agents: A New Perspective on Solving Sparse Reward Problem in Multi-agent Reinforcement Learning

            Jul 24, 2023

            Speakers

            Boyin Liu

            Zhiqiang Pu

            Yi Pan

            About

            Sparse reward remains a valuable and challenging problem in multi-agent reinforcement learning (MARL). This paper addresses this issue from a new perspective, i.e., lazy agents. We empirically illustrate how lazy agents damage learning from both exploration and exploitation. Then, we propose a novel MARL framework called Lazy Agents Avoidance through Influencing External States (LAIES). Firstly, we examine the causes and types of lazy agents in MARL using a causal graph of the interaction betwe…

            Organizer

            ICML 2023
