Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Unleashing The Potential of Data Sharing in Ensemble Deep Reinforcement Learning
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-001-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-001-alpha.b-cdn.net
      • sl-yoda-v2-stream-001-beta.b-cdn.net
      • 1824830694.rsc.cdn77.org
      • 1979322955.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Unleashing The Potential of Data Sharing in Ensemble Deep Reinforcement Learning
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Unleashing The Potential of Data Sharing in Ensemble Deep Reinforcement Learning

            Dec 2, 2022

            Speakers

            ZL

            Zhixuan Lin

            Speaker · 0 followers

            PD

            Pierluca D'Oro

            Speaker · 2 followers

            EN

            Evgenii Nikishin

            Speaker · 1 follower

            About

            This work studies a crucial but often overlooked element of ensemble methods in deep reinforcement learning: data sharing between ensemble members. We show that data sharing enables peer learning, a powerful learning process in which individual agents learn from each other's experience to significantly improve their performance. When given access to the experience of other ensemble members, even the worst agent can match or outperform the previously best agent, triggering a virtuous circle. Howe…

            Organizer

            N2
            N2

            NeurIPS 2022

            Account · 961 followers

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Meta-learning of Black-box Solvers Using Deep Reinforcement Learning
            04:39

            Meta-learning of Black-box Solvers Using Deep Reinforcement Learning

            Sofian Chaybouti, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Unified Optimal Transport Framework for Universal Domain Adaptation
            00:57

            Unified Optimal Transport Framework for Universal Domain Adaptation

            Wanxing Chang, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Tackling Distribution Shifts in Federated Learning with Superquantile Aggregation
            08:04

            Tackling Distribution Shifts in Federated Learning with Superquantile Aggregation

            Krishna Pillutla

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            SPRINT: Scalable Semantic Policy Pre-training via Language Instruction Relabeling
            08:02

            SPRINT: Scalable Semantic Policy Pre-training via Language Instruction Relabeling

            Jesse Zhang, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Exploiting Variable Correlation with Masked Modeling for Anomaly Detection in Time Series
            07:59

            Exploiting Variable Correlation with Masked Modeling for Anomaly Detection in Time Series

            Panagiotis Lymperopoulos, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Private Stochastic Optimization Without Uniform Lipschitz Continuity: Optimal Rates for (Non-Smooth) Convex Losses and Extension to Non-Convex Losses
            06:16

            Private Stochastic Optimization Without Uniform Lipschitz Continuity: Optimal Rates for (Non-Smooth) Convex Losses and Extension to Non-Convex Losses

            Andrew Lowy, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow NeurIPS 2022