Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: On the Theory of Reinforcement Learning with Once-per-Episode Feedback
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v3-stream-015-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v3-stream-015-alpha.b-cdn.net
      • sl-yoda-v3-stream-015-beta.b-cdn.net
      • 1963568160.rsc.cdn77.org
      • 1940033649.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            On the Theory of Reinforcement Learning with Once-per-Episode Feedback
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            On the Theory of Reinforcement Learning with Once-per-Episode Feedback

            Dec 6, 2021

            Speakers

            NC

            Niladri Chatterji

            Speaker · 0 followers

            AP

            Aldo Pacchiano

            Speaker · 0 followers

            PLB

            Peter L. Bartlett

            Speaker · 1 follower

            About

            We introduce a theory of reinforcement learning (RL) in which the learner receives feedback only once at the end of an episode. While this is an extreme test case for theory, it is also arguably more representative of real-world applications than the traditional requirement in RL practice that the learner receive feedback at every time step. Indeed, in many real-world applications of reinforcement learning, such as self driving cars and robotics, it is easier to evaluate whether a learner's comp…

            Organizer

            N2
            N2

            NeurIPS 2021

            Account · 1.9k followers

            About NeurIPS 2021

            Neural Information Processing Systems (NeurIPS) is a multi-track machine learning and computational neuroscience conference that includes invited talks, demonstrations, symposia and oral and poster presentations of refereed papers. Following the conference, there are workshops which provide a less formal setting.

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Statistical Testing under Distributional Shifts
            25:44

            Statistical Testing under Distributional Shifts

            Jonas Peters

            N2
            N2
            NeurIPS 2021 3 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            The Limits of Optimal Pricing in the Dark
            15:06

            The Limits of Optimal Pricing in the Dark

            Quinlan Dawkins, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Modality-Agnostic Topology Aware Localization
            11:06

            Modality-Agnostic Topology Aware Localization

            Farhad G. Zanjani, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            ResNet strikes back: An improved training procedure in timm
            13:35

            ResNet strikes back: An improved training procedure in timm

            Ross Wightman, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Model based Multi-agent Reinforcement Learning with Tensor Decompositions
            05:12

            Model based Multi-agent Reinforcement Learning with Tensor Decompositions

            Pascal Van Der Vaart, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Solving Soft Clustering Ensemble via k-Sparse Discrete Wasserstein Barycenter
            12:11

            Solving Soft Clustering Ensemble via k-Sparse Discrete Wasserstein Barycenter

            Ruizhe Qin, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow NeurIPS 2021