            Reinforcement Learning with a Terminator

            Nov 28, 2022

            Speakers

            Guy Tennenholtz

            Nadav Merlis

            Lior Shani

            About

            We present the problem of reinforcement learning with exogenous termination. We define the Termination Markov Decision Process (TerMDP), an extension of the MDP framework, in which episodes may be interrupted by an external non-Markovian observer. This formulation accounts for numerous real-world situations, such as a human interrupting an autonomous driving agent for reasons of discomfort. We learn the parameters of the TerMDP and leverage the structure of the estimation problem to provide stat…

            Organizer

            NeurIPS 2022
