Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Reinforcement Learning in Reward-Mixing MDPs
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v3-stream-015-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v3-stream-015-alpha.b-cdn.net
      • sl-yoda-v3-stream-015-beta.b-cdn.net
      • 1963568160.rsc.cdn77.org
      • 1940033649.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Reinforcement Learning in Reward-Mixing MDPs
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Reinforcement Learning in Reward-Mixing MDPs

            Dec 6, 2021

            Speakers

            JK

            Jeongyeol Kwon

            Řečník · 0 sledujících

            YE

            Yonathan Efroni

            Řečník · 0 sledujících

            CC

            Constantine Caramanis

            Řečník · 0 sledujících

            About

            Learning a near optimal policy in a partially observable system remains an elusive challenge in contemporary reinforcement learning. In this work, we consider episodic reinforcement learning in a reward-mixing Markov decision process (MDP). There, a reward function is drawn from one of M possible reward models at the beginning of every episode, but the identity of the chosen reward model is not revealed to the agent. Hence, the latent state space, for which the dynamics are Markovian, is not giv…

            Organizer

            N2
            N2

            NeurIPS 2021

            Účet · 1,9k sledujících

            About NeurIPS 2021

            Neural Information Processing Systems (NeurIPS) is a multi-track machine learning and computational neuroscience conference that includes invited talks, demonstrations, symposia and oral and poster presentations of refereed papers. Following the conference, there are workshops which provide a less formal setting.

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            CLDA: Contrastive Learning for Semi-Supervised Domain Adaptation
            06:24

            CLDA: Contrastive Learning for Semi-Supervised Domain Adaptation

            Ankit Singh

            N2
            N2
            NeurIPS 2021 3 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Reinforcement Learning in Real-World Control Systems
            19:47

            Reinforcement Learning in Real-World Control Systems

            Martin Riedmiller

            N2
            N2
            NeurIPS 2021 3 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Representation Costs of Linear Neural Networks: Analysis and Design
            12:50

            Representation Costs of Linear Neural Networks: Analysis and Design

            Zhen Dai, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Intriguing Properties of Vision Transformers
            12:32

            Intriguing Properties of Vision Transformers

            Muzammal Naseer, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            The impact of weather information on machine-learning probabilistic electricity demand predictions
            05:51

            The impact of weather information on machine-learning probabilistic electricity demand predictions

            Yifu Ding, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Simple Stochastic and Online Gradient Descent Algorithms for Pairiwise Learning
            14:40

            Simple Stochastic and Online Gradient Descent Algorithms for Pairiwise Learning

            Zhenhuan Yang, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Interested in talks like this? Follow NeurIPS 2021