Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Reinforcement Learning in Reward-Mixing MDPs
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v3-stream-015-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v3-stream-015-alpha.b-cdn.net
      • sl-yoda-v3-stream-015-beta.b-cdn.net
      • 1963568160.rsc.cdn77.org
      • 1940033649.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Reinforcement Learning in Reward-Mixing MDPs
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Reinforcement Learning in Reward-Mixing MDPs

            Dez 6, 2021

            Sprecher:innen

            JK

            Jeongyeol Kwon

            Řečník · 0 sledujících

            YE

            Yonathan Efroni

            Řečník · 0 sledujících

            CC

            Constantine Caramanis

            Řečník · 0 sledujících

            Über

            Learning a near optimal policy in a partially observable system remains an elusive challenge in contemporary reinforcement learning. In this work, we consider episodic reinforcement learning in a reward-mixing Markov decision process (MDP). There, a reward function is drawn from one of M possible reward models at the beginning of every episode, but the identity of the chosen reward model is not revealed to the agent. Hence, the latent state space, for which the dynamics are Markovian, is not giv…

            Organisator

            N2
            N2

            NeurIPS 2021

            Účet · 1,9k sledujících

            Über NeurIPS 2021

            Neural Information Processing Systems (NeurIPS) is a multi-track machine learning and computational neuroscience conference that includes invited talks, demonstrations, symposia and oral and poster presentations of refereed papers. Following the conference, there are workshops which provide a less formal setting.

            Gefällt euch das Format? Vertraut auf SlidesLive, um euer nächstes Event festzuhalten!

            Professionelle Aufzeichnung und Livestreaming – weltweit.

            Freigeben

            Empfohlene Videos

            Präsentationen, deren Thema, Kategorie oder Sprecher:in ähnlich sind

            Open Data Sharing and Indigenous Genomic Data Governance
            32:29

            Open Data Sharing and Indigenous Genomic Data Governance

            Krystal Tsosie

            N2
            N2
            NeurIPS 2021 3 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Sample Selection for Fair and Robust Training
            13:44

            Sample Selection for Fair and Robust Training

            Yuji Roh, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Learning to Synthesize Programs as Interpretable and Generalizable Policies
            18:14

            Learning to Synthesize Programs as Interpretable and Generalizable Policies

            Dweep Trivedi, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Predicting Molecular Conformation via Dynamic Graph Score Matching
            08:37

            Predicting Molecular Conformation via Dynamic Graph Score Matching

            Shitong Luo, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Continual Density Ratio Estimation
            05:47

            Continual Density Ratio Estimation

            Yu Chen, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            GRIN: Generative Relation and Intention Network for Multi-agent Trajectory Prediction
            04:28

            GRIN: Generative Relation and Intention Network for Multi-agent Trajectory Prediction

            Longyuan Li, …

            N2
            N2
            NeurIPS 2021 3 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Interessiert an Vorträgen wie diesem? NeurIPS 2021 folgen