            Distributional Reinforcement Learning for Multi-Dimensional Reward Functions

            Dec 6, 2021

            Speakers

            Pushi Zhang

            Xiaoyu Chen

            Li Zhao

            About

            A growing trend in value-based reinforcement learning (RL) algorithms is to capture more information in the value network than a scalar value function provides. One of the best-known methods in this line of work is distributional RL, which models the return distribution instead of a scalar value. In a separate line of work, hybrid reward architectures (HRA) model a source-specific value function for each source of reward, which has also been shown to improve performance. To fully inherit th…
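
To make the three views named in the abstract concrete, the following is a minimal toy sketch, not the method presented in the talk. It assumes a hypothetical two-source reward process (the names `sample_returns`, `gamma`, `horizon`, and the 0.8 coupling probability are all invented for illustration) and compares a scalar value, HRA-style per-source values, and the joint distribution of per-source returns estimated from samples.

```python
import random


def sample_returns(num_episodes=1000, horizon=5, gamma=0.9, seed=0):
    """Sample discounted per-source returns from a hypothetical toy process.

    Assumption for illustration only: two reward sources, where source 1
    usually pays out exactly when source 0 does not.
    """
    rng = random.Random(seed)
    samples = []
    for _ in range(num_episodes):
        g0 = g1 = 0.0
        for t in range(horizon):
            r0 = rng.choice([0.0, 1.0])
            # With probability 0.8 the second source is the complement of the first.
            r1 = 1.0 - r0 if rng.random() < 0.8 else r0
            g0 += gamma ** t * r0
            g1 += gamma ** t * r1
        samples.append((g0, g1))
    return samples


def mean(xs):
    return sum(xs) / len(xs)


def covariance(xs, ys):
    mx, my = mean(xs), mean(ys)
    return mean([(x - mx) * (y - my) for x, y in zip(xs, ys)])


returns = sample_returns()
g0s = [g0 for g0, _ in returns]
g1s = [g1 for _, g1 in returns]
totals = [g0 + g1 for g0, g1 in returns]

# Scalar view: one number summarizing the total return.
scalar_value = mean(totals)

# HRA-style view: one expected value per reward source.
source_values = (mean(g0s), mean(g1s))

# Distributional view of the total return: spread of the 1-D distribution.
total_variance = covariance(totals, totals)

# Joint view over sources: also exposes how the sources co-vary.
source_covariance = covariance(g0s, g1s)

print("scalar value:", scalar_value)
print("per-source values:", source_values)
print("variance of total return:", total_variance)
print("covariance between sources:", source_covariance)
```

In this toy setting the per-source values sum to the scalar value, so they refine it without contradicting it. The covariance between sources is only recoverable from the joint distribution of per-source returns, which neither the scalar value nor the per-source expectations carry.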

            Organizer

            NeurIPS 2021

            About NeurIPS 2021

            Neural Information Processing Systems (NeurIPS) is a multi-track machine learning and computational neuroscience conference that includes invited talks, demonstrations, symposia and oral and poster presentations of refereed papers. Following the conference, there are workshops which provide a less formal setting.
