Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: PD-MORL: Preference-Driven Multi-Objective Reinforcement Learning Algorithm
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-002-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-002-alpha.b-cdn.net
      • sl-yoda-v2-stream-002-beta.b-cdn.net
      • 1001562353.rsc.cdn77.org
      • 1075090661.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            PD-MORL: Preference-Driven Multi-Objective Reinforcement Learning Algorithm
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            PD-MORL: Preference-Driven Multi-Objective Reinforcement Learning Algorithm

            Dec 2, 2022

            Speakers

            TB

            Toygun Basaklar

            Sprecher:in · 0 Follower:innen

            SG

            Suat Gumussoy

            Sprecher:in · 0 Follower:innen

            UYO

            Umit Y. Ogras

            Sprecher:in · 0 Follower:innen

            About

            Multi-objective reinforcement learning (MORL) approaches have emerged to tackle many real-world problems with multiple conflicting objectives by maximizing a joint objective function weighted by a preference vector. These approaches find fixed customized policies corresponding to preference vectors specified during training. However, the design constraints and objectives typically change dynamically in real-life scenarios. Furthermore, storing a policy for each potential preference is not scalab…

            Organizer

            N2
            N2

            NeurIPS 2022

            Konto · 961 Follower:innen

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            MsSVT: Mixed-scale Sparse Voxel Transformer for 3D Object Detection on Point Clouds
            03:39

            MsSVT: Mixed-scale Sparse Voxel Transformer for 3D Object Detection on Point Clouds

            Shaocong Dong, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Ewigspeicher-Fortschrittswert: 0 = 0.0%

            A unified information-theoretic model of EEG signatures of human language processing
            03:28

            A unified information-theoretic model of EEG signatures of human language processing

            Jiaxuan Li, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Ewigspeicher-Fortschrittswert: 0 = 0.0%

            Learning Superpoint Graph Cut for 3D Instance Segmentation
            03:00

            Learning Superpoint Graph Cut for 3D Instance Segmentation

            Le Hui, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Ewigspeicher-Fortschrittswert: 0 = 0.0%

            Periodic Graph Transformers for Crystal Material Property Prediction
            05:01

            Periodic Graph Transformers for Crystal Material Property Prediction

            Keqiang Yan, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Ewigspeicher-Fortschrittswert: 0 = 0.0%

            Understanding Deep Neural Function Approximation in Reinforcement Learning via ϵ-Greedy Exploration
            04:17

            Understanding Deep Neural Function Approximation in Reinforcement Learning via ϵ-Greedy Exploration

            Fanghui Liu, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Ewigspeicher-Fortschrittswert: 0 = 0.0%

            Pre-Trained Language Models for Interactive Decision-Making
            04:50

            Pre-Trained Language Models for Interactive Decision-Making

            Shuang Li, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Ewigspeicher-Fortschrittswert: 0 = 0.0%

            Interested in talks like this? Follow NeurIPS 2022