            Robust Situational Reinforcement Learning in Face of Context Disturbances

            Jul 24, 2023

            Speakers

            Jinpeng Zhang

            Yufeng Zheng

            Chuheng Zhang

            About

            In many real-world tasks, some state features, called contexts, are independent of action signals, e.g., customer demand in inventory control or the speed of the lead car in autonomous driving. One challenge for reinforcement learning in these applications is that the true context transitions can easily be exposed to some unknown source of contamination, leading to a shift in context transitions between source domains and target domains, which could cause performance degradation for RL a…

            Organizer

            ICML 2023
