
            Reinforcement Learning from Human Feedback: A Tutorial *

            Jul 24, 2023

            Speakers

            Nathan Lambert

            Dmitry Ustalov

            Organizer

            ICML 2023
