Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Is Reinforcement Learning (Not) for NLP?: Benchmarks, Baselines, and Building Blocks for Natural Language Policy Optimization
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-001-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-001-alpha.b-cdn.net
      • sl-yoda-v2-stream-001-beta.b-cdn.net
      • 1824830694.rsc.cdn77.org
      • 1979322955.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Is Reinforcement Learning (Not) for NLP?: Benchmarks, Baselines, and Building Blocks for Natural Language Policy Optimization
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Is Reinforcement Learning (Not) for NLP?: Benchmarks, Baselines, and Building Blocks for Natural Language Policy Optimization

            Dec 2, 2022

            Speakers

            RR

            Rajkumar Ramamurthy

            Speaker · 0 followers

            PA

            Prithviraj Ammanabrolu

            Speaker · 0 followers

            KB

            Kianté Brantley

            Speaker · 0 followers

            Organizer

            N2
            N2

            NeurIPS 2022

            Account · 954 followers

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Computationally Efficient Horizon-Free Reinforcement Learning for Linear Mixture MDPs
            04:50

            Computationally Efficient Horizon-Free Reinforcement Learning for Linear Mixture MDPs

            Dongruo Zhou, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            PolarMix: A General Data Augmentation Technique for LiDAR Point Clouds
            04:23

            PolarMix: A General Data Augmentation Technique for LiDAR Point Clouds

            Aoran Xiao, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Dynamic Pricing with Monotonicity Constraint under Unknown Parametric Demand Model
            00:59

            Dynamic Pricing with Monotonicity Constraint under Unknown Parametric Demand Model

            Su Jia, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            My considerations on Machine Learning
            25:37

            My considerations on Machine Learning

            Giorgio Parisi

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 1 viewers voted for saving the presentation to eternal vault which is 0.1%

            Learning to Find Proofs and Theorems by Learning to Refine Search Strategies
            05:03

            Learning to Find Proofs and Theorems by Learning to Refine Search Strategies

            Jonathan Laurent, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Tiered Reinforcement Learning: Pessimism in the Face of Uncertainty and Constant Regret
            01:01

            Tiered Reinforcement Learning: Pessimism in the Face of Uncertainty and Constant Regret

            Jiawei Huang, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow NeurIPS 2022