Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: SoftTreeMax: Policy Gradient with Tree Search
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-001-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-001-alpha.b-cdn.net
      • sl-yoda-v2-stream-001-beta.b-cdn.net
      • 1824830694.rsc.cdn77.org
      • 1979322955.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            SoftTreeMax: Policy Gradient with Tree Search
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            SoftTreeMax: Policy Gradient with Tree Search

            Dec 2, 2022

            Speakers

            GD

            Gal Dalal

            Speaker · 0 followers

            AH

            Assaf Hallak

            Speaker · 0 followers

            SM

            Shie Mannor

            Speaker · 1 follower

            About

            Policy-gradient methods are widely used for learning control policies. They can be easily distributed to multiple workers and reach state-of-the-art results in many domains. Unfortunately, they exhibit large variance and subsequently suffer from high-sample complexity since they aggregate gradients over entire trajectories. At the other extreme, planning methods, like tree search, optimize the policy using single-step transitions that consider future lookahead. These approaches have been mainly…

            Organizer

            N2
            N2

            NeurIPS 2022

            Account · 961 followers

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            A Framework for Generating Dangerous Scenes for Testing Robustness
            03:46

            A Framework for Generating Dangerous Scenes for Testing Robustness

            Shengjie Xu, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            MWP-BERT: A Numeracy-augmented Pre-trained Encoder for Math Word Problems
            04:55

            MWP-BERT: A Numeracy-augmented Pre-trained Encoder for Math Word Problems

            Zhenwen Liang, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            TA-GATES: An Encoding Scheme for Neural Network Architectures
            04:59

            TA-GATES: An Encoding Scheme for Neural Network Architectures

            Xuefei Ning, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            GLIPv2: Unifying Localization and Vision-Language Understanding
            05:34

            GLIPv2: Unifying Localization and Vision-Language Understanding

            Haotian Zhang, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            SeqPATE: Differentially Private Text Generation via Knowledge Distillation
            04:30

            SeqPATE: Differentially Private Text Generation via Knowledge Distillation

            Zhiliang Tian, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            On the Spectral Bias of Convolutional Neural Tangent and Gaussian Process Kernels
            04:59

            On the Spectral Bias of Convolutional Neural Tangent and Gaussian Process Kernels

            Amnon Geifman, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow NeurIPS 2022