Gal Dalal, Assaf Hallak, Shie Mannor, Gal Chechik · SoftTreeMax: Policy Gradient with Tree Search · SlidesLive

Categories

EN

Log in Talk to sales

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: SoftTreeMax: Policy Gradient with Tree Search

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v2-stream-001-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v2-stream-001-alpha.b-cdn.net
sl-yoda-v2-stream-001-beta.b-cdn.net
1824830694.rsc.cdn77.org
1979322955.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

SoftTreeMax: Policy Gradient with Tree Search

SoftTreeMax: Policy Gradient with Tree Search

Dec 2, 2022

Speakers

Gal Dalal

Speaker · 0 followers

Assaf Hallak

Speaker · 0 followers

Shie Mannor

Speaker · 1 follower

About

Policy-gradient methods are widely used for learning control policies. They can be easily distributed to multiple workers and reach state-of-the-art results in many domains. Unfortunately, they exhibit large variance and subsequently suffer from high-sample complexity since they aggregate gradients over entire trajectories. At the other extreme, planning methods, like tree search, optimize the policy using single-step transitions that consider future lookahead. These approaches have been mainly…

Organizer

NeurIPS 2022

Account · 961 followers

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

A Framework for Generating Dangerous Scenes for Testing Robustness

03:46

A Framework for Generating Dangerous Scenes for Testing Robustness

Watch later

Favorite

Shengjie Xu, …

NeurIPS 2022 2 years ago

$MWP-BERT: A Numeracy-augmented Pre-trained Encoder for Math Word Problems$

04:55

MWP-BERT: A Numeracy-augmented Pre-trained Encoder for Math Word Problems

Watch later

Favorite

Zhenwen Liang, …

NeurIPS 2022 2 years ago

TA-GATES: An Encoding Scheme for Neural Network Architectures

04:59

TA-GATES: An Encoding Scheme for Neural Network Architectures

Watch later

Favorite

Xuefei Ning, …

NeurIPS 2022 2 years ago

GLIPv2: Unifying Localization and Vision-Language Understanding

05:34

GLIPv2: Unifying Localization and Vision-Language Understanding

Watch later

Favorite

Haotian Zhang, …

NeurIPS 2022 2 years ago

SeqPATE: Differentially Private Text Generation via Knowledge Distillation

04:30

SeqPATE: Differentially Private Text Generation via Knowledge Distillation

Watch later

Favorite

Zhiliang Tian, …

NeurIPS 2022 2 years ago

On the Spectral Bias of Convolutional Neural Tangent and Gaussian Process Kernels

04:59

On the Spectral Bias of Convolutional Neural Tangent and Gaussian Process Kernels

Watch later

Favorite

Amnon Geifman, …

NeurIPS 2022 2 years ago