Matthew Macfarlane, Herke van Hoof, Diederik Roijers, Wan Jui · Training graph neural networks with policy gradients to perform tree search · SlidesLive

Kategorie

CS

Přihlásit se Kontaktujte nás

Další

Živý přenos začne již brzy!

Živý přenos již skončil.

Prezentace ještě nebyla nahrána!

SlidesLive

title: Training graph neural networks with policy gradients to perform tree search

0:00 / 0:00

Nahlásit chybu
Nastavení
Playlisty
Záložky
Titulky Off
Rychlost přehrávání
Kvalita

Nastavení
Debug informace
Server sl-yoda-v2-stream-002-alpha.b-cdn.net
Velikost titulků Střední

Záložky

Server
sl-yoda-v2-stream-002-alpha.b-cdn.net
sl-yoda-v2-stream-002-beta.b-cdn.net
1001562353.rsc.cdn77.org
1075090661.rsc.cdn77.org

Titulky
Off
English

Rychlost přehrávání

Kvalita

Velikost titulků
Velké
Střední
Malé

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Training graph neural networks with policy gradients to perform tree search

Training graph neural networks with policy gradients to perform tree search

2. prosince 2022

Řečníci

Matthew Macfarlane

Řečník · 0 sledujících

Herke van Hoof

Řečník · 0 sledujících

Diederik Roijers

Řečník · 0 sledujících

O prezentaci

Monte Carlo Tree Search has shown to be a well-performing approach for decision problems such as board games and Atari games, but relies on heuristic design decisions that are non-adaptive and not necessarily optimal for all problems. Learned policies and value functions can augment MCTS by leveraging the state information at the nodes in the search tree. However, these learned functions do not take the search tree structure into account and can be sensitive to value estimation errors. In this p…

Organizátor

NeurIPS 2022

Účet · 961 sledujících

Baví vás formát? Nechte SlidesLive zachytit svou akci!

Profesionální natáčení a streamování po celém světě.

Sdílení

Doporučená videa

Prezentace na podobné téma, kategorii nebo přednášejícího

When to Update Your Model: Constrained Model-based Reinforcement Learning

04:18

When to Update Your Model: Constrained Model-based Reinforcement Learning

Zhlédnout později

Oblíbené

Tianying Ji, …

NeurIPS 2022 2 years ago

Active-Passive SimStereo - Benchmarking the Cross-Generalization Capabilities of DL Stereo Methods

04:41

Active-Passive SimStereo - Benchmarking the Cross-Generalization Capabilities of DL Stereo Methods

Zhlédnout později

Oblíbené

Laurent Jospin, …

NeurIPS 2022 2 years ago

A Multi-Scale Deep Learning Framework for Projecting Weather Extremes

10:25

A Multi-Scale Deep Learning Framework for Projecting Weather Extremes

Zhlédnout později

Oblíbené

Antoine Blanchard, …

NeurIPS 2022 2 years ago

DTG-SSOD: Dense Teacher Guidance for Semi-Supervised Object Detection

04:42

DTG-SSOD: Dense Teacher Guidance for Semi-Supervised Object Detection

Zhlédnout později

Oblíbené

NeurIPS 2022 2 years ago

A Unified Hard-Constraint Framework for Solving Geometrically Complex PDEs

04:25

A Unified Hard-Constraint Framework for Solving Geometrically Complex PDEs

Zhlédnout později

Oblíbené

Songming Liu, …

NeurIPS 2022 2 years ago

Combining Implicit & Explicit Regularization for Efficient Learning in Deep Networks

05:04

Combining Implicit & Explicit Regularization for Efficient Learning in Deep Networks

Zhlédnout později

Oblíbené

NeurIPS 2022 2 years ago