Janaka Chathuranga Brahmanage, Jiajing Ling, Akshat Kumar · FlowPG: Action-constrained Policy Gradient with Normalizing Flows · SlidesLive


FlowPG: Action-constrained Policy Gradient with Normalizing Flows


Dec 10, 2023

Speakers

Janaka Chathuranga Brahmanage


Jiajing Ling


Akshat Kumar


About

Action-constrained reinforcement learning (ACRL) is a popular approach for solving safety-critical and resource-allocation related decision making problems. However, one of the major challenges in solving ACRL is to find valid actions that satisfy the constraints in each RL step. While adding a projection layer on top of the original policy network is a commonly used approach, it involves solving a mathematical program, either during training or in action execution, or both, which can result in…
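To make the projection step mentioned in the abstract concrete, here is a minimal sketch of projecting a raw policy action onto a feasible set. It assumes a deliberately simple constraint, an L2-norm bound on the action, chosen only because the projection then has a closed form. The function name `project_onto_l2_ball` and the `radius` parameter are hypothetical and do not come from the talk. For general constraint sets, the projection is an optimization problem that typically needs a numerical solver at every step, which is the cost the abstract points to.

```python
import math
from typing import List


def project_onto_l2_ball(action: List[float], radius: float) -> List[float]:
    """Euclidean projection of `action` onto {a : ||a||_2 <= radius}.

    Illustrative assumption: the feasible set is an L2 ball, so the
    nearest feasible point is found by rescaling, with no solver needed.
    """
    if radius < 0:
        raise ValueError("radius must be non-negative")
    norm = math.sqrt(sum(x * x for x in action))
    if norm <= radius:
        # Already feasible: the projection is the action itself.
        return list(action)
    # Infeasible: shrink the action onto the boundary of the ball.
    scale = radius / norm
    return [x * scale for x in action]


# Hypothetical raw output of a policy network that violates the constraint.
raw_action = [3.0, 4.0]
safe_action = project_onto_l2_ball(raw_action, radius=1.0)
print(safe_action)
```

A feasible action passes through unchanged, while an infeasible one is mapped to the closest point on the constraint boundary.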

Organizer

NeurIPS 2023

