Jingkang Wang, Hongyi Guo, Zhaowei Zhu, Yang Liu · Policy Learning Using Weak Supervision · SlidesLive

Categories

EN

Log in Talk to sales

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: Policy Learning Using Weak Supervision

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v2-stream-005-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v2-stream-005-alpha.b-cdn.net
sl-yoda-v2-stream-005-beta.b-cdn.net
1034628162.rsc.cdn77.org
1409346856.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Policy Learning Using Weak Supervision

Policy Learning Using Weak Supervision

Dec 6, 2021

Speakers

Jingkang Wang

Speaker · 0 followers

Hongyi Guo

Speaker · 0 followers

Zhaowei Zhu

Speaker · 0 followers

About

Most existing policy learning solutions require the learning agents to receive high-quality supervision signals, e.g., rewards in reinforcement learning (RL) or high-quality expert demonstrations in behavioral cloning (BC). These quality supervisions are either infeasible or prohibitively expensive to obtain in practice. We aim for a unified framework that leverages the available cheap weak supervisions to perform policy learning efficiently. To handle this problem, we treat the weak supervision…

Organizer

NeurIPS 2021

Account · 1.9k followers

About NeurIPS 2021

Neural Information Processing Systems (NeurIPS) is a multi-track machine learning and computational neuroscience conference that includes invited talks, demonstrations, symposia and oral and poster presentations of refereed papers. Following the conference, there are workshops which provide a less formal setting.

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

Iterative Methods for Private Synthetic Data: Unifying Framework and New Methods

03:09

Iterative Methods for Private Synthetic Data: Unifying Framework and New Methods

Watch later

Favorite

Terrance Liu, …

NeurIPS 2021 3 years ago

Deep Residual Learning in Spiking Neural Networks

14:05

Deep Residual Learning in Spiking Neural Networks

Watch later

Favorite

NeurIPS 2021 3 years ago

Activation Sharing with Asymmetric Paths Solves Weight Transport Problem without Bidirectional Connection

11:41

Activation Sharing with Asymmetric Paths Solves Weight Transport Problem without Bidirectional Connection

Watch later

Favorite

Sunghyeon Woo, …

NeurIPS 2021 3 years ago

Benign Overfitting

1:53:10

Benign Overfitting

Watch later

Favorite

NeurIPS 2021 3 years ago

The Banality of Scale: A Theory on the Limits of Modeling Bias and Fairness Frameworks for Social Justice (and other lessons from the Pandemic)

2:04:54

The Banality of Scale: A Theory on the Limits of Modeling Bias and Fairness Frameworks for Social Justice (and other lessons from the Pandemic)

Watch later

Favorite

NeurIPS 2021 3 years ago

Why We Want Contrastive Learning in Language Models

38:06

Why We Want Contrastive Learning in Language Models

Watch later

Favorite

NeurIPS 2021 3 years ago