Zixiang Chen, Chris Junchi Li, Angela Yuan, Quanquan Gu, Michael I. Jordan · A General Framework for Sample-Efficient Function Approximation in Reinforcement Learning

A General Framework for Sample-Efficient Function Approximation in Reinforcement Learning

Dec 2, 2022

Speakers

Zixiang Chen

Chris Junchi Li

Angela Yuan

About

With the increasing need for handling large state and action spaces, general function approximation has become a key technique in reinforcement learning problems. In this paper, we propose a unified framework that integrates both model-based and model-free reinforcement learning and subsumes nearly all Markov decision process (MDP) models in the existing literature for tractable RL. We propose a novel estimation function with decomposable structural properties for optimization-based exploration…

Organizer

NeurIPS 2022
