Nathan Lambert, Dmitry Ustalov · Reinforcement Learning from Human Feedback: A Tutorial * · SlidesLive

Categories

Arts, Design & Media

Category · 1.2k presentations

Business & Economics

Category · 3.8k presentations

Computer Science & IT

Category · 14.8k presentations

Engineering & Technology

Category · 491 presentations

Humanities & Social Sciences

Category · 1.3k presentations

Medicine & Health

Category · 529 presentations

Natural & Formal Sciences

Category · 3.3k presentations

Self Development & Lifestyle

Category · 599 presentations

EN

Log in Talk to sales

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: Reinforcement Learning from Human Feedback: A Tutorial *

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v2-stream-005-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v2-stream-005-alpha.b-cdn.net
sl-yoda-v2-stream-005-beta.b-cdn.net
1034628162.rsc.cdn77.org
1409346856.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Reinforcement Learning from Human Feedback: A Tutorial *

Reinforcement Learning from Human Feedback: A Tutorial *

Jul 24, 2023

Speakers

Nathan Lambert

Speaker · 3 followers

Dmitry Ustalov

Speaker · 2 followers

Organizer

ICML 2023

Account · 469 followers

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

Optimistic Online Mirror Descent for Bridging Stochastic and Adversarial Online Convex Optimization

05:15

Optimistic Online Mirror Descent for Bridging Stochastic and Adversarial Online Convex Optimization

Watch later

Favorite

Sijia Chen, …

ICML 2023 2 years ago

Adversarial Cheap Talk

05:25

Adversarial Cheap Talk

Watch later

Favorite

ICML 2023 2 years ago

Vector Quantized Wasserstein Auto-Encoder

05:19

Vector Quantized Wasserstein Auto-Encoder

Watch later

Favorite

Tung-Long Vuong, …

ICML 2023 2 years ago

A Gromov–Wasserstein Geometric View of Spectrum-Preserving Graph Coarsening

04:57

A Gromov–Wasserstein Geometric View of Spectrum-Preserving Graph Coarsening

Watch later

Favorite

Yi-fan Chen, …

ICML 2023 2 years ago

OpenFE: Automated Feature Generation with Expert-Level Performance

04:51

OpenFE: Automated Feature Generation with Expert-Level Performance

Watch later

Favorite

Tianping Zhang, …

ICML 2023 2 years ago

Self-supervised learning of Split Invariant Equivariant representations

05:15

Self-supervised learning of Split Invariant Equivariant representations

Watch later

Favorite

Quentin Garrido, …

ICML 2023 2 years ago