Jason Weston, Jing Xu, Da Ju, Joshua Lane, Mojtaba Komeili, Eric Michael Smith, Megan Ung, Morteza Behrooz, William Ngan, Rashel Moritz, Sainbayar Sukhbaatar, Y-Lan Boureau, Kurt Shuster · Improving Open Language Models by Learning from Organic Interactions · SlidesLive

Categories

EN

Log in Talk to sales

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: Improving Open Language Models by Learning from Organic Interactions

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v2-stream-009-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v2-stream-009-alpha.b-cdn.net
sl-yoda-v2-stream-009-beta.b-cdn.net
1766500541.rsc.cdn77.org
1441886916.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Improving Open Language Models by Learning from Organic Interactions

Improving Open Language Models by Learning from Organic Interactions

Jul 28, 2023

Speakers

Jason Weston

Speaker · 0 followers

Jing Xu

Speaker · 0 followers

Da Ju

Speaker · 0 followers

About

We discuss techniques that can be used to learn how to improve AIs (dialogue models) by interacting with organic users ``in the wild''. Training models with organic data is challenging because such interactions include both high quality conversations and feedback, as well as adversarial and toxic behavior. We thus study techniques that enable learning from helpful teachers while avoiding learning from people who are trying to trick the model into unhelpful or toxic responses. We present BlenderB…

Organizer

ICML 2023

Account · 657 followers

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

Enforcing Right to Explanation: Technical Challenges, Solutions, and Opportunities

33:35

Enforcing Right to Explanation: Technical Challenges, Solutions, and Opportunities

Watch later

Favorite

ICML 2023 2 years ago

When Personalization Harms Performance: Reconsidering the Use of Group Attributes in Prediction

05:21

When Personalization Harms Performance: Reconsidering the Use of Group Attributes in Prediction

Watch later

Favorite

Vinith M. Suriyakumar, …

ICML 2023 2 years ago

Over-parametrization via Lifting for Low-rank Matrix Sensing: Conversion of Spurious Solutions to Strict Saddle Points

08:19

Over-parametrization via Lifting for Low-rank Matrix Sensing: Conversion of Spurious Solutions to Strict Saddle Points

Watch later

Favorite

ICML 2023 2 years ago

CO-BED: Information-Theoretic Contextual Optimization via Bayesian Experimental Design

05:26

CO-BED: Information-Theoretic Contextual Optimization via Bayesian Experimental Design

Watch later

Favorite

Desi R. Ivanova, …

ICML 2023 2 years ago

Robust Situational Reinforcement Learning in Face of Context Disturbances

05:11

Robust Situational Reinforcement Learning in Face of Context Disturbances

Watch later

Favorite

Jinpeng Zhang, …

ICML 2023 2 years ago

Learning to Bid in Repeated First-Price Auctions with Budgets

04:54

Learning to Bid in Repeated First-Price Auctions with Budgets

Watch later

Favorite

ICML 2023 2 years ago