Hao Sun, Alihan Hüyük, Mihaela van der Schaar · Query-Dependent Prompt Evaluation and Optimization with Offline Inverse RL · SlidesLive

Query-Dependent Prompt Evaluation and Optimization with Offline Inverse RL

Dec 15, 2023

Speakers

Hao Sun

Alihan Hüyük

Mihaela van der Schaar

About

In this study, we aim to enhance the arithmetic reasoning ability of Large Language Models (LLMs) through zero-shot prompt optimization. We identify a previously overlooked objective of query dependency in such optimization and elucidate two ensuing challenges that impede the successful and economical design of prompt optimization techniques. We introduce Prompt-OIRL, which harnesses offline inverse reinforcement learning to draw insights from offline prompting demonstration data. Such data exis…

Organizer

NeurIPS 2023
