Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Query-Dependent Prompt Evaluation and Optimization with Offline Inverse RL
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v3-stream-006-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v3-stream-006-alpha.b-cdn.net
      • sl-yoda-v3-stream-006-beta.b-cdn.net
      • 1375548855.rsc.cdn77.org
      • 1312734894.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Query-Dependent Prompt Evaluation and Optimization with Offline Inverse RL
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Query-Dependent Prompt Evaluation and Optimization with Offline Inverse RL

            Dec 15, 2023

            Speakers

            HS

            Hao Sun

            Speaker · 2 followers

            AH

            Alihan Hüyük

            Speaker · 0 followers

            MvdS

            Mihaela van der Schaar

            Speaker · 5 followers

            About

            In this study, we aim to enhance the arithmetic reasoning ability of Large Language Models (LLMs) through zero-shot prompt optimization. We identify a previously overlooked objective of query dependency in such optimization and elucidate two ensuing challenges that impede the successful and economical design of prompt optimization techniques. We introduce Prompt-OIRL, which harnesses offline inverse reinforcement learning to draw insights from offline prompting demonstration data. Such data exis…

            Organizer

            N2
            N2

            NeurIPS 2023

            Account · 622 followers

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            The State of LLMs: Research, Applications, Safety, and Predictions
            33:04

            The State of LLMs: Research, Applications, Safety, and Predictions

            Elvis Saravia

            N2
            N2
            NeurIPS 2023 15 months ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            L-CAD: Language-based Colorization with Any-level Descriptions using Diffusion Priors
            04:48

            L-CAD: Language-based Colorization with Any-level Descriptions using Diffusion Priors

            Zheng Chang, …

            N2
            N2
            NeurIPS 2023 15 months ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Learning from Visual Observation via Offline Pretrained State-to-Go Transformer
            04:43

            Learning from Visual Observation via Offline Pretrained State-to-Go Transformer

            Bohan Zhou, …

            N2
            N2
            NeurIPS 2023 15 months ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Trading-off price for data quality to achieve fair online allocation
            04:55

            Trading-off price for data quality to achieve fair online allocation

            Mathieu Molina, …

            N2
            N2
            NeurIPS 2023 15 months ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Counterfactual-Augmented Importance Sampling for Semi-Offline Policy Evaluation
            05:10

            Counterfactual-Augmented Importance Sampling for Semi-Offline Policy Evaluation

            Shengpu Tang, …

            N2
            N2
            NeurIPS 2023 15 months ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Efficient Adversarial Contrastive Learning via Robustness-Aware Coreset Selection
            04:58

            Efficient Adversarial Contrastive Learning via Robustness-Aware Coreset Selection

            Xilie Xu, …

            N2
            N2
            NeurIPS 2023 15 months ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow NeurIPS 2023