            Improving Open Language Models by Learning from Organic Interactions

            Jul 28, 2023

            Speakers

            Jason Weston

            Jing Xu

            Da Ju

            About

            We discuss techniques for improving AI dialogue models by learning from interactions with organic users "in the wild". Training on organic data is challenging because such interactions include high-quality conversations and feedback as well as adversarial and toxic behavior. We therefore study techniques that enable learning from helpful teachers while avoiding learning from people who try to trick the model into unhelpful or toxic responses. We present BlenderB…

            Organizer

            ICML 2023
