            Exploring Length Generalization in Large Language Models

            Nov 28, 2022

            Speakers

            Cem Anil
            Yuhuai Wu
            Anders Andreassen

            About

            The ability to extrapolate from short problem instances to longer ones is an important form of out-of-distribution generalization in reasoning tasks, and is crucial when learning from datasets where longer problem instances are rare. These include theorem proving, solving quantitative mathematics problems, and reading/summarizing novels. In this paper, we run careful empirical studies exploring the length generalization capabilities of transformer-based language models. We first establish that n…

            Organizer

            NeurIPS 2022
