Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Hyena Hierarchy: Towards Larger Convolutional Language Models
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-001-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-001-alpha.b-cdn.net
      • sl-yoda-v2-stream-001-beta.b-cdn.net
      • 1824830694.rsc.cdn77.org
      • 1979322955.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Hyena Hierarchy: Towards Larger Convolutional Language Models
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Hyena Hierarchy: Towards Larger Convolutional Language Models

            Jul 25, 2023

            Speakers

            MP

            Michael Poli

            Řečník · 0 sledujících

            SM

            Stefano Massaroli

            Řečník · 0 sledujících

            EN

            Eric Nguyen

            Řečník · 0 sledujících

            About

            Recent advances in deep learning have relied heavily on the use of large Transformers due to their ability to learn at scale. However, the core building block of Transformers, the attention operator, exhibits quadratic cost in sequence length, limiting the amount of context accessible. Existing subquadratic methods based on low-rank and sparse approximations need to be combined with dense attention layers to match Transformers at scale, indicating a gap in capability. In this work, we propose Hy…

            Organizer

            I2
            I2

            ICML 2023

            Účet · 657 sledujících

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Repurposing Density Functional Theory to Suit Deep Learning
            16:47

            Repurposing Density Functional Theory to Suit Deep Learning

            Alexander Mathiasen

            I2
            I2
            ICML 2023 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Robots Learning from Real People
            24:04

            Robots Learning from Real People

            Taylor Kessler Faulkner

            I2
            I2
            ICML 2023 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Dividing and Conquering a BlackBox to a Mixture of Interpretable Models: Route, Interpret, Repeat
            05:13

            Dividing and Conquering a BlackBox to a Mixture of Interpretable Models: Route, Interpret, Repeat

            Shantanu Ghosh, …

            I2
            I2
            ICML 2023 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            The Catalog Problem: Clustering and Ordering Variable-Sized Sets
            05:04

            The Catalog Problem: Clustering and Ordering Variable-Sized Sets

            Mateusz Jurewicz, …

            I2
            I2
            ICML 2023 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Nearly Optimal Algorithms with Sublinear Computational Complexity for Online Kernel Regression
            04:56

            Nearly Optimal Algorithms with Sublinear Computational Complexity for Online Kernel Regression

            Junfan Li, …

            I2
            I2
            ICML 2023 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Efficient Bound of Lipschitz Constant for Convolutional Layers by Gram Iteration
            04:46

            Efficient Bound of Lipschitz Constant for Convolutional Layers by Gram Iteration

            Blaise Delattre, …

            I2
            I2
            ICML 2023 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Interested in talks like this? Follow ICML 2023