Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-003-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-003-alpha.b-cdn.net
      • sl-yoda-v2-stream-003-beta.b-cdn.net
      • 1544410162.rsc.cdn77.org
      • 1005514182.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness

            Nov 28, 2022

            Speakers

            TD

            Tri Dao

            Speaker · 3 followers

            DYF

            Daniel Y. Fu

            Speaker · 0 followers

            SE

            Stefano Ermon

            Speaker · 15 followers

            About

            Transformers are slow and memory-hungry on long sequences, since the time and memory complexity of self-attention are quadratic in sequence length. Approximate attention methods have attempted to address this problem by trading off model quality to reduce the compute complexity, but often do not achieve wall-clock speedup. We argue that a missing principle is making attention algorithms IO-aware—accounting for reads and writes between levels of GPU memory. We propose FlashAttention, an IO-aware…

            Organizer

            N2
            N2

            NeurIPS 2022

            Account · 953 followers

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Audio-Driven Co-Speech Gesture Image Generation
            01:02

            Audio-Driven Co-Speech Gesture Image Generation

            Xian Liu, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Generative multitask learning mitigates target-causing confounding
            05:15

            Generative multitask learning mitigates target-causing confounding

            Taro Makino, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Equivariant Networks for Crystal Structures
            04:31

            Equivariant Networks for Crystal Structures

            Sékou-Oumar Kaba, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            AniFaceGAN: Animatable 3D-Aware Face Image Generation for Video Avatars
            04:39

            AniFaceGAN: Animatable 3D-Aware Face Image Generation for Video Avatars

            Yue Wu, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Online Minimax Multiobjective Optimization: Multicalibeating and Other Applications
            05:36

            Online Minimax Multiobjective Optimization: Multicalibeating and Other Applications

            Daniel Lee, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Conditional Moment Alignment for Improved Generalization in  Federated Learning
            11:59

            Conditional Moment Alignment for Improved Generalization in Federated Learning

            Jayanth Regatti, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow NeurIPS 2022