Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Pyramid Dynamic Inference: Encouraging Faster Inference via Early Exit Boosting
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-010-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-010-alpha.b-cdn.net
      • sl-yoda-v2-stream-010-beta.b-cdn.net
      • 1759419103.rsc.cdn77.org
      • 1016618226.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Pyramid Dynamic Inference: Encouraging Faster Inference via Early Exit Boosting
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Pyramid Dynamic Inference: Encouraging Faster Inference via Early Exit Boosting

            Dec 2, 2022

            Speakers

            EB

            Ershad Banijamali

            Speaker · 0 followers

            PK

            Pegah Kharazmi

            Speaker · 0 followers

            SE

            Sepehr Eghbali

            Speaker · 0 followers

            About

            Large transformer-based models have demonstrated state of the art results on several Natural Language Understanding (NLU) tasks. However, their deployment comes at the cost of increased footprint and inference latency, limiting their adoption to real-time applications, especially on resource constrained devices. In order to optimize the trade-off between model accuracy, footprint and inference latency, we propose Pyramid Dynamic Inference (PDI), a scheme that encourages fast inference by introdu…

            Organizer

            N2
            N2

            NeurIPS 2022

            Account · 961 followers

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Deep Model Reassembly: Transfer learning as playing with legos
            05:02

            Deep Model Reassembly: Transfer learning as playing with legos

            Xingyi Yang, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Building Language Models Based on Retrieval
            28:54

            Building Language Models Based on Retrieval

            Danqi Chen

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Transformers meet Stochastic Block Models: Attention with Data-Adaptive Sparsity and Cost
            09:56

            Transformers meet Stochastic Block Models: Attention with Data-Adaptive Sparsity and Cost

            Sungjun Cho, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Taking Federated Analytics from Theory to Practice
            52:45

            Taking Federated Analytics from Theory to Practice

            Graham Cormode

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Domain Adaptation: Theory, Algorithms, and Open Library
            24:02

            Domain Adaptation: Theory, Algorithms, and Open Library

            Mingsheng Long

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 1 viewers voted for saving the presentation to eternal vault which is 0.1%

            On the Sample Complexity of Stabilizing LTI Systems on a Single Trajectory
            04:56

            On the Sample Complexity of Stabilizing LTI Systems on a Single Trajectory

            Yang Hu, …

            N2
            N2
            NeurIPS 2022 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow NeurIPS 2022