Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: NTK-approximating MLP Fusion for Efficient Language Model Fine-tuning
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-005-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-005-alpha.b-cdn.net
      • sl-yoda-v2-stream-005-beta.b-cdn.net
      • 1034628162.rsc.cdn77.org
      • 1409346856.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            NTK-approximating MLP Fusion for Efficient Language Model Fine-tuning
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            NTK-approximating MLP Fusion for Efficient Language Model Fine-tuning

            Jul 24, 2023

            Sprecher:innen

            TW

            Tianxin Wei

            Řečník · 0 sledujících

            ZG

            Zeming Guo

            Řečník · 0 sledujících

            YC

            Yi-fan Chen

            Řečník · 1 sledující

            Über

            Fine-tuning a pre-trained language model (PLM) emerges as the predominant strategy in many natural language processing applications. However, even fine-tuning the PLMs and doing inference are expensive, especially on edge devices with low computing power. Some general approaches (e.g. quantization and distillation) have been widely studied to reduce the compute/memory of PLM fine-tuning, while very few one-shot compression techniques are explored. In this paper, we investigate the neural tangent…

            Organisator

            I2
            I2

            ICML 2023

            Účet · 657 sledujících

            Gefällt euch das Format? Vertraut auf SlidesLive, um euer nächstes Event festzuhalten!

            Professionelle Aufzeichnung und Livestreaming – weltweit.

            Freigeben

            Empfohlene Videos

            Präsentationen, deren Thema, Kategorie oder Sprecher:in ähnlich sind

            Mixture Proportion Estimation Beyond Irreducibility
            05:12

            Mixture Proportion Estimation Beyond Irreducibility

            Yilun Zhu, …

            I2
            I2
            ICML 2023 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Beyond RLHF: A Human-Centered Approach to AI Development and Evaluation
            41:32

            Beyond RLHF: A Human-Centered Approach to AI Development and Evaluation

            Meredith Ringel Morris

            I2
            I2
            ICML 2023 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Learning GFlowNets From Partial Episodes For Improved Convergence And Stability
            08:41

            Learning GFlowNets From Partial Episodes For Improved Convergence And Stability

            Kanika Madan, …

            I2
            I2
            ICML 2023 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Understanding Gradient Regularization in Deep Learning: Efficient Finite-Difference Computation and  Implicit Bias
            04:52

            Understanding Gradient Regularization in Deep Learning: Efficient Finite-Difference Computation and Implicit Bias

            Ryo Karakida, …

            I2
            I2
            ICML 2023 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            HOOREX: Higher Order Optimizers for 3D Recovery from X-Ray Images
            08:36

            HOOREX: Higher Order Optimizers for 3D Recovery from X-Ray Images

            Karthik Shetty, …

            I2
            I2
            ICML 2023 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            A Deep Conjugate Direction Method for Iteratively Solving Linear Systems
            05:18

            A Deep Conjugate Direction Method for Iteratively Solving Linear Systems

            Ayano Kaneda, …

            I2
            I2
            ICML 2023 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Interessiert an Vorträgen wie diesem? ICML 2023 folgen