Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: The case for 4-bit precision: k-bit Inference Scaling Laws
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-007-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-007-alpha.b-cdn.net
      • sl-yoda-v2-stream-007-beta.b-cdn.net
      • 1678031076.rsc.cdn77.org
      • 1932936657.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            The case for 4-bit precision: k-bit Inference Scaling Laws
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            The case for 4-bit precision: k-bit Inference Scaling Laws

            Jul 24, 2023

            Sprecher:innen

            TD

            Tim Dettmers

            Řečník · 0 sledujících

            LZ

            Luke Zettlemoyer

            Řečník · 5 sledujících

            Über

            Quantization methods reduce the number of bits required to represent each parameter in a model, trading accuracy for smaller memory footprints and inference latencies. However, the final model size depends on both the number of parameters of the original model and the rate of compression. For example, a 30B 8-bit model and a 60B 4-bit model have the same number of bits but may have very different zero-shot accuracies. In this work, we study this trade-off by developing inference scaling laws of…

            Organisator

            I2
            I2

            ICML 2023

            Účet · 657 sledujících

            Gefällt euch das Format? Vertraut auf SlidesLive, um euer nächstes Event festzuhalten!

            Professionelle Aufzeichnung und Livestreaming – weltweit.

            Freigeben

            Empfohlene Videos

            Präsentationen, deren Thema, Kategorie oder Sprecher:in ähnlich sind

            Universal Physics-Informed Neural Networks:
            05:19

            Universal Physics-Informed Neural Networks:

            Lena Podina, …

            I2
            I2
            ICML 2023 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Revisiting Weighted Aggregation in Federated Learning with Neural Networks
            05:12

            Revisiting Weighted Aggregation in Federated Learning with Neural Networks

            Zexi Li, …

            I2
            I2
            ICML 2023 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Provable Reset-free Reinforcement Learning by No-Regret Reduction
            05:14

            Provable Reset-free Reinforcement Learning by No-Regret Reduction

            Hoai-An Nguyen, …

            I2
            I2
            ICML 2023 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Minimum Width of Leaky-ReLU Neural Networks for Uniform Universal Approximation
            04:51

            Minimum Width of Leaky-ReLU Neural Networks for Uniform Universal Approximation

            Liang Li, …

            I2
            I2
            ICML 2023 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Invited Talk: Jack Balkin
            18:02

            Invited Talk: Jack Balkin

            Jack Balkin

            I2
            I2
            ICML 2023 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Low-Variance Gradient Estimation in Unrolled Computation Graphs with ES-Single
            05:51

            Low-Variance Gradient Estimation in Unrolled Computation Graphs with ES-Single

            Paul Vicol, …

            I2
            I2
            ICML 2023 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Interessiert an Vorträgen wie diesem? ICML 2023 folgen