Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: The case for 4-bit precision: k-bit Inference Scaling Laws
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-007-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-007-alpha.b-cdn.net
      • sl-yoda-v2-stream-007-beta.b-cdn.net
      • 1678031076.rsc.cdn77.org
      • 1932936657.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            The case for 4-bit precision: k-bit Inference Scaling Laws
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            The case for 4-bit precision: k-bit Inference Scaling Laws

            Jul 24, 2023

            Speakers

            TD

            Tim Dettmers

            Speaker · 0 followers

            LZ

            Luke Zettlemoyer

            Speaker · 5 followers

            About

            Quantization methods reduce the number of bits required to represent each parameter in a model, trading accuracy for smaller memory footprints and inference latencies. However, the final model size depends on both the number of parameters of the original model and the rate of compression. For example, a 30B 8-bit model and a 60B 4-bit model have the same number of bits but may have very different zero-shot accuracies. In this work, we study this trade-off by developing inference scaling laws of…

            Organizer

            I2
            I2

            ICML 2023

            Account · 657 followers

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Policy Regularization with Dataset Constraint for Offline Reinforcement Learning
            05:02

            Policy Regularization with Dataset Constraint for Offline Reinforcement Learning

            Yuhang Ran, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation
            05:02

            MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation

            Omer Bar-Tal, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            LookupFFN: Making Transformers Compute-lite for CPU inference
            05:18

            LookupFFN: Making Transformers Compute-lite for CPU inference

            Zhanpeng Zeng, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Global Convergence of Sub-gradient Method for Robust Matrix Recovery: Small Initialization, Noisy Measurements, and Over-parameterization
            05:17

            Global Convergence of Sub-gradient Method for Robust Matrix Recovery: Small Initialization, Noisy Measurements, and Over-parameterization

            Jianhao Ma, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            textual transform coding
            34:17

            textual transform coding

            Tsachy Weissman

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Multi-Modal Biomarker Extraction Framework for Therapy Monitoring of Social Anxiety and Depression Using Audio and Video
            11:43

            Multi-Modal Biomarker Extraction Framework for Therapy Monitoring of Social Anxiety and Depression Using Audio and Video

            Paula Andrea Pérez-Toro, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow ICML 2023