Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: VS-Quant: Per-vector Scaled Quantization for Accurate Low-Precision Neural Network Inference
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v3-stream-015-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v3-stream-015-alpha.b-cdn.net
      • sl-yoda-v3-stream-015-beta.b-cdn.net
      • 1963568160.rsc.cdn77.org
      • 1940033649.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            VS-Quant: Per-vector Scaled Quantization for Accurate Low-Precision Neural Network Inference
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            VS-Quant: Per-vector Scaled Quantization for Accurate Low-Precision Neural Network Inference

            Apr 4, 2021

            Speakers

            SD

            Steve Dai

            Speaker · 0 followers

            RV

            Rangha Venkatesan

            Speaker · 0 followers

            HR

            Haoxing Ren

            Speaker · 0 followers

            About

            Quantization enables efficient acceleration of deep neural networks by reducing model memory footprint and exploiting low-cost integer math hardware units. Quantization maps floating-point weights and activations in a trained model to low-bitwidth integer values using scale factors. Excessive quantization, reducing precision too aggressively, results in accuracy degradation. When scale factors are shared at a coarse granularity across many dimensions of each tensor, effective precision of indivi…

            Organizer

            M2
            M2

            MLSys 2021

            Account · 159 followers

            Categories

            AI & Data Science

            Category · 10.8k presentations

            About MLSys 2021

            The Conference on Machine Learning and Systems targets research at the intersection of machine learning and systems. The conference aims to elicit new connections amongst these fields, including identifying best practices and design principles for learning systems, as well as developing novel learning methods and theory tailored to practical machine learning workflows.

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            sensAI: ConvNets Decomposition via Class Parallelism for Fast Inference on Live Data
            05:23

            sensAI: ConvNets Decomposition via Class Parallelism for Fast Inference on Live Data

            Guanhua Wang, …

            M2
            M2
            MLSys 2021 4 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Pipelined Backpropagation at Scale: Training Large Models without Batches
            04:14

            Pipelined Backpropagation at Scale: Training Large Models without Batches

            Atli Kosson, …

            M2
            M2
            MLSys 2021 4 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Accelerated Learning by Exploiting Popular Choices
            10:14

            Accelerated Learning by Exploiting Popular Choices

            Muhammad Adnan, …

            M2
            M2
            MLSys 2021 4 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Towards Disaggregated Memory Recommenders
            13:03

            Towards Disaggregated Memory Recommenders

            Talha Imran, …

            M2
            M2
            MLSys 2021 4 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            IOS: Inter-Operator Scheduler for CNN Acceleration
            04:44

            IOS: Inter-Operator Scheduler for CNN Acceleration

            Yaoyao Ding, …

            M2
            M2
            MLSys 2021 4 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Oral: Equality Saturation for Tensor Graph Superoptimization
            19:52

            Oral: Equality Saturation for Tensor Graph Superoptimization

            Yichen Yang, …

            M2
            M2
            MLSys 2021 4 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow MLSys 2021