Apr 4, 2021
Quantization enables efficient acceleration of deep neural networks by reducing model memory footprint and exploiting low-cost integer math hardware units. Quantization maps floating-point weights and activations in a trained model to low-bitwidth integer values using scale factors. Excessive quantization, i.e., reducing precision too aggressively, results in accuracy degradation. When scale factors are shared at a coarse granularity across many dimensions of each tensor, the effective precision of individual elements within the tensor is limited. To reduce quantization-related accuracy loss, we propose using a separate scale factor for each small vector (approximately 16-64 elements) within a single dimension of a tensor. To achieve an efficient hardware implementation, the per-vector scale factors can be implemented with low-bitwidth integers when calibrated using a two-level quantization scheme. We find that per-vector scaling consistently achieves better inference accuracy at low precision than conventional scaling techniques for popular neural networks, without requiring retraining. We also modify a deep learning accelerator hardware design to study the area and energy overheads of per-vector scaling support. Our evaluation demonstrates that per-vector scaled quantization with 4-bit weights and activations achieves 37% area savings and 24% energy savings while maintaining over 75% accuracy for ResNet50 on ImageNet. 4-bit weights and 8-bit activations achieve near-full-precision accuracy for both BERT-base and BERT-large on SQuAD while reducing area by 26% compared to an 8-bit baseline.
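To make the idea concrete, here is a minimal NumPy sketch of per-vector scaled quantization with a two-level scale representation. It is an illustration under stated assumptions, not the paper's implementation: the function names are hypothetical, vectors are taken as contiguous 16-element groups along the last dimension, and scales are calibrated with a simple max-based rule.

```python
import numpy as np

def quantize_per_vector(x, vec_size=16, bits=4):
    """Per-vector scaled quantization sketch (hypothetical helper).

    Splits `x` into contiguous vectors of `vec_size` elements along its
    last dimension and computes one scale factor per vector, so each
    vector maps onto the signed `bits`-bit integer range.
    Assumes x.size is a multiple of vec_size.
    """
    qmax = 2 ** (bits - 1) - 1                      # 7 for signed 4-bit
    v = x.reshape(-1, vec_size)                     # one row per vector
    scale = np.abs(v).max(axis=1, keepdims=True) / qmax
    scale = np.where(scale == 0.0, 1.0, scale)      # guard all-zero vectors
    q = np.clip(np.round(v / scale), -qmax - 1, qmax).astype(np.int8)
    return q.reshape(x.shape), scale

def two_level_scales(scale, scale_bits=4):
    """Second level of the two-level scheme (assumed calibration rule).

    Re-expresses the floating-point per-vector scales as low-bitwidth
    unsigned integer multiples of a single coarse per-tensor scale, so
    only one small integer per vector plus one float must be stored.
    """
    smax = 2 ** scale_bits - 1                      # 15 for 4-bit scales
    coarse = scale.max() / smax                     # one float per tensor
    s_int = np.clip(np.round(scale / coarse), 1, smax).astype(np.uint8)
    return s_int, coarse                            # effective scale ~ s_int * coarse

def dequantize_per_vector(q, s_int, coarse, vec_size=16):
    """Recover approximate floating-point values: q * (s_int * coarse)."""
    v = q.reshape(-1, vec_size).astype(np.float32)
    return (v * (s_int.astype(np.float32) * coarse)).reshape(q.shape)

# Example: quantize a random weight tensor to 4 bits with 16-element vectors.
w = np.random.randn(64, 64).astype(np.float32)
q, scale = quantize_per_vector(w, vec_size=16, bits=4)
s_int, coarse = two_level_scales(scale, scale_bits=4)
w_hat = dequantize_per_vector(q, s_int, coarse, vec_size=16)
print("mean abs error:", np.abs(w - w_hat).mean())
```

The fine granularity helps because a single outlier only widens the scale of its own 16-element vector rather than the whole tensor, so the remaining elements keep more of the 4-bit range; the second level keeps hardware overhead low since each vector carries only a small integer scale alongside one shared floating-point scale.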
The Conference on Machine Learning and Systems targets research at the intersection of machine learning and systems. The conference aims to elicit new connections amongst these fields, including identifying best practices and design principles for learning systems, as well as developing novel learning methods and theory tailored to practical machine learning workflows.
Presentations on a similar topic, in the same category, or by the same speaker
Guanhua Wang, …
Atli Kosson, …
Talha Imran, …
Yaoyao Ding, …
Yichen Yang, …