Charlie Blake, Douglas Orr, Carlo Luschi · What is unit scaling? · SlidesLive

Categories

EN

Log in Talk to sales

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: What is unit scaling?

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v2-stream-004-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v2-stream-004-alpha.b-cdn.net
sl-yoda-v2-stream-004-beta.b-cdn.net
1685195716.rsc.cdn77.org
1239898752.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

What is unit scaling?

What is unit scaling?

Jul 24, 2023

Speakers

Charlie Blake

Speaker · 0 followers

Douglas Orr

Speaker · 0 followers

Carlo Luschi

Speaker · 0 followers

About

We present unit scaling, a paradigm for designing deep learning models that simplifies the use of low-precision number formats. Training in FP16 or the recently proposed FP8 formats offers substantial efficiency gains, but can lack sufficient range for out-of-the-box training. Unit scaling addresses this by introducing a principled approach to model numerics: seeking unit variance of all weights, activations and gradients at initialisation. Unlike alternative methods, this approach neither requi…

Organizer

ICML 2023

Account · 657 followers

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

Learning Intuitive Policies Using Action Features

04:43

Learning Intuitive Policies Using Action Features

Watch later

Favorite

Mingwei Ma, …

ICML 2023 2 years ago

On Balancing Bias and Variance in Unsupervised Multi-Source-Free Domain Adaptation

05:24

On Balancing Bias and Variance in Unsupervised Multi-Source-Free Domain Adaptation

Watch later

Favorite

Maohao Shen, …

ICML 2023 2 years ago

Improving Visual Prompt Tuning for Self-supervised Vision Transformers

04:55

Improving Visual Prompt Tuning for Self-supervised Vision Transformers

Watch later

Favorite

Seungryong Yoo, …

ICML 2023 2 years ago

A unified recipe for deriving (time-uniform) PAC-Bayes bounds

43:06

A unified recipe for deriving (time-uniform) PAC-Bayes bounds

Watch later

Favorite

ICML 2023 2 years ago

Predictable MDP Abstraction for Unsupervised Model-Based RL

05:27

Predictable MDP Abstraction for Unsupervised Model-Based RL

Watch later

Favorite

Seohong Park, …

ICML 2023 2 years ago

Personalized Subgraph Federated Learning

04:46

Personalized Subgraph Federated Learning

Watch later

Favorite

Jinheon Baek, …

ICML 2023 2 years ago