Zhuohan Li, Eric Wallace, Kevin Lin, Sheng Shen, Kurt Keutzer, Dan Klein, Joseph E. Gonzalez · Train Big, Then Compress: Rethinking Model Size for Efficient Training and Inference of Transformers · SlidesLive

Train Big, Then Compress: Rethinking Model Size for Efficient Training and Inference of Transformers

Jul 12, 2020

Speakers

Zhuohan Li

Eric Wallace

Kevin Lin

About

Since hardware resources are limited, the objective of training deep learning models is typically to maximize accuracy subject to the time and memory constraints of training and inference. We study the impact of model size in this setting, focusing on transformer models for NLP tasks that are limited by compute: BERT pretraining and high-resource machine translation. We first show that even though smaller transformer models execute faster per iteration, wider and deeper models converge in signif…

Organizer

ICML 2020

About ICML 2020

The International Conference on Machine Learning (ICML) is the premier gathering of professionals dedicated to the advancement of the branch of artificial intelligence known as machine learning. ICML is globally renowned for presenting and publishing cutting-edge research on all aspects of machine learning used in closely related areas like artificial intelligence, statistics and data science, as well as important application areas such as machine vision, computational biology, speech recognition, and robotics. ICML is one of the fastest-growing artificial intelligence conferences in the world. Participants at ICML span a wide range of backgrounds, from academic and industrial researchers, to entrepreneurs and engineers, to graduate students and postdocs.
