Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Train Big, Then Compress: Rethinking Model Size for Efficient Training and Inference of Transformers
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v3-stream-015-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v3-stream-015-alpha.b-cdn.net
      • sl-yoda-v3-stream-015-beta.b-cdn.net
      • 1963568160.rsc.cdn77.org
      • 1940033649.rsc.cdn77.org
      • Subtitles
      • Off
      • en
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Train Big, Then Compress: Rethinking Model Size for Efficient Training and Inference of Transformers
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Train Big, Then Compress: Rethinking Model Size for Efficient Training and Inference of Transformers

            Jul 12, 2020

            Speakers

            ZL

            Zhuohan Li

            Speaker · 1 follower

            EW

            Eric Wallace

            Speaker · 2 followers

            KL

            Kevin Lin

            Speaker · 0 followers

            About

            Since hardware resources are limited, the objective of training deep learning models is typically to maximize accuracy subject to the time and memory constraints of training and inference. We study the impact of model size in this setting, focusing on transformer models for NLP tasks that are limited by compute: BERT pretraining and high-resource machine translation. We first show that even though smaller transformer models execute faster per iteration, wider and deeper models converge in signif…

            Organizer

            I2
            I2

            ICML 2020

            Account · 2.7k followers

            Categories

            About ICML 2020

            The International Conference on Machine Learning (ICML) is the premier gathering of professionals dedicated to the advancement of the branch of artificial intelligence known as machine learning. ICML is globally renowned for presenting and publishing cutting-edge research on all aspects of machine learning used in closely related areas like artificial intelligence, statistics and data science, as well as important application areas such as machine vision, computational biology, speech recognition, and robotics. ICML is one of the fastest growing artificial intelligence conferences in the world. Participants at ICML span a wide range of backgrounds, from academic and industrial researchers, to entrepreneurs and engineers, to graduate students and postdocs.

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Interested in talks like this? Follow ICML 2020