            Magneto: A Foundation Transformer

            Jul 24, 2023

            Speakers

            Hongyu Wang

            Shuming Ma

            Shaohan Huang

            About

            A big convergence of model architectures across language, vision, speech, and multimodal is emerging. However, under the same name "Transformers", the above areas use different implementations for better performance, e.g., Post-LayerNorm for BERT, and Pre-LayerNorm for GPT and vision Transformers. We call for the development of Foundation Transformer for true general-purpose modeling, which serves as a go-to architecture for various tasks and modalities with guaranteed training stability. In thi…
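
The two implementations named in the abstract differ only in where layer normalization sits relative to the residual connection. The following is a minimal pure-Python sketch of that difference, not the method proposed in the talk. The names `layer_norm`, `toy_sublayer`, `post_ln_block`, and `pre_ln_block` are hypothetical, the normalization omits the learned scale and shift, and `toy_sublayer` is an assumed stand-in for an attention or feed-forward sublayer.

```python
import math

def layer_norm(x, eps=1e-5):
    """Normalize a vector to zero mean and (near) unit variance; no learned scale/shift."""
    mean = sum(x) / len(x)
    var = sum((v - mean) ** 2 for v in x) / len(x)
    return [(v - mean) / math.sqrt(var + eps) for v in x]

def toy_sublayer(x):
    """Hypothetical stand-in for attention or a feed-forward network."""
    return [2.0 * v + 1.0 for v in x]

def post_ln_block(x, sublayer):
    # Post-LayerNorm: add the residual first, then normalize the sum.
    return layer_norm([a + b for a, b in zip(x, sublayer(x))])

def pre_ln_block(x, sublayer):
    # Pre-LayerNorm: normalize only the sublayer input; the residual path is not normalized.
    return [a + b for a, b in zip(x, sublayer(layer_norm(x)))]

x = [1.0, 2.0, 4.0]
post = post_ln_block(x, toy_sublayer)
pre = pre_ln_block(x, toy_sublayer)
```

In this sketch the Post-LayerNorm output is always re-normalized, while the Pre-LayerNorm output keeps the unnormalized input on its residual path, so its scale can grow as blocks are stacked.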

            Organizer

            ICML 2023

