Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: SGD with large step sizes learns sparse features
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-005-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-005-alpha.b-cdn.net
      • sl-yoda-v2-stream-005-beta.b-cdn.net
      • 1034628162.rsc.cdn77.org
      • 1409346856.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            SGD with large step sizes learns sparse features
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            SGD with large step sizes learns sparse features

            Jul 24, 2023

            Sprecher:innen

            MA

            Maksym Andriushchenko

            Řečník · 0 sledujících

            AV

            Aditya Varre

            Řečník · 0 sledujících

            LP

            Loucas Pillaud-Vivien

            Řečník · 0 sledujících

            Über

            We showcase important features of the dynamics of the Stochastic Gradient Descent (SGD) in the training of neural networks. We present empirical observations that commonly used large step sizes (i) may lead the iterates to jump from one side of a valley to the other causing loss stabilization, and (ii) this stabilization induces a hidden stochastic dynamics that biases it implicitly toward simple predictors. Furthermore, we show empirically that the longer large step sizes keep SGD high in the l…

            Organisator

            I2
            I2

            ICML 2023

            Účet · 657 sledujících

            Gefällt euch das Format? Vertraut auf SlidesLive, um euer nächstes Event festzuhalten!

            Professionelle Aufzeichnung und Livestreaming – weltweit.

            Freigeben

            Empfohlene Videos

            Präsentationen, deren Thema, Kategorie oder Sprecher:in ähnlich sind

            Define, Evaluate, and Improve Task-Oriented Cognitive Capabilities for Instruction Generation Models
            09:01

            Define, Evaluate, and Improve Task-Oriented Cognitive Capabilities for Instruction Generation Models

            Lingjun Zhao, …

            I2
            I2
            ICML 2023 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Reprogramming Pretrained Language Models for Antibody Sequence Infilling
            05:29

            Reprogramming Pretrained Language Models for Antibody Sequence Infilling

            Igor Melnyk, …

            I2
            I2
            ICML 2023 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Causal Bounds in Quasi-Markovian Graphs
            05:32

            Causal Bounds in Quasi-Markovian Graphs

            Madhumitha Shridharan, …

            I2
            I2
            ICML 2023 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient
            05:44

            SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient

            Max Ryabinin, …

            I2
            I2
            ICML 2023 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Taxonomy-Structured Domain Adaptation (TSDA)
            05:05

            Taxonomy-Structured Domain Adaptation (TSDA)

            Tianyi Liu, …

            I2
            I2
            ICML 2023 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Neurosymbolic Learning as a Path to Learning with Guarantees
            25:51

            Neurosymbolic Learning as a Path to Learning with Guarantees

            Armando Solar-Lezama

            I2
            I2
            ICML 2023 2 years ago

            Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

            Interessiert an Vorträgen wie diesem? ICML 2023 folgen