Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: Expected Gradients of Maxout Networks and Consequences to Parameter Initialization
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-005-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-005-alpha.b-cdn.net
      • sl-yoda-v2-stream-005-beta.b-cdn.net
      • 1034628162.rsc.cdn77.org
      • 1409346856.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            Expected Gradients of Maxout Networks and Consequences to Parameter Initialization
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            Expected Gradients of Maxout Networks and Consequences to Parameter Initialization

            Jul 24, 2023

            Speakers

            HT

            Hanna Tseran

            Speaker · 0 followers

            GM

            Guido Montúfar

            Speaker · 1 follower

            About

            We study the gradients of a maxout network with respect to inputs and parameters and obtain bounds for the moments depending on the architecture and the parameter distribution. We observe that the distribution of the input-output Jacobian depends on the input, which complicates a stable parameter initialization. Based on the moments of the gradients, we formulate parameter initialization strategies that avoid vanishing and exploding gradients in wide networks. Experiments with deep fully-connect…

            Organizer

            I2
            I2

            ICML 2023

            Account · 657 followers

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            A Scalable Frank-Wolfe-Based Algorithm for the Max-Cut SDP
            05:01

            A Scalable Frank-Wolfe-Based Algorithm for the Max-Cut SDP

            Chi Bach Pham, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            H-Consistency Bounds for Pairwise Misranking Loss Surrogates
            05:12

            H-Consistency Bounds for Pairwise Misranking Loss Surrogates

            Anqi Mao, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Off-Policy Evaluation for Large Action Spaces via Conjunct Effect Modeling
            05:37

            Off-Policy Evaluation for Large Action Spaces via Conjunct Effect Modeling

            Yuta Saito, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            General Sequential Episodic Memory Model
            04:35

            General Sequential Episodic Memory Model

            Arjun Karuvally, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            HETAL: Efficient Privacy-preserving Transfer Learning with Homomorphic Encryption
            04:52

            HETAL: Efficient Privacy-preserving Transfer Learning with Homomorphic Encryption

            Seewoo Lee, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            MEWL: Few-shot multimodal word learning with referential uncertainty
            04:32

            MEWL: Few-shot multimodal word learning with referential uncertainty

            Guangyuan Jiang, …

            I2
            I2
            ICML 2023 2 years ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow ICML 2023