Prin Phunyaphibarn, Junghyun Lee, Bohan Wang, Huishuai Zhang, Chulhee Yun · Large Catapults in Momentum Gradient Descent with Warmup: An Empirical Study · SlidesLive

Categories

EN

Log in Talk to sales

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: Large Catapults in Momentum Gradient Descent with Warmup: An Empirical Study

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v3-stream-008-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v3-stream-008-alpha.b-cdn.net
sl-yoda-v3-stream-008-beta.b-cdn.net
1231929869.rsc.cdn77.org
1266089239.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Large Catapults in Momentum Gradient Descent with Warmup: An Empirical Study

Large Catapults in Momentum Gradient Descent with Warmup: An Empirical Study

Dec 15, 2023

Speakers

Prin Phunyaphibarn

Speaker · 0 followers

Junghyun Lee

Speaker · 0 followers

Bohan Wang

Speaker · 0 followers

About

Although gradient descent with momentum is widely used in modern deep learning, a concrete understanding of its effects on the training trajectory still remains elusive. In this work, we empirically show that momentum gradient descent with a large learning rate and learning rate warmup displays large catapults, driving the iterates towards flatter minima than those found by gradient descent. We then provide empirical evidence and theoretical intuition that the large catapult is caused by momentu…

Organizer

NeurIPS 2023

Account · 645 followers

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

Reference-Based POMDPs

04:58

Reference-Based POMDPs

Watch later

Favorite

Edward Kim, …

NeurIPS 2023 16 months ago

How to Work With Real Humans in Human-AI Systems

2:29:40

How to Work With Real Humans in Human-AI Systems

Watch later

Favorite

Elizabeth Bondi-Kelly, …

NeurIPS 2023 16 months ago

Encoding Human Behavior in Information Design through Deep Learning

04:37

Encoding Human Behavior in Information Design through Deep Learning

Watch later

Favorite

Guanghui Yu, …

NeurIPS 2023 16 months ago

TRIAGE: Characterizing and auditing training data for improved regression

04:39

TRIAGE: Characterizing and auditing training data for improved regression

Watch later

Favorite

Nabeel Seedat, …

NeurIPS 2023 16 months ago

Closing Remarks: Machine Learning in Structural Biology Workshop

05:56

Closing Remarks: Machine Learning in Structural Biology Workshop

Watch later

Favorite

Hannah Wayment-Steele

NeurIPS 2023 16 months ago

DrugImprover: Utilizing Reinforcement Learning for Multi-Objective Alignment in Drug Optimization

07:50

DrugImprover: Utilizing Reinforcement Learning for Multi-Objective Alignment in Drug Optimization

Watch later

Favorite

Xuefeng Liu, …

NeurIPS 2023 16 months ago