Dec 6, 2022
Despite the remarkable success of pre-trained language models (PLMs), they still face two challenges. First, large-scale PLMs are inefficient in terms of memory footprint and computation. Second, on downstream tasks, PLMs tend to rely on dataset bias and struggle to generalize to out-of-distribution (OOD) data. In response to the efficiency problem, recent studies show that dense PLMs can be replaced with sparse subnetworks without hurting performance. Such subnetworks can be found in three scenarios: 1) in fine-tuned PLMs, 2) in raw PLMs that are then fine-tuned in isolation, and even 3) in PLMs without any parameter fine-tuning. However, these results were only obtained in the in-distribution (ID) setting. In this paper, we extend the study of PLM subnetworks to the OOD setting, investigating whether sparsity and robustness to dataset bias can be achieved simultaneously. To this end, we conduct extensive experiments with the pre-trained BERT model on three natural language understanding (NLU) tasks. Our results demonstrate that sparse and robust subnetworks (SRNets) can consistently be found in BERT, across the aforementioned three scenarios, using different training and compression methods. Furthermore, we explore the upper bound of SRNets using the OOD information and show that there exist sparse and almost unbiased BERT subnetworks. Finally, we refine the SRNet search process in terms of efficiency and performance, addressing: 1) the appropriate time to start searching for SRNets during full BERT fine-tuning, and 2) how to identify SRNets at high sparsity. Our code will be released upon publication.
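As a concrete illustration of the compression step underlying such subnetwork searches, here is a minimal NumPy sketch of unstructured magnitude pruning, a common criterion in lottery-ticket-style studies: the smallest-magnitude weights are masked out at a target sparsity. The function name, matrix shape, and sparsity level are illustrative assumptions, not the paper's exact search procedure (which the abstract only describes as "different training and compression methods").

```python
import numpy as np

def magnitude_prune(weights: np.ndarray, sparsity: float) -> np.ndarray:
    """Return a binary mask that keeps the largest-magnitude weights.

    Zeroes out the `sparsity` fraction of entries with the smallest
    absolute values -- the standard unstructured magnitude-pruning
    criterion used to extract sparse subnetworks from dense models.
    """
    k = int(sparsity * weights.size)  # number of entries to prune
    if k == 0:
        return np.ones(weights.shape, dtype=bool)
    # Threshold = k-th smallest absolute value; entries at or below it are pruned.
    threshold = np.partition(np.abs(weights).ravel(), k - 1)[k - 1]
    return np.abs(weights) > threshold

# Illustrative example: prune one 768x768 matrix (the hidden size of
# BERT-base) to 70% sparsity.
rng = np.random.default_rng(0)
w = rng.standard_normal((768, 768))
mask = magnitude_prune(w, sparsity=0.7)
w_sparse = w * mask  # the surviving subnetwork's weights
```

In practice, a mask like this is computed per weight matrix (or globally across the model) and then either applied after fine-tuning, used to rewind and re-train the surviving weights in isolation, or trained directly while the weights stay frozen, mirroring the three scenarios described above.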