Bo Li, Yifei Shen, Jingkang Yang, Yezhen Wang, Jiawei Ren, Tong Che, Jun Zhang, Ziwei Liu · Sparse Mixture-of-Experts are Domain Generalizable Learners · SlidesLive

Kategorien

DE

Anmelden Vertrieb kontaktieren

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: Sparse Mixture-of-Experts are Domain Generalizable Learners

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v2-stream-006-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v2-stream-006-alpha.b-cdn.net
sl-yoda-v2-stream-006-beta.b-cdn.net
1549480416.rsc.cdn77.org
1102696603.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Sparse Mixture-of-Experts are Domain Generalizable Learners

Sparse Mixture-of-Experts are Domain Generalizable Learners

Dez 2, 2022

Sprecher:innen

Bo Li

Speaker · 0 followers

Yifei Shen

Speaker · 0 followers

Jingkang Yang

Speaker · 0 followers

Über

In domain generalization (DG), most existing methods focused on the loss function design. This paper proposes to explore an orthogonal direction, i.e., the design of the backbone architecture. It is motivated by an empirical finding that transformer-based models trained with empirical risk minimization (ERM) outperform CNN-based models employing state-of-the-art (SOTA) DG algorithms on multiple DG datasets. We develop a formal framework to characterize a network's robustness to distribution shif…

Organisator

NeurIPS 2022

Account · 962 followers

Gefällt euch das Format? Vertraut auf SlidesLive, um euer nächstes Event festzuhalten!

Professionelle Aufzeichnung und Livestreaming – weltweit.

Freigeben

Empfohlene Videos

Präsentationen, deren Thema, Kategorie oder Sprecher:in ähnlich sind

Learning Invariant Representations under General Interventions on the Response

07:41

Learning Invariant Representations under General Interventions on the Response

Watch later

Favorite

NeurIPS 2022 2 years ago

PALBERT: Teaching ALBERT to Ponder

01:12

PALBERT: Teaching ALBERT to Ponder

Watch later

Favorite

Nikita Balagansky, …

NeurIPS 2022 2 years ago

DyREx: Dynamic Query Representation for Extractive Question Answering

05:15

DyREx: Dynamic Query Representation for Extractive Question Answering

Watch later

Favorite

Urchade Zaratiana, …

NeurIPS 2022 2 years ago

Memory safe computations with XLA compiler

03:21

Memory safe computations with XLA compiler

Watch later

Favorite

Artem Artemev, …

NeurIPS 2022 2 years ago

AMP: Automatically Finding Model Parallel Strategies with Heterogeneity Awareness

04:52

AMP: Automatically Finding Model Parallel Strategies with Heterogeneity Awareness

Watch later

Favorite

Dacheng Li, …

NeurIPS 2022 2 years ago

[Re] GANSpace: Discovering Interpretable GAN Controls

04:10

[Re] GANSpace: Discovering Interpretable GAN Controls

Watch later

Favorite

Vishnu Asutosh Dasu, …

NeurIPS 2022 2 years ago