Hangbo Bao, Wenhui Wang, Li Dong, Qiang Liu, Owais Khan Mohammed, Kriti Aggarwal, Subhojit Som, Songhao Piao, Furu Wei · VLMO: Unified Vision-Language Pre-Training with Mixture-of-Modality-Experts · SlidesLive

Categories

EN

Log in Talk to sales

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: VLMO: Unified Vision-Language Pre-Training with Mixture-of-Modality-Experts

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v2-stream-004-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v2-stream-004-alpha.b-cdn.net
sl-yoda-v2-stream-004-beta.b-cdn.net
1685195716.rsc.cdn77.org
1239898752.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

VLMO: Unified Vision-Language Pre-Training with Mixture-of-Modality-Experts

VLMO: Unified Vision-Language Pre-Training with Mixture-of-Modality-Experts

Oct 28, 2022

Speakers

Hangbo Bao

Speaker · 0 followers

Wenhui Wang

Speaker · 0 followers

Li Dong

Speaker · 0 followers

About

We present a unified Vision-Language pretrained Model (VLMo) that jointly learns a dual encoder and a fusion encoder with a modular Transformer network. Specifically, we introduce Mixture-of-Modality-Experts (MoME) Transformer, where each block contains a pool of modality-specific experts and a shared self-attention layer. Because of the modeling flexibility of MoME, pretrained VLMo can be fine-tuned as a fusion encoder for vision-language classification tasks, or used as a dual encoder for effi…

Organizer

NeurIPS 2022

Account · 952 followers

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

The Ineffectiveness of Temporal Knowledge Graph Embedding Models in Encoding Real-World Knowledge Graphs

05:05

The Ineffectiveness of Temporal Knowledge Graph Embedding Models in Encoding Real-World Knowledge Graphs

Watch later

Favorite

NeurIPS 2022 2 years ago

On The Fragility of Learned Reward Functions

04:58

On The Fragility of Learned Reward Functions

Watch later

Favorite

Lev McKinney, …

NeurIPS 2022 2 years ago

A Coupled Design of Exploiting Record Similarity for Vertical Federated Learning

04:36

A Coupled Design of Exploiting Record Similarity for Vertical Federated Learning

Watch later

Favorite

Zhaomin Wu, …

NeurIPS 2022 2 years ago

Mentorship Panel

52:40

Mentorship Panel

Watch later

Favorite

Amin Karbasi, …

NeurIPS 2022 2 years ago

When are Local Queries Useful for Robust Learning?

07:03

When are Local Queries Useful for Robust Learning?

Watch later

Favorite

Pascale Gourdeau, …

NeurIPS 2022 2 years ago

Amortized Inference for Causal Structure Learning

04:41

Amortized Inference for Causal Structure Learning

Watch later

Favorite

Lars Lorch, …

NeurIPS 2022 2 years ago