Tanzila Rahman, Mengyu Yang, Leonid Sigal · TriBERT: Full-body Human-centric Audio-visual Representation Learning for Visual Sound Separation · SlidesLive

Categories

EN

Log in Talk to sales

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: TriBERT: Full-body Human-centric Audio-visual Representation Learning for Visual Sound Separation

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v2-stream-010-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v2-stream-010-alpha.b-cdn.net
sl-yoda-v2-stream-010-beta.b-cdn.net
1759419103.rsc.cdn77.org
1016618226.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

TriBERT: Full-body Human-centric Audio-visual Representation Learning for Visual Sound Separation

TriBERT: Full-body Human-centric Audio-visual Representation Learning for Visual Sound Separation

Dec 6, 2021

Speakers

Tanzila Rahman

Speaker · 0 followers

Mengyu Yang

Speaker · 0 followers

Leonid Sigal

Speaker · 0 followers

About

The recent success of transformer models in language, such as BERT, has motivated the use of such architectures for multi-modal feature learning and tasks. However, most multi-modal variants (e.g., ViLBERT) have limited themselves to visual-linguistic data. Relatively few have explored its use in audio-visual modalities, and none, to our knowledge, illustrate them in the context of granular audio-visual detection or segmentation tasks such as sound source separation and localization. In this wor…

Organizer

NeurIPS 2021

Account · 1.9k followers

About NeurIPS 2021

Neural Information Processing Systems (NeurIPS) is a multi-track machine learning and computational neuroscience conference that includes invited talks, demonstrations, symposia and oral and poster presentations of refereed papers. Following the conference, there are workshops which provide a less formal setting.

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

Private Non-smooth ERM and SCO in Sub-quadratic Steps

13:14

Private Non-smooth ERM and SCO in Sub-quadratic Steps

Watch later

Favorite

Janardhan Kulkarni, …

NeurIPS 2021 3 years ago

Generalized Proximal Policy Optimization with Sample Reuse

13:45

Generalized Proximal Policy Optimization with Sample Reuse

Watch later

Favorite

James Queeney, …

NeurIPS 2021 3 years ago

Proximal Causal Inference

23:58

Proximal Causal Inference

Watch later

Favorite

Eric J Tchetgen Tchtgen

NeurIPS 2021 3 years ago

Cutting Down on Prompts and Parameters: Simple Few-Shot Learning with Language Models

05:06

Cutting Down on Prompts and Parameters: Simple Few-Shot Learning with Language Models

Watch later

Favorite

Robert Logan, …

NeurIPS 2021 3 years ago

Hyperparameter Tuning is All You Need for LISTA

15:05

Hyperparameter Tuning is All You Need for LISTA

Watch later

Favorite

Xiaohan Chen, …

NeurIPS 2021 3 years ago

Neural Active Learning with Performance Guarantees

10:43

Neural Active Learning with Performance Guarantees

Watch later

Favorite

Zhilei Wang, …

NeurIPS 2021 3 years ago