Caleb Lu, Zifan Wang, Piotr Mardziel, Anupam Datta · Influence Patterns for Explaining Information Flow in BERT · SlidesLive

Categories

EN

Log in Talk to sales

Next

Livestream will start soon!

Livestream has already ended.

Presentation has not been recorded yet!

SlidesLive

title: Influence Patterns for Explaining Information Flow in BERT

0:00 / 0:00

Report Issue
Settings
Playlists
Bookmarks
Subtitles Off
Playback rate
Quality

Settings
Debug information
Server sl-yoda-v2-stream-003-alpha.b-cdn.net
Subtitles size Medium

Bookmarks

Server
sl-yoda-v2-stream-003-alpha.b-cdn.net
sl-yoda-v2-stream-003-beta.b-cdn.net
1544410162.rsc.cdn77.org
1005514182.rsc.cdn77.org

Subtitles
Off
English

Playback rate

Quality

Subtitles size
Large
Medium
Small

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Influence Patterns for Explaining Information Flow in BERT

Influence Patterns for Explaining Information Flow in BERT

Dec 6, 2021

Speakers

Caleb Lu

Speaker · 0 followers

Zifan Wang

Speaker · 0 followers

Piotr Mardziel

Speaker · 0 followers

About

While attention is all you need may be proving true, we do not know why: attention-based transformer models such as BERT are superior but how information flows from input tokens to output predictions are unclear. We introduce influence patterns, abstractions of sets of paths through a transformer model. Patterns quantify and localize the flow of information to paths passing through a sequence of model nodes. Experimentally, we find that significant portion of information flow in BERT goes throug…

Organizer

NeurIPS 2021

Account · 1.9k followers

About NeurIPS 2021

Neural Information Processing Systems (NeurIPS) is a multi-track machine learning and computational neuroscience conference that includes invited talks, demonstrations, symposia and oral and poster presentations of refereed papers. Following the conference, there are workshops which provide a less formal setting.

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

Noether Networks: Meta-Learning Useful Conserved Quantities

05:00

Noether Networks: Meta-Learning Useful Conserved Quantities

Watch later

Favorite

Ferran Alet, …

NeurIPS 2021 3 years ago

Differentiable Equilibrium Computation with Decision Diagrams for Stackelberg Models of Combinatorial Congestion Games

15:07

Differentiable Equilibrium Computation with Decision Diagrams for Stackelberg Models of Combinatorial Congestion Games

Watch later

Favorite

Shinsaku Sakaue, …

NeurIPS 2021 3 years ago

Distributed Machine Learning with Sparse Heterogeneous Data

08:11

Distributed Machine Learning with Sparse Heterogeneous Data

Watch later

Favorite

Dominic Richards, …

NeurIPS 2021 3 years ago

Panel Discussion 3

1:00:06

Panel Discussion 3

Watch later

Favorite

Taylor Webb, …

NeurIPS 2021 3 years ago

Competition Track Day 1

2:38:08

Competition Track Day 1

Watch later

Favorite

NeurIPS 2021 3 years ago

Scheduling jobs with stochastic holding costs

15:13

Scheduling jobs with stochastic holding costs

Watch later

Favorite

Dabeen Lee, …

NeurIPS 2021 3 years ago