Caleb Lu, Zifan Wang, Piotr Mardziel, Anupam Datta · Influence Patterns for Explaining Information Flow in BERT · SlidesLive

Kategorie

CS

Přihlásit se Kontaktujte nás

Další

Živý přenos začne již brzy!

Živý přenos již skončil.

Prezentace ještě nebyla nahrána!

SlidesLive

title: Influence Patterns for Explaining Information Flow in BERT

0:00 / 0:00

Nahlásit chybu
Nastavení
Playlisty
Záložky
Titulky Off
Rychlost přehrávání
Kvalita

Nastavení
Debug informace
Server sl-yoda-v2-stream-003-alpha.b-cdn.net
Velikost titulků Střední

Záložky

Server
sl-yoda-v2-stream-003-alpha.b-cdn.net
sl-yoda-v2-stream-003-beta.b-cdn.net
1544410162.rsc.cdn77.org
1005514182.rsc.cdn77.org

Titulky
Off
English

Rychlost přehrávání

Kvalita

Velikost titulků
Velké
Střední
Malé

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Influence Patterns for Explaining Information Flow in BERT

Influence Patterns for Explaining Information Flow in BERT

6. prosince 2021

Řečníci

Caleb Lu

Sprecher:in · 0 Follower:innen

Zifan Wang

Sprecher:in · 0 Follower:innen

Piotr Mardziel

Sprecher:in · 0 Follower:innen

O prezentaci

While attention is all you need may be proving true, we do not know why: attention-based transformer models such as BERT are superior but how information flows from input tokens to output predictions are unclear. We introduce influence patterns, abstractions of sets of paths through a transformer model. Patterns quantify and localize the flow of information to paths passing through a sequence of model nodes. Experimentally, we find that significant portion of information flow in BERT goes throug…

Organizátor

NeurIPS 2021

Konto · 1,9k Follower:innen

O organizátorovi (NeurIPS 2021)

Neural Information Processing Systems (NeurIPS) is a multi-track machine learning and computational neuroscience conference that includes invited talks, demonstrations, symposia and oral and poster presentations of refereed papers. Following the conference, there are workshops which provide a less formal setting.

Baví vás formát? Nechte SlidesLive zachytit svou akci!

Profesionální natáčení a streamování po celém světě.

Sdílení

Doporučená videa

Prezentace na podobné téma, kategorii nebo přednášejícího

Optimal Rates for Nonparametric Density Estimation under Communication Constraints

14:22

Optimal Rates for Nonparametric Density Estimation under Communication Constraints

Später ansehen

Favorit

Jayadev Acharya, …

NeurIPS 2021 3 years ago

Distributional Decision Transformer for Offline Hindsight Information Matching

05:33

Distributional Decision Transformer for Offline Hindsight Information Matching

Später ansehen

Favorit

Hiroki Furuta, …

NeurIPS 2021 3 years ago

MAP Propagation Algorithm: Faster Learning with a Team of Reinforcement Learning Agents

13:50

MAP Propagation Algorithm: Faster Learning with a Team of Reinforcement Learning Agents

Später ansehen

Favorit

NeurIPS 2021 3 years ago

Grapher: Multi-Stage Knowledge Graph Construction using Pretrained Language Models

14:31

Grapher: Multi-Stage Knowledge Graph Construction using Pretrained Language Models

Später ansehen

Favorit

Igor Melnyk, …

NeurIPS 2021 3 years ago

Baby Intuitions Benchmark (BIB): Discerning the goals, preferences, and actions of others

14:46

Baby Intuitions Benchmark (BIB): Discerning the goals, preferences, and actions of others

Später ansehen

Favorit

Kanishk Gandhi, …

NeurIPS 2021 3 years ago

Consistent Accelerated Inference via Confident Adaptive Transformers

05:05

Consistent Accelerated Inference via Confident Adaptive Transformers

Später ansehen

Favorit

Tal Schuster, …

NeurIPS 2021 3 years ago