David Dov, Serge Assaad, Shijing Si, Rui Wang, Hongteng Xu, Shahar Kovalsky, Jonathan Bell, Danielle Elliott Range, Jonathan Cohen, Ricardo Henao, Lawrence Carin · Affinitention Nets: Kernel Perspective on Attention Architectures for Set Classification with Applications to Medical Text and Images · SlidesLive

Categories

EN

Log in Get an estimate

Affinitention Nets: Kernel Perspective on Attention Architectures for Set Classification with Applications to Medical Text and Images

Apr 8, 2021

Speakers

About

Set classification is the task of predicting a single label from a set comprising multiple instances. The examples we consider are pathology slides represented by sets of patches and medical text represented by sets of word embeddings. State of the art methods, such as the transformers, typically use attention mechanisms to learn representations of set-data by modeling interactions between instances of the set. These methods, however, have complex heuristic architectures comprising multiple heads and layers. The complexity of attention architectures hampers their training when only a small number of labeled sets is available, as is often the case in medical applications. To address this problem, we present a kernel-based representation learning framework that associates between learning affinity kernels to learning representations from attention architectures. We show that learning a combination of the sum and the product of kernels is equivalent to learning representations from multi-head multi-layer attention architectures. From our framework, we devise a simplified attention architecture which we term \emph{affinitention} (affinity-attention) nets. We demonstrate the application of affinitention nets to the classification of Set-Cifar10 dataset, thyroid malignancy prediction from pathology slides, as well as patient text message-triage. We show that affinitention nets provide competitive results compared to heuristic attention architectures and outperform other competing methods.

Organizer

Categories

About AHLI CHIL

The ACM Conference on Health, Inference, and Learning (CHIL), targets a cross-disciplinary representation of clinicians and researchers (from industry and academia) in machine learning, health policy, causality, fairness, and other related areas.

Store presentation

Should this presentation be stored for 1000 years?

How do we store presentations

Sharing

Recommended Videos

Presentations on similar topic, category or speaker